Manager | DevOps Engineering | Information Technology in New York, NY
The goal of Information Technology (IT) is to provide information resources and services to accelerate and support scientific research at The Rockefeller University. Our department is responsible for cyberinfrastructure, high-performance computing, bioinformatics, decision support, data analytics, cybersecurity, and the scientific and administrative software environment at the university. The department is in the midst of a major transition in culture, including the adoption of DevOps and Agile practices.
We seek a Cloud and Container Engineering Manager/Architect passionate about infrastructure-as-code and all things automation. You will help design, build, and maintain the tooling for configuration management, automated provisioning, and monitoring, and help evangelize best practices in DevOps and technical innovation. You will advocate for a DevOps and Agile approach in product development and demonstrate best practices by example.
Maintain and advance our real-time performance monitoring system (checkmk)
Maintain our GitLab repository and CI/CD infrastructure
Maintain and advance our on-prem and cloud-hosted Docker clusters
Lead improvements in the processes for building and deployment of servers (bare metal, VM, and cloud) and/or applications
Ensure rigor and reproducibility
Assist in maintaining and improving our Ansible-based configuration management system; promote DevOps practices and culture via mentoring across engineering, business and scientific teams to drive technical innovation
Weigh in, educate, and advocate for scalable and maintainable architectural decisions
Bachelor's degree, required; advanced graduate degree preferred, or equivalent experience in computer science, applied mathematics, engineering, or a related technical field, required. Must have 5 plus years of demonstrated experience using Linux operating system, working knowledge of provisioning and containerization technologies (e.g., Docker, Kubernetes, Swarm), and experience managing CI/CD environment. Ability to build and maintain code repositories with Git, required. Experience with large-scale monitoring and reporting tools (e.g. Nagios, checkmk, ELK, Splunk, New Relic, Grafana), required. Familiarity with virtualization technologies and related tools, network principles, and protocols (e.g. VirtualBox, VMware, KVM, Vagrant, Packer.io, TCP/IP, SSH, HTTP[S], DNS, DHCP, NAT, SNMP, PXE) necessary. Excellent organizational skills, prioritize management, Problem-solving skills, and interpersonal skills, are required. Must have the ability to coordinate both short and long-term projects, build strong relationships across teams. Must have a strong sense of ownership and accountability. Working knowledge of best practices for teaching, mentoring others and advocating fortechnical innovation in development operations, preferred. Experience building and installing open-source software with complex dependencies, strong shell scripting, and programming skills for systems administration purposes (two or more of BASH, Python, Ruby, Perl, or similar), required. Working knowledge of web-based application development (e.g., Rails, Angular, NodeJS), hosting technologies (e.g., Apache, Nginx), working knowledge of cloud infrastructures (e.g., AWS, Azure, Google Compute Engine), understanding of Agile/Kanban techniques, preferred. Experience building and installing open-source software with complex dependencies required. A positive, constructive approach with an emphasis on collaboration and good execution, necessary.
The Rockefeller University is an Equal Opportunity Employer - Minorities/Women/Disabled/Veterans
Compensation Range: Min
Compensation Range: Max
Job Locations US-NY-New York
Category Information Technology
Position Type Regular Full-Time