In this role, the Senior SRE manager will be leading the SRE functions with a diverse team of systems engineers in close collaboration with SRE teams to transform global infrastructure planning, design, delivery, and operations to support NVIDIA’s growth. The successful candidate will be able to demonstrate in-depth system knowledge across several technical domains like Unix/Linux, Windows, Networking, Virtualization, and have a track record of leading teams that deliver infrastructure at scale. Ability to transform designs ground up and lead innovation in system design. Structured thinking and problem-solving skills, along with exceptional communication abilities will be crucial for success in this role as you build strong teams that partner with engineering and operations teams across NVIDIA.
What you’ll be doing:
Lead and grow a large team of systems engineers and SRE developing and maintaining key infrastructure services used across NVIDIA.
Transform teams of systems engineers into SRE teams.
Build roadmaps for the next generation of infrastructure at Nvidia IT. Work with NVIDIA leadership, senior engineers, program managers, and product managers to develop and transform IT infrastructure into compelling products and services.
Manage capacity planning with capex and opx of >5 million of dollars annually.
What we need to see:
BS in Engineering, CS (or equivalent experience)
10+ overall years of industry experience & 4+ years of experience in infrastructure design and architectural leadership
Deep understanding of Linux, LDAP, Tools, Virtualization & Config Management in a large linux based engineering environment. Experience working in a hybrid on-prem and public cloud environment.
Experience with Automation and Monitoring platforms, DevOPS and AIOPS.
Deep understanding of Network services like DNS, AD..
Experience recruiting and managing engineering teams, including performance management
Collaborative approach and strong problem-solving skills to work with leaders across organizational and technical domains
Excellent written and verbal social skills to manage effectively across cultures and geographies
Implementing reliable and scalable automation solutions using out of box thinking, delivering high quality end user experiences.
Ways to stand out from the crowd:
Demonstrated ability to deliver exponential features with improved reliability.
Track record with delivering high availability in complex, global environments
Demonstrated experience with SRE and Software development.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
The base salary range is 196,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.