NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for 30 years. It’s a unique legacy of innovation that’s fueled by phenomenal technology and outstanding people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAn, you’ll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
As a Sr. Staff Engineer, you will drive the design and development of scalable cloud platforms across multiple public cloud providers. Your responsibility will also include operating our existing infrastructure to the highest level of reliability and security. The ideal candidate will play a key role in designing, implementing, and maintaining reliable, scalable, and secure cloud-based solutions. As a Sr. Staff SRE, you will collaborate with software engineers, architects, and operations teams to ensure high availability, performance, and reliability of our cloud services and infrastructure. You will work side by side with NVIDIA engineers around the world, as an equal member of a team of high achievers. You will use modern software and agile approach to iteratively releasing software components to automate existing services in a secure and sustainable manner.
What you will be doing:
Leading Cloud Platform Engineering initiatives, including developing, designing, automating, improving and sustaining our standard platform services (on-premises and cloud.)
Developing and implementing cloud systems and architectures on AWS, OCI, Azure, and GCP.
Designing and implementing monitoring and alerting strategies to ensure uptime and reduce MTTD.
Automating manual processes to improve efficiency and reduce human error.
Practicing Agile development methodologies with iterative releases of fully functional solutions, including remediation of existing tech debt.
Mentor and up-skill engineering peers and colleagues in the operational organization.
Collaborating with other engineering teams across NVIDIA to drive the execution of meaningful products for the business.
Help in continuously setting the standard of our code quality and infrastructure design.
Participate in hiring across the organization.
What we need to see:
Bachelor's and/or Master's or equivalent experience in Computer Science or related field of study.
8+ years of experience in Software Development and/or Site Reliability Engineering/Production Engineering.
Strong software development using Python.
Experience with multiple cloud service providers (at least two): AWS, OCI, Azure and/or GCP.
Infrastructure as Code (IaC) automation experience using Terraform CDK or a similar technology.
Source code management: GitLab, GitHub.
Strong systems engineering background in Linux or Windows.
Proficiency in simple yet efficient systems design.
Strong understanding of network design and architecture.
Strong communication skills with the ability to understand and explain technical issues to a non-technical audience.
Ways to stand out from the crowd:
Experience deploying and operating Kubernetes clusters.
Scaling and managing distributed systems.
Significant experience with monitoring and observability platforms such as Datadog.
Comprehensive understanding of web applications security.
Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package.
The base salary range is 164,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.