We are now looking for a Senior Software Engineer for Quantized Training. We are a team committed to developing next-generation quantized training recipes for Hopper and future GPUs. We are seeking software engineers to help rethink and create tailored solutions to accelerate the discovery of new recipes. This is a coding-heavy role focused on building infrastructure, tooling, and visualizations.
The candidate's work directly supports NVIDIA's production SW systems including Megatron-LM and Transformer Engine. The candidate will be part of a core team of engineers and researchers working in lock step to improve quantized training convergence and efficiency.
What you'll be doing:
Create well-tested SW systems and PoCs in support of quantized training
Build visualization tools to track and assess the health of model training
Benchmark internal and external methods for quantized training
Build an insights platform for tracking model metrics and benchmarks
Architect CI/CD systems for versioning training recipes
Participate in code reviews
What we need to see:
A Masters Degree or PhD or meaningful equivalent experience in Computer Science/Computer Engineering or a related field.
5+ years of relevant software development experience.
Strong software engineering background with a focus on building concise and well-tested code in C++ and Python
Experience working with ML accelerators and PyTorch or similar frameworks
Good foundation in ML training and quantization
Strong written and oral communication skills
Ways to stand out from the crowd:
Experience with CUDA, performance optimization and debugging
Proficient in precision and numerics for ML
GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.
Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. Do you love the challenge of influencing the long-term opportunities that expand NVIDIA’s impact on the datacenter and beyond? If so, we want to hear from you!
The base salary range is 180,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.