NVIDIA is searching for several outstanding large language model and foundation model research interns to join our Research team. We are passionate about research that pushes boundaries but also has impact in the real world. You will be part of an amazing collaborative research team that consistently publishes at the top venues in machine learning and systems. Our existing expertise includes large language model, generative models, and so forth. Your contributions have the chance to create real impact on our products.
What you'll be doing:
Research, design and implement novel large language model and foundation models
Perform model optimization, compression and acceleration
Publish original research
Collaborate with other team members and teams
Speak at conferences and events
Transfer technology to product groups
Collaborate with external researchers
What we need to see:
Currently pursuing a Ph.D. in Computer Science/Engineering, Electrical Engineering
Strong background of theory and practice of LLM and foundation model, as well as deep learning
Excellent knowledge of theory and practice of model compression and acceleration techniques
Excellent programming skills in some rapid prototyping environment such as Python; C++ and parallel programming (e.g., CUDA) is a plus
Knowledge of common machine learning frameworks, such as PyTorch
Outstanding research track record, very good publication record
Excellent communication skills
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and productive people in the world working for us. If you're creative and autonomous, we want to hear from you!