Senior Software Engineer, Machine Learning Inference

NVIDIA · Santa Clara, CA, United States

Location
Santa Clara
Job Type
Full-time
Posted
May 30, 2026

Job Description

At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world’s most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.


As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications on NVIDIA GPUs. If you're ready to take on challenging projects and make a significant impact in a company that values creativity, excellence, and collaboration, we want to hear from you!


What you’ll be doing:
+ Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs.
+ Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative A...

Ready to Apply?

Submit your application for Senior Software Engineer, Machine Learning Inference at NVIDIA

Apply Now