Description
We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture.
Your key responsibilities will include:
- Innovating and developing new AI systems technologies for efficient inference
- Designing, implementing, and optimising kernels for high-impact AI workloads
- Designing and implementing extensible abstractions for LLM serving engines
- Building efficient just-in-time domain-specific compilers and runtimes
- Collaborating closely with other engineers at NVIDIA across the deep learning frameworks, libraries, kernels, and GPU architecture teams
- Contributing to open-source communities like FlashInfer, vLLM, and SGLang
You will also be eligible for equity and benefits.
To apply, use the employer's original posting:
https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Senior-AI-Software-Engineer--Kernel-Libraries_JR2014705