Description

We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture.

Your key responsibilities will include:

Innovating and developing new AI systems technologies for efficient inference
Designing, implementing, and optimising kernels for high-impact AI workloads
Designing and implementing extensible abstractions for LLM serving engines
Building efficient just-in-time domain-specific compilers and runtimes
Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
Contributing to open-source communities like FlashInfer, vLLM, and SGLang

You will also be eligible for equity and benefits.

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Senior-AI-Software-Engineer--Kernel-Libraries_JR2014705