NVIDIA

AI Computing Development Engineer, TensorRT and TensorRT-LLM

Mid-level, full-time, Shanghai

First indexed 29 May 2026

Description

Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines.

Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms.

Perform performance analysis, optimization, and tuning of deep learning inference workloads.

Track academic and industry advancements in AI and integrate them into TensorRT/TensorRT-LLM as new features.

Provide feedback on architecture and hardware design and development.

Collaborate across hardware, software, and research teams to shape the direction of machine learning inferencing across NVIDIA platforms.

Own and deliver technical work whose scope depends on experience, ranging from complex features to substantial parts of larger projects, with increasing independence and technical leadership over time.

Publish key technical results at leading scientific and engineering conferences.