Description

Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines.

Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms.

Perform performance analysis, optimization, and tuning of deep learning inference workloads.

Track academic and industry advancements in AI and integrate them into TensorRT/TensorRT-LLM as new or updated features.

Provide feedback on architecture and hardware design and development.

Collaborate across hardware, software, and research teams to shape the direction of machine learning inferencing across NVIDIA platforms.

Own and deliver technical work with scope based on experience, ranging from complex features to substantial parts of larger projects, with increasing independence and technical leadership over time.

Publish key technical results at leading scientific and engineering conferences.

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/China-Shanghai/AI-Computing-Development-Engineer--TensorRT-and-TensorRT-LLM_JR2018666