# Senior Software Engineer - NIM Platform SDK and Framework

**Company**: NVIDIA
**Location**: Santa Clara
**Work arrangement**: onsite
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology

**Apply**: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Senior-Software-Engineer---NIM-Platform-SDK-and-Framework_JR2015499?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_c39db33f-1d0

## Description

We seek a senior engineer to own and evolve the core NIM Platform SDK and microservice framework that powers NVIDIA Inference Microservices (NIM). The ideal candidate has deep systems engineering skills and a passion for building the foundational platform libraries that let multiple NIM modalities deliver production-ready AI inference at scale.

This is a hands-on, deeply technical role for someone who thrives on building core platforms that scale. The role involves solving hard software engineering challenges in high-performance systems programming, multi-cloud abstractions, and API framework development, collaborating across NIM product teams, and delivering production-grade software that supports NVIDIA and the wider AI ecosystem.

**Key Responsibilities:**

- Develop and advance the inference microservice framework: OpenAI-compatible API endpoints, inference backend integrations (vLLM, SGLang, TensorRT-LLM, Dynamo), middleware, observability instrumentation, and production hardening across cloud, on-prem, and Kubernetes environments.

- Architect significant new features in open-source codebases, shepherding them through project acceptance and into production.

- Build and optimize high-performance model download and caching pipelines across multiple cloud storage backends (NGC, HuggingFace, S3, GCS), including parallel transfers, integrity verification, and seamless multi-cloud operability.

- Implement the model profile and manifest system that ensures NIMs are optimized for every NVIDIA GPU platform, covering profile selection, validation, and multi-GPU configuration.

- Develop and refine cloud microservice patterns (service discovery, health checking, graceful degradation, API gateway integration, and end-to-end request lifecycle management) so that NIMs operate reliably at scale in diverse cloud deployment environments.

- Set the standard for high-quality code across Python, Rust, and C/C++, and demonstrate best practices in test-driven development, agentic AI-assisted development, code review, and cross-team collaboration.

- Mentor teammates and establish high engineering standards for container quality, security, and operability.

**Requirements:**

- BS or MS in Computer Science, Computer Engineering, or a related field (or equivalent experience).

- 8+ years of demonstrated experience developing performant microservices, cloud software, and/or platform infrastructure.

- Deep technical expertise in cloud-native microservice architecture, including service mesh, API gateways, load balancing, and distributed system build patterns.

- Expertise in high-performance data pipelines with parallel I/O, caching strategies, and integrity verification across distributed storage systems.

- Solid understanding of containerized application delivery using technologies such as Docker, Kubernetes, and Helm.

- Understanding of application security principles, including secure coding practices, vulnerability mitigation, secrets management, and supply chain integrity for containerized environments.

- Strong problem-solving skills grounded in first-principles reasoning and critical analysis.

- Excellent programming skills in Python and Rust, with strong foundations in algorithms, development patterns, and software engineering principles.

**Nice to Have:**

- Direct involvement in open-source inference backends such as vLLM, TensorRT-LLM, or SGLang.

- Direct involvement in disaggregated serving frameworks like NVIDIA Dynamo.

- Experience building and operating production microservices at scale.

- Deep knowledge of multi-cloud deployment strategies across AWS, GCP, Azure, and OCI.

- Experience operating in regulated, air-gapped, or disconnected environments where strict security and compliance controls are required.

## Skills

### Required
- cloud-native microservice architecture
- service mesh
- API gateways
- load balancing
- distributed system build patterns
- high-performance data pipelines
- parallel I/O
- caching strategies
- integrity verification
- containerized application delivery
- Docker
- Kubernetes
- Helm
- application security principles
- secure coding practices
- vulnerability mitigation
- secrets management
- supply chain integrity
- Python
- Rust
- algorithms
- development patterns
- software engineering principles

### Nice to have
- open-source inference backends
- disaggregated serving frameworks
- multi-cloud deployment strategies
- regulated environments

