# Manager, Distinguished Engineer - DGX Systems Software

**Company**: NVIDIA
**Location**: Santa Clara
**Work arrangement**: onsite
**Experience**: senior
**Job type**: full-time
**Salary**: $150,000–$250,000
**Category**: Engineering
**Industry**: Technology

**Apply**: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Manager--Distinguished-Engineer---DGX-Systems-Software_JR2016463?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_5800c4ac-4a3

## Description

We are seeking an engineering leader responsible for end-to-end delivery of every DGX compute system,from firmware through the AI stack to customer deployment. You will ensure each DGX product ships as a production-ready system where firmware, OS, drivers, CUDA, networking, and AI applications work together seamlessly, while driving architecture and roadmap for next-generation platforms.

Key responsibilities include:

- End-to-End Stack Readiness: Ensure every DGX platform is ready for the full NVIDIA software stack,firmware, DGX OS, GPU drivers, CUDA toolkit, DCGM, DOCA/OFED, and management tools,as a validated, production-quality product.

- Platform Firmware Development: Lead development of the manageability firmware stack (BMC, BIOS) for all DGX platforms.

- Validation Strategy: Define validation strategy proving each DGX platform is production-ready: end-to-end system validation including firmware regression, NVQual certification, DL workload performance, OS/CUDA stack testing, multi-user scenarios, power/thermal validation, and field upgrade reliability.

- Platform Bring-Up & Architecture: Drive platform bring-up for each new DGX system,coordinating first boot across new silicon (CPU, GPU), board design, and firmware teams.

- Customer Deployment & Enablement: Ensure firmware release flows meet CSP and enterprise deployment requirements.

- Product Delivery Lifecycle: Own the complete DGX delivery lifecycle,system architecture, firmware development, integration, full-stack validation, GA release, and customer deployment,for every DGX product.

- Cross-Org Alignment: Serve as single point of accountability for DGX platform readiness across NVIDIA,aligning GPU, CPU, networking, security, OS, and AI software teams to deliver on schedule.

- Quality & Vendor Management: Own RCCA processes for field issues. Manage external vendor partnerships (AMI for SBIOS, BMC contributors) with clear quality gates and program tracking.

- Team Leadership: Build and lead a world-class engineering organization. Mentor and develop leaders. Foster a culture of technical excellence, intellectual honesty, and customer obsession.

## Skills

### Required
- server system stack
- SBIOS
- BMC
- OS
- applications
- system-level integration
- complex multi-component products
- firmware validation
- full-stack system testing
- field deployment
- end-to-end product quality
- engineering organizations
- matrix environment
- server hardware
- CPU
- GPU
- interconnect
- memory
- PCIe
- power delivery

### Nice to have
- NVIDIA DGX
- GPU-accelerated server platforms
- DMTF Redfish
- OCP standards
- server manageability ecosystems
- AI/DL workload validation
- performance optimization

---

Source: [Apply at nvidia.wd5.myworkdayjobs.com](https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Manager--Distinguished-Engineer---DGX-Systems-Software_JR2016463?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply)
