New The Skills of Tomorrow: how AI-exposed is every skill in 2026? See the data →
NVIDIA

Principal Release Infrastructure Architect

NVIDIA
Apply →
onsite senior full-time Competitive salary and benefits package Santa Clara

First indexed 18 May 2026

Description

Join NVIDIA, where we're powering the future of AI and computing! As a Principal Architect on our powerful team in Santa Clara, CA, you will manage the architecture of our critical release management platform. This role offers a groundbreaking chance to define the technical vision for our modern infrastructure while working with hardworking and versatile professionals in the industry.

You will be responsible for managing the architecture of a full-stack release management platform, advancing it to accommodate multi-tenant, multi-environment systems across multiple hardware platforms. You will craft hierarchical domain models and state machines that manage complex lifecycles and multi-axis promotion flows. You will also architect robust ingestion and reconciliation pipelines, ensuring data fidelity and compliance across various representations.

In addition, you will define and integrate a comprehensive validation, promotion, and gating model with our automated sanity stages and customer-release pipelines. You will establish a strict separation between authoring and production environments, ensuring data integrity and sanitization. You will set standards for API build, authentication, RBAC, audit logging, and observability across the platform.

You will lead frontend architecture for a sophisticated authoring and review experience, handling complex tabular editing and bulk operations. You will drive platform onboarding workflows, defining consistent naming schemes, validation rules, and notification topologies for new hardware platforms.

You will mentor engineers, conduct architecture reviews, and raise the technical bar across multiple engineering domains. You will partner with product, TPM, release managers, and other collaborators to align on roadmaps, capacity, and operational ownership.

You will define and manage deployment, rollout, and incident-response models, including database migration strategies and clear on-call runbooks. You will continuously evaluate and incorporate emerging tools, frameworks, and patterns to improve the platform's capabilities.

BS, MS, or PhD or equivalent experience in Computer Science or a related field is required. Candidates must have over 15 years of hands-on software engineering experience. At least 5 of those years should be in a senior technical leadership role.

Proven experience leading large-scale, multi-year, full-stack platform projects from inception to production is necessary. Expertise in Python or a modern backend stack, with production-grade experience in PostgreSQL and complex relational data modeling, is likewise required.

Strong frontend architecture skills with React, Angular, and TypeScript, passionate about data-heavy UIs, are also necessary. Demonstrated success in crafting and implementing state machines, workflow engines, or lifecycle systems is required.

Solid background in CI/CD orchestration, event-triggered integrations, and background-job systems is necessary. Strong API development field, with experience in REST contracts, versioning, and automation-friendly interfaces, is also required.

Proficiency in Linux, containerization, and managing stateful production services is necessary. Ability to lead through influence, driving architectural decisions across multiple teams and building consensus, is also required.

Excellent communication skills, translating complex business requirements into detailed technical solutions, are necessary.