# Principal Silicon Failure Analysis Engineer

**Company**: NVIDIA
**Location**: Santa Clara
**Work arrangement**: onsite
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology

**Apply**: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Principal-Silicon-Failure-Analysis-Engineer_JR2018430?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_0c0bc638-4df

## Description

We are seeking a Principal Failure Analysis Engineer to lead Silicon Failure Analysis (SiFA) Lab Infrastructure, responsible for enabling a high-availability, safe, and scalable failure analysis environment.

This role leads the lab framework including facilities, utilities, tool enablement, safety, access control, and operational readiness so that Fault Isolation (FI), Physical Failure Analysis (PFA), and Supplier Quality Engineering (SQE) teams can efficiently root cause our groundbreaking semiconductor products.

The role partners closely with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment vendors to ensure reliable, secure, and scalable lab operations aligned with NVIDIA's technology roadmap.

Responsibilities:

- Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure, ensuring a safe, highly available, and scalable environment that enables FI, PFA, and SQE teams to efficiently root-cause advanced semiconductor issues

- Own day-to-day lab operations and infrastructure readiness, serving as the primary point of accountability for availability, reliability, and rapid resolution of infrastructure issues impacting failure analysis operations

- Manage lab facilities and utilities including power, backup power, cooling water, DI/PCW, exhaust, vacuum, CDA, nitrogen, and specialty gases, coordinating upgrades, maintenance, outages, and construction to minimize disruption

- Drive failure analysis tool enablement and reliability from delivery through sustained operation, ensuring preventive maintenance and improving uptime, availability, MTBF, MTTR, and PM compliance

- Lead vendor and cross-functional partnerships with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment suppliers to reduce downtime and ensure operational resilience

- Own consumables, inventory, and asset management including gases, chemicals, PPE, and materials, with accurate tracking of inventory, asset lifecycle status, and preventative maintenance schedules

- Ensure safety, chemical, ESD and regulatory governance by partnering with Corporate EHS, maintaining lab operating specifications, auditing access controls, and enforcing training, certification, safety, and IP protection requirements

- Define and execute the long-term SiFA lab infrastructure roadmap, leading multi-year planning and phased expansion to support future silicon nodes, advanced packaging technologies, and increasing system complexity

Benefits:

- Competitive salaries

- Generous benefits package

- Eligible for equity

Requirements:

- Bachelor's degree or higher in Engineering or a related technical field or equivalent experience

- 15+ overall years of experience in semiconductor, R&D, or high-precision lab infrastructure

- Demonstrated experience with capital equipment enablement, facilities coordination, and vendor management

- Strong multi-functional leadership, communication, and execution skills

Preferred Qualifications:

- Demonstrated end-to-end ownership of high-availability failure analysis labs to resolve product yield, performance, reliability, and quality issues

- Proven experience enabling and sustaining complex capital tools with metric-driven reliability improvements

- Achieved rigorous safety/compliance governance while delivering on a multi-year scaling roadmap to meet the demands of the latest silicon, packaging, and system challenges

## Skills

### Required
- silicon failure analysis
- lab infrastructure
- facilities management
- vendor management
- capital equipment enablement
- preventive maintenance
- safety governance
- regulatory compliance

---

Source: [Apply at nvidia.wd5.myworkdayjobs.com](https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Principal-Silicon-Failure-Analysis-Engineer_JR2018430?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply)
