NVIDIA

Senior Failure Analysis Engineer

Senior, full-time, Santa Clara, CA

First indexed 18 Jun 2026

Description

NVIDIA is seeking a software-focused Senior Failure Analysis Engineer who can combine hands-on software development with ownership of production support.

This hybrid role sits at the intersection of software engineering, data infrastructure, semiconductor development, and production tooling, building and sustaining the intelligent platforms and workflows that power failure analysis, debug, and engineering insight at scale.

You will own the reliability and continuous improvement of production-critical FA systems (databases, CAD navigation tools, and analysis platforms) while partnering with failure analysis, design, verification, CAD, infrastructure, and manufacturing teams.

Responsibilities:

  • Own the reliability, performance, and continuous improvement of production-critical systems, including databases, CAD navigation tools, and failure analysis platforms, ensuring high availability and responsiveness for semiconductor engineering and manufacturing teams.
  • Design and deliver scalable automation frameworks, data pipelines, and intelligent workflows that streamline semiconductor engineering, failure analysis, and production support processes at scale.
  • Build advanced analytics platforms, dashboards, and orchestration systems that turn engineering and production data into clear, actionable insight for faster debug and better decision-making.
  • Apply AI, machine learning, and optimization techniques to reduce manual effort, accelerate root-cause analysis, and strengthen both engineering and production workflows.
  • Partner closely with failure analysis, design, verification, CAD, infrastructure, and production collaborators to deliver reliable, maintainable, and high-impact technical solutions.
  • Drive continuous improvement in software quality, usability, performance, and operational excellence across large-scale compute, data, and production environments.

Requirements:

  • BS or MS in Electrical Engineering, Computer Engineering, Computer Science, or a related technical field, or equivalent experience.
  • 8+ years of professional experience in software engineering, electrical engineering, or semiconductor development/production environments.
  • Strong proficiency in Python, Rust, Shell scripting, or similar languages for building robust automation, tooling, and production systems.
  • Proven track record designing automation frameworks, data-processing systems, or productivity tools with measurable engineering or production impact.
  • Solid experience in Linux environments and modern software engineering practices (version control, testing, CI/CD, observability).
  • Exceptional analytical and problem-solving skills, with a record of success navigating complex, multidisciplinary technical and production challenges.
  • Strong collaboration and communication skills, with proven effectiveness across cross-functional engineering and production teams.

Benefits:

  • Competitive salaries
  • Generous benefits package
  • Equity eligibility

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Senior-Failure-Analysis-Engineer_JR2019318