Description
We're hiring a Senior Staff Software Engineer to own the engineering efforts across NVIDIA enterprise systems. You'll partner with IT leadership to transform reactive support into strategic, AI-infused automated resolution systems and prevent problems before they occur, balancing speed, security, and an exceptional user experience for NVIDIA.
Key Responsibilities:
- Design and implement agentic AI workflows using LLM-based agents, tool calling, RAG patterns, and orchestration frameworks.
- Build robust integrations and automation pipelines across ServiceNow, identity management, monitoring platforms, and enterprise SaaS.
- Triage and resolve enterprise issues with a focus on automation and on reducing mitigation and resolution times.
- Manage and troubleshoot enterprise-scale collaboration, productivity, AI, and infrastructure systems.
- Trace and root-cause complex, multi-system failures; identify patterns in recurring tickets and build automation or self-service solutions to address them.
- Build and maintain runbooks, troubleshooting guides, and knowledge base articles that elevate team capabilities.
- Mentor team members on troubleshooting methodology and systems thinking.
Requirements:
- Bachelor's or Master's degree in Computer Science, Engineering, IT, or a related field (or equivalent experience).
- 12+ years of overall experience in SRE, enterprise support, or DevOps.
- Experience with SaaS, hybrid cloud, and AI/ML environments.
- Experience building production-grade agentic workflows (e.g., multi-agent systems and MCP servers).
- Solid software engineering fundamentals, with deep experience building products and operating large-scale systems.
- Expertise in two or more backend languages such as Go, Python, or Java with a track record of owning complex production systems.
- Full-stack engineering experience, including building user-facing web applications and operational dashboards using modern frontend frameworks such as React.js, along with backend APIs and data pipelines.
- Systems thinker who naturally traces dependencies, considers second-order effects, and asks 'why did this break?' not just 'how do I fix it?'
To apply, use the employer's original posting:
https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Senior-Staff-Software-Engineer---Agentic-Automation_JR2016660