Description
As a Software Engineer in NVIDIA's Internal Infrastructure Group, you'll design and build distributed systems that power the workflows behind our next generation of GPUs and AI chips. The software you create will help thousands of engineers develop world-changing technology faster, more efficiently, and at scale.
This role focuses on building new infrastructure systems and distributed workflow platforms.
Responsibilities
- Build and extend scalable, high-performance core infrastructure systems and workflow platforms that improve reliability and developer productivity across NVIDIA's chip-design ecosystem.
- Design and optimize distributed systems that orchestrate millions of regression and validation workloads across heterogeneous compute environments.
- Design systems that coordinate dependency-aware execution across large-scale compute clusters.
- Define system architecture including APIs, data models, execution models, and scaling strategies.
- Own systems end-to-end, from gathering requirements and proposing technical designs to implementation, performance analysis, testing, and deployment.
- Collaborate with internal teams to understand workflows, identify bottlenecks, and deliver automation that accelerates engineering workflows.
- Analyze and tune system performance across distributed services using profiling, tracing, and telemetry.
Requirements
- BS or MS in Computer Science or a related field (or equivalent experience).
- 9+ years of professional software development experience.
- Strong foundation in data structures, algorithms, concurrency, and distributed system design.
- Demonstrated experience designing and building distributed systems from first principles , including defining APIs, data models, execution flows, and scaling approaches.
- Experience owning systems through design, implementation, and evolution , including handling trade-offs, failure modes, and system limitations.
- Experience working on systems involving scheduling, dependency resolution, or large-scale job orchestration.
- Proficiency in modern programming languages (Python, C++, Go, or similar) on Linux systems, with experience building large-scale systems or infrastructure software.
- Ability to clearly articulate architecture decisions, trade-offs, and how systems evolved over time.
Nice to Have
- Experience improving developer productivity through infrastructure or platform design.
- Hands-on familiarity with profiling, tracing, or performance-optimization techniques.
- Understanding of chip-design, verification, or modern ML workflows.
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/India-Bengaluru/Senior-Infrastructure-Software-Systems-Engineer_JR2016240