Microsoft AI

Member of Technical Staff, High Performance Computing Engineer

Microsoft AI
onsite staff full-time Competitive salary Multiple Locations, United States
Apply →

First indexed 6 Mar 2026

Description

Summary

Microsoft AI are looking for experienced Member of Technical Staff, High Performance Computing Engineers to help build and scale the infrastructure that trains their frontier models and powers the next evolution of their personal AI, Copilot.

About the Role

This role offers the unique opportunity to work on some of the largest scale supercomputers in the world – a rare chance to operate at such a significant scale. As a Member of Technical Staff, High Performance Computing Engineer, you will design, operate, and maintain large-scale HPC environments, drawing on hands-on engineering experience in production settings. You will own the deployment, configuration, and day-to-day operation of HPC schedulers (e.g., SLURM, Kubernetes), ensuring reliable and efficient job scheduling at scale.

Accountabilities

  • Design, operate, and maintain large-scale HPC environments, drawing on hands-on engineering experience in production settings.
  • Own the deployment, configuration, and day-to-day operation of HPC schedulers (e.g., SLURM, Kubernetes), ensuring reliable and efficient job scheduling at scale.

The Candidate we're looking for

Experience:

  • 4+ years technical engineering experience with deploying or operating on-premise or cloud high-performance clusters.
  • 4+ years experience working with high-scale training clusters (ex. working with frameworks/tools such as nvidia InfiniBand clusters, SLURM, Kubernetes, Ray, etc.).
  • 4+ years experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP.

Technical skills:

  • Experience with LLM training clusters.
  • Experience working with AI platforms, frameworks, and APIs.
  • Experience using Machine Learning frameworks, including experience using, deploying, and scaling language learning models, either personally or professionally.

Personal attributes:

  • Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience.
  • Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security.

Benefits

  • Competitive salary.
  • Comprehensive benefits package.
  • Opportunities for professional growth and development.
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://microsoft.ai/job/member-of-technical-staff-high-performance-computing-engineer-mai-superintelligence-team/