# Sr. Software Engineer - Perf and Benchmarking

**Company**: CoreWeave
**Location**: Sunnyvale, CA
**Work arrangement**: hybrid
**Experience**: senior
**Job type**: full-time
**Salary**: $139,000 to $204,000
**Category**: Engineering
**Industry**: Technology

**Apply**: https://job-boards.greenhouse.io/coreweave/jobs/4626698006?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_e7c9e8a1-9c9

## Description

We are seeking a Senior Software Engineer to join our Benchmarking & Performance team at CoreWeave. As a Senior Engineer, you will play a crucial role in designing, building, and improving our planet-scale performance data warehouse. You will be responsible for ingesting, storing, transforming, and analyzing performance events across our global infrastructure. Additionally, you will contribute to achieving industry-leading end-to-end performance benchmarking publications such as MLPerf.

Responsibilities:

- Design and improve Kubernetes-native benchmarking services to measure latency, throughput, jitter, and cost-per-request across CoreWeave's compute stack.

- Implement and maintain benchmarking workflows for end-to-end MLPerf Training and Inference runs.

- Lead design reviews and drive architecture within the team.

- Mentor junior engineers and review cross-team designs.

- Ensure reproducible and well-documented benchmarking processes.

Requirements:

- 5+ years of experience building distributed systems, high-performance computing, or cloud services.

- Strong coding skills in Python or Go, with deep familiarity with networked systems and performance.

- Hands-on experience with Kubernetes at production scale, CI/CD, and observability stacks.

- Experience with performance-critical GPU systems and model-serving stacks.

- Strong communicator comfortable collaborating with cross-functional teams and external partners.

Nice to have:

- Experience with time-series databases, LSM-based storage engines, or custom data pipelines.

- Experience running MLPerf submissions or similar large-scale audited benchmarks.

- Contributions to OSS projects such as llm-d, vLLM, or PyTorch.

- Exposure to benchmarking large GPU fleets or multi-region clusters.

- Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies.

What We Offer:

- Competitive salary ranging from $139,000 to $204,000.

- Comprehensive benefits package including medical, dental, and vision insurance, company-paid life insurance, and flexible spending account.

- Opportunity to participate in employee stock purchase program and 401(k) with generous employer match.

- Flexible PTO and catered lunch in office and data center locations.

- Casual work environment and culture focused on innovative disruption.

## Skills

### Required
- Python
- Go
- Kubernetes
- distributed systems
- high-performance computing
- cloud services
- GPU systems
- model-serving stacks

### Nice to have
- time-series databases
- LSM-based storage engines
- custom data pipelines
- MLPerf submissions
- OSS projects
- CUDA kernels
- NCCL/SHARP
- RDMA/NUMA
- GPU interconnect topologies

---

Source: [Apply at job-boards.greenhouse.io](https://job-boards.greenhouse.io/coreweave/jobs/4626698006?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply)
