# Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

**Company**: Scale
**Location**: San Francisco, CA; New York, NY
**Work arrangement**: onsite
**Experience**: staff
**Job type**: full-time
**Salary**: $189,600-$237,000 USD
**Category**: Engineering
**Industry**: Technology

**Apply**: https://job-boards.greenhouse.io/scaleai/jobs/4625337005
**Canonical**: https://yubhub.co/jobs/job_57a8aa85-77e

## Description

We are seeking a Staff Machine Learning Research Engineer to join our Enterprise ML Research Lab. As a key member of our team, you will build out our next-gen Agent RL training platform, integrating cutting-edge research into our training stack. You will train state-of-the-art models, design solutions for complex multi-agent systems, and collaborate with our team to deploy use cases ranging from next-generation AI cybersecurity firewall LLMs to foundation healthtech search models.

The ideal candidate will have 5+ years of experience training LLMs in a production environment, experience with post-training methods such as RLHF/RLVR and related algorithms such as PPO/GRPO, and publications at top conferences such as NeurIPS, ICLR, or ICML within the last two years. A PhD or Master's degree in Computer Science or a related field is required.

In addition to a competitive salary, you will receive equity-based compensation, comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. This role may also be eligible for additional benefits such as a commuter stipend.

## Skills

### Required
- LLM training
- Post-training methods
- RLHF/RLVR
- PPO/GRPO
- NeurIPS
- ICLR
- ICML
- Computer Science
- PhD
- Master's
