Description
We're hiring a Research Engineer to join our Code RL team within the RL organization. As a Research Engineer, you'll advance our models' ability to safely write correct, fast code for accelerators.
You'll need a deep understanding of accelerator performance so you can turn it into tasks and signals that models can learn from. Specifically, you will:
- Invent, design, and implement RL environments and evaluations.
- Conduct experiments and shape our research roadmap.
- Deliver your work into training runs.
- Collaborate with other researchers, engineers, and performance engineering specialists across and outside Anthropic.
We're looking for someone with expertise in accelerators (CUDA, ROCm, Triton, Pallas) and ML framework programming (JAX or PyTorch), and with experience balancing research exploration with engineering implementation.
Strong candidates may also have experience with reinforcement learning, experience porting ML workloads between different types of accelerators, and familiarity with LLM training methodologies.
The annual compensation range for this role is $350,000-$850,000 USD.
Please note that we're an extremely collaborative group, and we value communication skills. The easiest way to understand our research directions is to read our recent research.
We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.