# Research Engineer, Cybersecurity Reinforcement Learning

**Company**: Anthropic
**Location**: San Francisco, CA, New York City, NY
**Work arrangement**: hybrid
**Experience**: senior
**Job type**: full-time
**Salary**: $300,000 - $405,000 USD
**Category**: Engineering
**Industry**: Technology
**Wikidata**: https://www.wikidata.org/wiki/Q116758847

**Apply**: https://job-boards.greenhouse.io/anthropic/jobs/5025624008
**Canonical**: https://yubhub.co/jobs/job_b0188062-45f

## Description

## About the role

We're hiring for the Cybersecurity RL team within Horizons. As a Research Engineer, you'll help to safely advance the capabilities of our models in secure coding, vulnerability remediation, and other areas of defensive cybersecurity.

This role blends research and engineering, requiring you to both develop novel approaches and realise them in code. Your work will include designing and implementing RL environments, conducting experiments and evaluations, delivering your work into production training runs, and collaborating with other researchers, engineers, and cybersecurity specialists across and outside Anthropic.

## You may be a good fit if you:

- Have experience in cybersecurity research.

- Have experience with machine learning.

- Have strong software engineering skills.

- Can balance research exploration with engineering implementation.

- Are passionate about AI's potential and committed to developing safe and beneficial systems.

## Strong candidates may also have:

- Professional experience in security engineering, fuzzing, detection and response, or other applied defensive work.

- Experience participating in or building CTF competitions and cyber ranges.

- Academic research experience in cybersecurity.

- Familiarity with RL techniques and environments.

- Familiarity with LLM training methodologies.

## Logistics

**Education requirements:** We require at least a Bachelor's degree in a related field or equivalent experience. **Location-based hybrid policy:** Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

**Visa sponsorship:** We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

## How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time.

## Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lot more.

## Skills

### Required
- cybersecurity research
- machine learning
- software engineering
- RL techniques and environments
- LLM training methodologies

### Nice to have
- security engineering
- fuzzing
- detection and response
- CTF competitions and cyber ranges
- academic research in cybersecurity