Description

About the Role

This Research Scientist will focus on scoping, evaluating, red teaming, and defending against societal risks caused by advanced models that emerge over the next few years.

What You’ll Do:

Design and run research experiments to understand the emerging risks models may create
Produce internal & external artifacts (research, products, demos, dashboards, tools) that communicate the state of model capabilities
Shape product, safeguards, and training decisions based on what you find
Work closely with Societal Impacts (SI) and Safeguards teams

Sample Projects:

Build, run, and study an autonomous AI-powered business (e.g. Project Vend), then identify the growth of real autonomous businesses in the wild using Clio and other tools
Build a benchmark for a model’s national security capabilities
Red team unsafeguarded models’ abilities to be used for control
Identify indicators of models being used to scale movements that rely on social control

You May Be a Good Fit If You:

Are a fast experimentalist who ships research quickly
Have experience creating a research program from scratch
Are thoughtful about humanity’s adaptation to powerful AI systems in our economy and society
Can communicate thoughtfully in written + spoken form with a wide range of stakeholders
Can scope ambiguous research questions into tractable first projects

Strong candidates may also have experience with:

Building & maintaining large, foundational infrastructure
Building simple interfaces that allow non-technical collaborators to evaluate AI systems
Working with and prioritizing requests from a wide variety of stakeholders, including research and product teams

Annual Compensation Range:

$320,000-$850,000 USD

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://job-boards.greenhouse.io/anthropic/jobs/5103788008