Description

About this role

As a Model Behavior Architect at Mistral AI, you will be at the forefront of defining and measuring Large Language Model (LLM) behavior. You will work closely with our Science team to define what 'good' looks like for various tasks, including Reasoning, Audio, Alignment, Tools, and Frontier bets.

Responsibilities

Interact with models to identify areas for improvement in model behavior
Gather internal and external feedback on model behavior to scope areas for improvement
Design and implement evaluation pipelines, data guidelines, data generation, and synthetic testing environments
Identify and fix edge case behaviors through rigorous testing
Develop robust evaluation pipelines for model candidates
Collaborate with AI Scientists

About you

You have a deep understanding of linguistics, language, and translation, engineering and code behavior, or LLM agents at work, including reasoning and tool use
You have prior knowledge in training and optimizing model behavior
You are an expert at building robust evaluations
You thrive in dynamic and technically complex environments
You have a track record of delivering innovative, out-of-the-box solutions to address real-world constraints

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://jobs.lever.co/mistral/4337cebc-b951-4528-98f8-ebcb45db5645