Description

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity.

As a Model Behavior Architect, you are at the forefront of defining and measuring LLM behavior. We are looking for people who have built a career in engineering, machine learning, and large language models, and who are experts in model evaluation, policy writing, and creating eval pipelines for complex tasks.

What you will do

Interact with models to identify where model behavior can be improved
Gather internal and external feedback on model behavior to scope areas for improvement
Design and implement evals, data guidelines, data generation, and synthetic testing environments
Identify and fix edge case behaviors through rigorous testing
Develop robust evaluation pipelines for our model candidates
Work collaboratively with AI Scientists

About you

You have a deep understanding of at least one of the following: 1) linguistics, language, and translation; 2) engineering and code behavior; or 3) LLM agents at work, including reasoning and tool use
You have prior knowledge of training and optimizing model behavior
You are an expert at building robust evaluations
You thrive in dynamic and technically complex environments
You have a track record of delivering innovative, out-of-the-box solutions to address real-world constraints

To apply, use the employer's original posting: https://jobs.lever.co/mistral/4337cebc-b951-4528-98f8-ebcb45db5645