# Member of Technical Staff - Data Research Engineer - MAI Superintelligence Team

**Company**: Microsoft
**Location**: Mountain View
**Work arrangement**: hybrid
**Experience**: staff
**Job type**: full-time
**Salary**: USD $119,800 – $234,700 per year (U.S.) or USD $158,400 – $258,000 per year (San Francisco Bay area and New York City metropolitan area)
**Category**: Engineering
**Industry**: Technology
**Ticker**: MSFT
**Wikidata**: https://www.wikidata.org/wiki/Q2283

**Apply**: https://microsoft.ai/job/member-of-technical-staff-data-research-engineer-mai-superintelligence-team-4/
**Canonical**: https://yubhub.co/jobs/job_f0e01847-2e0

## Description

We are seeking Data Research Engineers to join our Multimodal team, where we are building the next generation of foundation models across vision, language, audio, and beyond. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you. In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse multimodal data sources critical to model development. You will lead efforts to:

Develop novel data collection strategies

Improve dataset quality and integrity

Understand data-driven model behaviors

Align datasets with ethical and societal values

This is a cross-disciplinary, high-impact role ideal for engineers who want to push the boundaries of what AI can learn from data, especially in multimodal contexts.

## Skills

### Required
- Python
- Pandas
- NumPy
- data libraries
- data analysis
- data engineering
- large-scale datasets
- unstructured or semi-structured data
- statistics
- exploratory data analysis methods
- data processing frameworks
- Spark
- Ray
- Apache Beam

### Nice to have
- Master’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline
- 8+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
