Thinking Machines Lab

Software Engineer, Data Infrastructure

Thinking Machines Lab
onsite entry|mid|senior full-time $350,000 - $475,000 USD San Francisco
Apply →

First indexed 18 Apr 2026

Description

We're looking for an engineer to join our small, high-impact team responsible for architecting and scaling the core infrastructure behind distributed training pipelines, multimodal data catalogs, and intelligent processing systems that operate over petabytes of data.

As a software engineer on our data infrastructure team, you'll design, build, and operate scalable, fault-tolerant infrastructure for LLM Research: distributed compute, data orchestration, and storage across modalities. You'll develop high-throughput systems for data ingestion, processing, and transformation , including training data catalogs, deduplication, quality checks, and search. You'll also build systems for traceability, reproducibility, and robust quality control at every stage of the data lifecycle.

You'll collaborate with research teams to unlock new features, improve data quality, and accelerate training cycles. You'll implement and maintain monitoring and alerting to support platform reliability and performance.

If you're excited by distributed systems, large-scale data mining, open-source tools like Spark, Kafka, Beam, Ray, and Delta Lake, and enjoy building from the ground up, we'd love to hear from you.

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://job-boards.greenhouse.io/thinkingmachines/jobs/5013919008