# Data Engineer - Commodities

**Company**: FIC & Risk Technology
**Location**: Old Greenwich, Connecticut
**Work arrangement**: hybrid
**Experience**: senior
**Job type**: full-time
**Salary**: $175,000 to $250,000
**Category**: IT
**Industry**: Finance

**Apply**: https://mlp.eightfold.ai/careers/job/755956566830?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_e39102e7-c35

## Description

The Commodities Technology team at FIC & Risk Technology builds and operates a data platform that aggregates and curates critical commodities data. This includes weather, supply/demand, storage, transportation, and other fundamental and alternative datasets.

We are seeking a Commodities Content Engineer who will focus on building robust ETL workflows and data models on top of our commodities data platform. In this role, you will use Python and SQL to design, implement, and maintain pipelines that ingest, clean, transform, and catalog commodities datasets.

Key Responsibilities:

- Design and implement end-to-end ETL workflows in Python and SQL to ingest and transform commodities data from multiple vendors and internal sources.

- Build and maintain standardized data models, schemas, and metadata that make commodities datasets easy to understand and discover within the platform.

- Use Airflow or similar tools to schedule, monitor, and manage data pipelines, ensuring reliability and timely delivery.

- Implement robust validation, reconciliation, and anomaly-detection checks to ensure data completeness, correctness, and consistency.

- Leverage AI to automate schema inference across structured and semi-structured data sources, manage schema drift, and accelerate development of scalable ingestion pipelines.

- Apply AI-driven data quality, observability, and documentation capabilities to detect anomalies, monitor data health, and generate clear lineage and technical documentation across complex data workflows.

- Leverage Git, GitHub Actions, and automated testing (PyTest) to maintain high-quality code and repeatable deployments.

- Partner with commodities PMs, researchers, and data strategists to understand use cases and continuously refine datasets, definitions, and documentation.

Required Qualifications:

- 4 years of experience in data engineering, analytics engineering, or similar roles focused on building and maintaining ETL pipelines.

- Strong skills in Python and SQL, with experience working with large datasets and complex transformations.

- Hands-on experience with Airflow or other workflow schedulers.

- Familiarity with version control (Git), CI/CD pipelines (GitHub Actions or equivalent), and test automation (e.g., PyTest).

- Strong attention to detail, data quality, and documentation; ability to reason about edge cases and data integrity.

- Ability to work independently, communicate clearly with both technical and non-technical stakeholders, and manage work across multiple concurrent initiatives.

Preferred Qualifications:

- Knowledge of commodities markets and commodities data (e.g., weather, supply/demand, storage, freight, flows).

- Experience with data warehousing technologies (e.g., Snowflake, columnar storage formats, or analytic databases).

- Prior experience in a financial services, trading, or research-driven environment.

- Exposure to data catalog/data governance tools and best practices.

The estimated base salary range for this position is $175,000 to $250,000, which is specific to New York and may change in the future. Millennium offers a total compensation package that includes a base salary, a discretionary performance bonus, and a comprehensive benefits package.

## Skills

### Required
- Python
- SQL
- Airflow
- Git
- GitHub Actions
- PyTest
- ETL pipelines
- data modeling
- data quality

### Nice to have
- commodities markets
- data warehousing
- financial services
- data catalog
- data governance

