# Data Engineer

**Company**: Dropbox
**Location**: Remote - Mexico
**Work arrangement**: remote
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology

**Apply**: https://job-boards.greenhouse.io/dropbox/jobs/7739553?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_3b6f25aa-f10

## Description

Dropbox is looking for a Data Engineer to join the Analytics Data Engineering (ADE) team within Data Science & AI Platform. You will build large, scalable analytics pipelines using modern data technologies.

## Responsibilities

- Define company data assets (data models) and the Spark and SparkSQL jobs that populate them

- Define and design data integrations and data quality frameworks, and design and evaluate open-source and vendor tools for data lineage

- Work closely with Dropbox business units and engineering teams to develop a strategy for a long-term Data Platform architecture that is efficient, reliable, and scalable

- Conceptualize and own the data architecture for multiple large-scale projects, while evaluating design and operational cost-benefit tradeoffs within systems

- Collaborate with engineers, product managers, and data scientists to understand data needs, representing key data insights in a meaningful way

- Design, build, and launch collections of sophisticated data models and visualizations that support multiple use cases across different products or domains

- Optimize pipelines, dashboards, frameworks, and systems to facilitate easier development of data artifacts

On-call work may be necessary occasionally to help address bugs, outages, or other operational issues, with the goal of maintaining a stable and high-quality experience for customers.

## Requirements

- 5+ years of Spark, Python, Java, C++, or Scala development experience

- 5+ years of SQL experience

- 5+ years of experience with schema design, dimensional data modeling, and medallion architectures

- Experience with the Databricks platform and data lake architectures for large-scale data processing and analytics

- Excellent product and strategic thinking, with the communication skills to influence product and cross-functional teams by identifying data opportunities that drive impact

- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience

- Experience designing, building and maintaining data processing systems

## Preferred Qualifications

- 7+ years of SQL experience

- 7+ years of experience with schema design, dimensional data modeling, and medallion architectures

- Experience with Airflow or other similar orchestration frameworks

- Experience building data quality monitoring using MonteCarlo or similar tools

## Skills

### Required
- Spark
- Python
- Java
- C++
- Scala
- SQL
- Databricks
- data lake architectures

### Nice to have
- Airflow
- MonteCarlo

