# Senior Software Engineer - Distributed Data Systems

**Company**: Databricks
**Location**: Mountain View, California
**Work arrangement**: onsite
**Experience**: senior
**Job type**: full-time
**Salary**: $166,000-$225,000 USD
**Category**: Engineering
**Industry**: Technology
**Wikidata**: https://www.wikidata.org/wiki/Q18350420

**Apply**: https://job-boards.greenhouse.io/databricks/jobs/6544325002
**Canonical**: https://yubhub.co/jobs/job_601c2dc5-462

## Description

At Databricks, we are enabling data teams to solve the world's toughest problems by building and running the world's best data and AI infrastructure platform. Our customers use deep data insights to improve their business. We are a customer-obsessed company that leaps at every opportunity to solve technical challenges.

As a software engineer on the Runtime team at Databricks, you will build the next-generation distributed data storage and processing systems that can outperform specialized SQL query engines in relational query performance, yet provide the expressiveness and programming abstractions to support diverse workloads ranging from ETL to data science.

Some example projects include:

- Developing the de facto open source standard framework for big data, Apache Spark.

- Providing reliable and high-performance services and client libraries for storing and accessing humongous amounts of data on cloud storage backends, such as AWS S3 and Azure Blob Storage.

- Building the next-generation query optimizer and execution engine that's fast, tuning-free, scalable, and robust.

We look for candidates with a strong foundation in algorithms and data structures and their real-world use cases; experience with distributed systems, databases, and big data systems; and a BS (or higher) in Computer Science or a related technical field.

The pay range for this role is $166,000-$225,000 USD, and the total compensation package may also include eligibility for an annual performance bonus, equity, and benefits.

## Skills

### Required
- Java
- Scala
- C++
- Apache Spark
- Hadoop
- Distributed systems
- Databases
- Big data systems

### Nice to have
- Algorithms
- Data structures
- Real-world use cases
- Cloud storage backends
- Query optimizer
- Execution engine
