# Staff Software Engineer - Ingestion

**Company**: Databricks
**Location**: Bengaluru, India
**Experience**: staff
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology
**Wikidata**: https://www.wikidata.org/wiki/Q18350420

**Apply**: https://job-boards.greenhouse.io/databricks/jobs/8200692002?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_adfcdc3b-88c

## Description

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems, from making the next mode of transportation a reality to accelerating the development of medical breakthroughs.

Ingesting data into the Lakehouse is a strategic area of investment for Databricks and a key enabler for Data and AI workflows. Lakeflow Connect is looking to solve this problem by providing ready-to-use, point-and-click connectors for a wide variety of sources, including enterprise applications, databases, cloud storage, message queues, and local files.

In addition to being an important part of Lakeflow and Data Engineering, Connect is also a key platform capability. Every surface in Databricks requires ingestion capabilities, and the lead in this role will need to work closely with other product teams to embed Connect into these surfaces.

We are looking for engineers with experience in core database internals to join our Lakeflow Connect team. A key part of Connect is extracting data from OLTP systems while imposing minimal load on production systems. To do this efficiently, we are building systems that use techniques such as incremental data capture and log parsing.

The impact you will have:

- Solve real business needs at large scale by applying your software engineering expertise.

- Deliver a highly scalable, available, and fault-tolerant engine processing hundreds of TB of data daily across thousands of customers.

- Perform low-level systems debugging, performance measurement, and optimization on large production clusters.

- Drive architecture design, influence the product roadmap, and take ownership of and responsibility for new projects.

- Use your deep experience to help prevent and investigate production issues.

- Plan and lead complex technical projects that span several teams within the company.

- Act as a strong influencer or driver in the organisation’s roadmap and direction.

- Lead a TLG or similar review committee, or initiate and sustain an org/eng-wide initiative driven by engineering needs.

- Break down complex problems quickly into potential solutions, knowns, and unknowns, and de-risk through prototyping/validation.

- Contribute as a Technical Team Lead by mentoring others, leading sprint planning, delegating work and assignments to team members, and participating in project planning.

What we look for:

- 15+ years of industry experience building and supporting large-scale distributed systems.

- Experience in areas such as database replication, backup, and transaction recovery at one of the major database vendors.

- Comfortable working towards a multi-year vision with incremental deliverables.

- Motivated by delivering customer value and impact.

- Strong foundation in algorithms and data structures and their real-world use cases.

- Experience driving company initiatives towards customer satisfaction.

Benefits: At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees.

## Skills

### Required
- database replication
- backup
- transaction recovery
- large-scale distributed systems
- algorithms
- data structures

---

Source: [Apply at job-boards.greenhouse.io](https://job-boards.greenhouse.io/databricks/jobs/8200692002?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply)
