# Engineering Manager -  Batch Compute Infrastructure

**Company**: Stripe
**Location**: Bengaluru
**Work arrangement**: remote
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology

**Apply**: https://job-boards.greenhouse.io/stripe/jobs/7827623
**Canonical**: https://yubhub.co/jobs/job_551df03a-e42

## Description

## Job Description

We are seeking an experienced Engineering Manager to lead our Batch Compute Infrastructure team at Stripe. As a key member of our engineering organization, you will be responsible for defining the multi-year roadmap for Stripe's Batch Compute Infrastructure, leading complex architectural shifts and modernization.

## Responsibilities

- Drive Strategic Vision: Define the multi-year roadmap for Stripe’s Batch Compute Infrastructure, leading complex architectural shifts and modernization.

- Lead and Scale: Build, mentor, and aggressively scale a high-performing team of engineers, proactively investing in their career development and fostering a culture of operational excellence.

- Ensure Operational Rigor: Maintain unwavering reliability for a Tier-0 infrastructure processing tens of thousands of daily workloads, proactively mitigating risks and managing complex on-call telemetry.

- Cross-Functional Orchestration: Collaborate deeply with data platform teams, finance, and user groups to define compute efficiency metrics, execute massive-scale cost optimization strategies, and guarantee compliance with global financial regulations.

- Technical Stewardship: Provide technical guidance in architecture reviews, evaluating critical cost, performance, and reliability trade-offs in distributed systems design involving Hadoop, Spark, AWS cloud primitives, and modern metastores.

## Requirements

- 10+ years of professional software development and engineering experience.

- 3+ years of direct engineering management experience, successfully building and operating high-velocity technical teams.

- Deep technical background in building, scaling, and maintaining large-scale distributed data systems or Tier-0 infrastructure using open-source tools (e.g., Hadoop, Spark, Celeborn, Airflow, Kafka).

- Proven track record of driving significant infrastructure efficiency, managing capacity planning, and making data-driven cost-performance trade-offs.

- Experience working effectively in highly cross-functional, global organizations.

## Preferred Requirements

- Experience managing remote or geographically distributed engineering teams.

- Familiarity with managing a massive fleet of Linux servers, on-premise Hadoop clusters, and modern cloud data architectures (e.g., AWS S3, Graviton).

- Demonstrated ability to navigate strategic ambiguity and deliver complex, multi-quarter infrastructural projects from inception to completion.

- Deep empathy for internal data users with a passion for building robust developer tooling and abstractions.

## Skills

### Required
- Hadoop
- Spark
- Celeborn
- Airflow
- Kafka
- Linux
- AWS
- Cloud Computing

### Nice to have
- Remote Engineering Management
- Distributed Systems Design
- Cloud Architecture
- DevOps
- Agile Methodologies
