Description

Job Description

We are seeking an experienced Engineering Manager to lead our Batch Compute Infrastructure team at Stripe. As a key member of our engineering organization, you will be responsible for defining the multi-year roadmap for Stripe's Batch Compute Infrastructure, leading complex architectural shifts and modernization.

Responsibilities

Drive Strategic Vision: Define the multi-year roadmap for Stripe’s Batch Compute Infrastructure, leading complex architectural shifts and modernization.
Lead and Scale: Build, mentor, and aggressively scale a high-performing team of engineers, proactively investing in their career development and fostering a culture of operational excellence.
Ensure Operational Rigor: Maintain unwavering reliability for a Tier-0 infrastructure processing tens of thousands of daily workloads, proactively mitigating risks and managing complex on-call telemetry.
Cross-Functional Orchestration: Collaborate deeply with data platform teams, finance, and user groups to define compute efficiency metrics, execute massive-scale cost optimization strategies, and guarantee compliance with global financial regulations.
Technical Stewardship: Provide technical guidance in architecture reviews, evaluating critical cost, performance, and reliability trade-offs in distributed systems design involving Hadoop, Spark, AWS cloud primitives, and modern metastores.

Requirements

10+ years of professional software development and engineering experience.
3+ years of direct engineering management experience, successfully building and operating high-velocity technical teams.
Deep technical background in building, scaling, and maintaining large-scale distributed data systems or Tier-0 infrastructure using open-source tools (e.g., Hadoop, Spark, Celeborn, Airflow, Kafka).
Proven track record of driving significant infrastructure efficiency, managing capacity planning, and making data-driven cost-performance trade-offs.
Experience working effectively in highly cross-functional, global organizations.

Preferred Requirements

Experience managing remote or geographically distributed engineering teams.
Familiarity with managing a massive fleet of Linux servers, on-premise Hadoop clusters, and modern cloud data architectures (e.g., AWS S3, Graviton).
Demonstrated ability to navigate strategic ambiguity and deliver complex, multi-quarter infrastructural projects from inception to completion.
Deep empathy for internal data users with a passion for building robust developer tooling and abstractions.

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://job-boards.greenhouse.io/stripe/jobs/7827623