Description
Job Description
We are seeking an experienced Engineering Manager to lead our Batch Compute Infrastructure team at Stripe. As a key member of our engineering organization, you will be responsible for defining the multi-year roadmap for Stripe's Batch Compute Infrastructure, leading complex architectural shifts and modernization.
Responsibilities
- Drive Strategic Vision: Define the multi-year roadmap for Stripe’s Batch Compute Infrastructure, leading complex architectural shifts and modernization.
- Lead and Scale: Build, mentor, and aggressively scale a high-performing team of engineers, proactively investing in their career development and fostering a culture of operational excellence.
- Ensure Operational Rigor: Maintain unwavering reliability for a Tier-0 infrastructure processing tens of thousands of daily workloads, proactively mitigating risks and managing complex on-call telemetry.
- Cross-Functional Orchestration: Collaborate deeply with data platform teams, finance, and user groups to define compute efficiency metrics, execute massive-scale cost optimization strategies, and guarantee compliance with global financial regulations.
- Technical Stewardship: Provide technical guidance in architecture reviews, evaluating critical cost, performance, and reliability trade-offs in distributed systems design involving Hadoop, Spark, AWS cloud primitives, and modern metastores.
Requirements
- 10+ years of professional software development and engineering experience.
- 3+ years of direct engineering management experience, successfully building and operating high-velocity technical teams.
- Deep technical background in building, scaling, and maintaining large-scale distributed data systems or Tier-0 infrastructure using open-source tools (e.g., Hadoop, Spark, Celeborn, Airflow, Kafka).
- Proven track record of driving significant infrastructure efficiency, managing capacity planning, and making data-driven cost-performance trade-offs.
- Experience working effectively in highly cross-functional, global organizations.
Preferred Requirements
- Experience managing remote or geographically distributed engineering teams.
- Familiarity with managing a massive fleet of Linux servers, on-premise Hadoop clusters, and modern cloud data architectures (e.g., AWS S3, Graviton).
- Demonstrated ability to navigate strategic ambiguity and deliver complex, multi-quarter infrastructural projects from inception to completion.
- Deep empathy for internal data users with a passion for building robust developer tooling and abstractions.
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://job-boards.greenhouse.io/stripe/jobs/7827623