Description
We're seeking an exceptional Staff Software Engineer to join our Observability team at Pinterest. This role combines deep technical expertise in distributed systems and data engineering with a product-oriented mindset to build world-class observability solutions that empower our engineering organisation.
As a Staff Engineer on the Observability team, you'll be responsible for designing and building the infrastructure and tools that provide visibility into Pinterest's large-scale distributed systems, helping thousands of engineers understand, debug, and optimise their services.
Key responsibilities include:
- Defining and executing the observability roadmap, treating it as a product.
- Architecting, building, and scaling distributed observability infrastructure (metrics, logs, traces) to handle massive volumes across Pinterest's distributed systems.
- Building high-performance data pipelines and storage for real-time and historical telemetry analysis at Pinterest scale.
- Championing Best Practices: Establishing observability standards and patterns across the organisation, making it easy for teams to instrument their services and gain actionable insights.
- Technical Leadership: Mentoring engineers, leading architectural reviews, and influencing technical decisions across teams to improve overall system reliability and performance.
- Cross-functional Collaboration: Partnering with SRE, Infrastructure, Product Engineering, and other teams to understand pain points and deliver solutions that improve developer productivity and system reliability.
- Innovation: Staying current with observability trends and technologies, evaluating and adopting cutting-edge tools and techniques to keep Pinterest at the forefront.
To succeed in this role, you'll need:
- A bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
- A product mindset, with demonstrated ability to work backwards from customer needs.
- Deep expertise in distributed systems, with 7+ years of experience designing and operating large-scale distributed systems.
- Strong background in building data pipelines, working with time-series databases, columnar storage, stream processing, and data modelling at scale.
- Hands-on experience with modern observability tools and practices, including metrics, logging, tracing, and profiling.
- Expert-level coding skills in languages like Java, Python, Go, or Scala.
- Ability to see the big picture while managing complex technical details, balancing trade-offs between cost, performance, and reliability.
- Experience building observability platforms from the ground up or significantly scaling existing solutions.
- Familiarity with cloud-native architectures and technologies.
- Track record of driving adoption of internal platforms through excellent documentation, UX, and developer advocacy.
- Experience with machine learning or anomaly detection applied to observability use cases.
- Strong communication skills with ability to influence stakeholders at all levels.
In-office requirement statement: We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection. This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country.
Relocation statement: This position is not eligible for relocation assistance.
At Pinterest, we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise.