Description
About Dialpad
Dialpad is the AI-native business communications platform. We unify calling, messaging, meetings, and contact center on a single platform - powered by AI that understands every conversation in real time.
As a Software Engineer in Observability, you'll be responsible for our metrics and log collection platform. You'll work closely with other Infrastructure engineers to determine resource usage and requirements. You'll also help create tooling, libraries, and documentation that enable other engineers to instrument their own projects. In addition, you'll keep our team informed about trends in the broader observability/monitoring industry.
Responsibilities
- Develop and improve instrumentation for monitoring and logging the health and availability of services.
- Develop and maintain the observability stack within Dialpad engineering.
- Define best practices and standards around making systems and services measurable, and work with various teams to get those best practices applied.
- Create tools and libraries for other engineering teams to enable them to build self-monitoring capabilities.
- Create and own internal documentation used by the other engineering teams.
- Stay up-to-date with the latest trends in observability, logging, monitoring, and cloud technologies. Introduce innovative solutions and best practices to improve system observability and reliability.
- Collaborate with different engineering teams to integrate observability practices into their workflows.
- Participate in a rotating on-call within the larger Infrastructure Engineering division.
Requirements
- Background in both Systems and/or Software Engineering.
- Experience in designing, automating, maintaining, and optimizing observability platforms (logging, metrics, and tracing).
- Experience with configuration management tools such as Ansible, Terraform, etc.
- Experience with Public Cloud environments such as GCP, AWS, etc.
- Familiarity with languages such as Python, Go, Rust, etc.
- Previous direct experience with Grafana, Loki, Prometheus.
- Experience with Linux.
- Experience with Kubernetes (including GKE/EKS) and building containerized applications.
- Undergraduate degree in Computer Science or Engineering.
Why Join Dialpad
- Work at the center of the AI transformation in business communications
- Build and ship agentic AI products that are redefining how companies operate
- Join a team where AI amplifies every employee’s impact
- Competitive salary, comprehensive benefits, and real opportunities for growth
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://job-boards.greenhouse.io/dialpad/jobs/8475165002