Description
We're seeking an experienced Infrastructure Engineer/SRE to join our engineering team. As a key member of our infrastructure team, you will be responsible for designing, building, and advancing our core infrastructure that allows the engineering team to execute quickly, productively, and securely.
As a collaborative but highly autonomous working environment, each member has a defined role with clear expectations, as well as the freedom to pursue projects they find interesting.
Responsibilities:
- Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
- Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
- Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
- Infrastructure-as-code deployment tooling and supporting services on multiple cloud providers.
- Automate operations and engineering. Focus on automation so we can spend energy where it matters.
- Building machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets.
What we are looking for:
- 5+ years experience in DevOps, Site Reliability Engineering, Production Engineering, or equivalent field.
- Deep proficiency with coding languages such as Golang or Python.
- Deep familiarity with container-related security best practices.
- Production experience working with Kubernetes, and a deep understanding of the Kubernetes ecosystem, including popular open-source tooling such as cert-manager or external-dns.
- Experience with GPU-enabled clusters is a bonus.
- Production experience with Kubernetes templating tools such as Helm or Kustomize.
- Production experience with IAC tools such as Terraform or CloudFormation.
- Production experience working with AWS and services such as IAM, S3, EC2, and EKS.
- Production experience with other cloud providers such as Google Cloud and Azure is a bonus.
- Production experience with database software such as PostgreSQL.
- Experience with GitOps tooling such as Flux or Argo.
- Experience with CI/CD such as GitHub Actions.
Perks & Benefits:
- We offer Cresta employees a variety of medical benefits designed to fit your stage of life.
- Flexible vacation time to promote a healthy work-life blend.
- Paid parental leave to support you and your family.
Compensation for this position includes a base salary, equity, and a variety of benefits. Actual base salaries will be based on candidate-specific factors, including experience, skillset, and location, and local minimum pay requirements as applicable.