# Senior Software Engineer, Release Infra

**Company**: Brex
**Location**: São Paulo, São Paulo, Brazil
**Work arrangement**: hybrid
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology

**Apply**: https://job-boards.greenhouse.io/brex/jobs/8522011002
**Canonical**: https://yubhub.co/jobs/job_103d358c-a85

## Description

Why join us

Brex is the intelligent finance platform that enables companies to spend smarter and move faster in more than 200 markets. By combining global corporate cards and banking with intuitive spend management, bill pay, and travel software, Brex enables founders and finance teams to accelerate operations, gain real-time visibility, and control spend effortlessly.

As a Senior Software Engineer, Infrastructure (Release Engineering) at Brex, you will design, build, and operate the core systems that power Brex’s release, observability, and incident management processes. You will partner closely with product, platform, and operations teams to ensure releases are safe, fast, and reliable, and that our infrastructure scales securely as Brex grows.

Responsibilities

- Design, build, and maintain the release infrastructure that powers Brex’s deployment pipelines and incident workflows

- Drive technical strategy and architecture for release and observability systems, making them more scalable, reliable, and secure

- Collaborate with product, engineering, and operations partners to ensure Brex’s releases are safe, predictable, and low-friction

- Identify and deliver improvements to the end-to-end release process (from code merge to production) to reduce risk and cycle time

- Build and evolve tooling for observability and incident response, enabling fast detection, triage, and resolution

- Proactively identify and mitigate risks in our release and infrastructure stack, including performance, reliability, and security concerns

- Define, instrument, and monitor key metrics for release engineering (e.g., deployment frequency, change failure rate, MTTR) and use them to guide improvements

- Partner with other infrastructure and product teams to debug complex production issues and drive long-term fixes

- Contribute to and champion best practices in release engineering, reliability, and operational excellence across the organization

- Mentor other engineers on the team, providing technical guidance and code reviews to elevate the overall quality of our infrastructure

- Stay up-to-date on emerging tools and practices in release engineering, observability, and SRE, and bring relevant ideas into Brex’s stack

Requirements

- 7+ years of professional experience designing, building, and operating backend or infrastructure systems in production

- Strong proficiency in backend programming languages (e.g., Go, Java, Kotlin, or Python) with a focus on reliability and performance

- Hands-on experience with CI/CD and release pipelines (e.g., GitHub Actions, CircleCI, Buildkite, Argo, Spinnaker, Jenkins) including build, test, and deployment automation

- Experience architecting and operating scalable, high-availability distributed systems on cloud platforms (e.g., AWS, GCP, Azure)

- Deep familiarity with containerization and orchestration (e.g., Docker, Kubernetes) and infrastructure-as-code (e.g., Terraform, CloudFormation)

- Experience designing and maintaining observability tooling (metrics, logs, tracing) and integrating it into incident response workflows

- Strong understanding of reliability and SRE practices, including SLIs/SLOs, error budgets, and incident management best practices

- Experience designing and optimizing data storage systems (SQL and/or NoSQL) for operational and observability use cases

- Proven track record of improving release processes (e.g., reducing deployment risk, increasing deployment frequency, automating rollbacks)

- Comfort working cross-functionally with product and other engineering teams to debug complex production issues and ship changes safely

- Strong communication and collaboration skills, including writing clear design docs and driving technical decisions across teams

Experience Level: senior Employment Type: full-time Workplace Type: hybrid Category: Engineering Industry: Technology Salary Range: Not stated Salary Min: Not stated Salary Max: Not stated Salary Currency: USD Salary Period: year Required Skills:

- Backend programming languages (e.g., Go, Java, Kotlin, or Python)

- CI/CD and release pipelines (e.g., GitHub Actions, CircleCI, Buildkite, Argo, Spinnaker, Jenkins)

- Containerization and orchestration (e.g., Docker, Kubernetes)

- Infrastructure-as-code (e.g., Terraform, CloudFormation)

- Observability tooling (metrics, logs, tracing)

- Reliability and SRE practices (SLIs/SLOs, error budgets, incident management)

Preferred Skills:

- Emerging tools and practices in release engineering, observability, and SRE

## Skills

### Required
- Backend programming languages (e.g., Go, Java, Kotlin, or Python)
- CI/CD and release pipelines (e.g., GitHub Actions, CircleCI, Buildkite, Argo, Spinnaker, Jenkins)
- Containerization and orchestration (e.g., Docker, Kubernetes)
- Infrastructure-as-code (e.g., Terraform, CloudFormation)
- Observability tooling (metrics, logs, tracing)
- Reliability and SRE practices (SLIs/SLOs, error budgets, incident management)

### Nice to have
- Emerging tools and practices in release engineering, observability, and SRE
