New The Skills of Tomorrow: how AI-exposed is every skill in 2026? See the data →
Okta

Staff Site Reliability Engineer, TCore (FedRamp)

Okta
Apply →
hybrid staff full-time $194,000-$267,000 USD San Francisco, California

First indexed 18 May 2026

Description

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organisations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

The TCore Team

The TCore team is a specialised engineering group that owns and operates all of Okta's networks. They are focused on ensuring the reliability, performance, and security of Okta's core infrastructure, particularly its global traffic entry points and the complete internal networking.

The Staff Site Reliability Engineer Role

We are looking for a Staff Site Reliability Engineer to join the TCore team. The ideal candidate is a self-starter who takes pride in designing and implementing durable solutions to network problems. They are passionate about network responsiveness and performance.

What you’ll be doing

  • Work with various teams to design and implement scalable, and reliable network solutions
  • Maintain a highly available cloud infrastructure edge for the Okta identity platform
  • Collect and analyse data to identify root causes for network-specific events
  • Automate AWS infrastructure with Terraform and/or Chef
  • Evolve the system by introducing changes to improve efficiency, scalability, and velocity

What you’ll bring to the role

  • 8+ years experience in a Cloud Network Engineer role or related
  • Demonstrated in-depth understanding of TCP/IP networking stack; (layer 2 through 7). Ability to implement a highly available VPC network, including inter-vpc connectivity. Working knowledge of stateless and stateful firewalls. Familiar with DNS, web-application firewalls, and various load balancing methods available in the cloud.
  • Deep knowledge of AWS/GCP network concepts such as Transit Gateway / Network Connectivity Center (NCC), Site-to-Site VPN / HA VPN, and Direct Connect / Cloud Interconnect
  • Ability to troubleshoot network issues using AWS VPC flow logs and Cloudwatch metrics, as well as GCP VPC Flow Logs / Cloud Logging, alongside standard packet captures.
  • Experience working with Terraform, Ansible, Chef, Puppet or similar automation tools
  • Proficiency in Bash, Python, Golang, or similar. Experienced with git
  • Able to collaborate effectively with multiple stakeholders
  • Willingness to work on-call

And extra credit if you have experience in any of the following!

  • Experience working in a security-oriented cloud environment
  • Working knowledge of Palo Alto next-gen virtual firewalls, implementation of firewalls, as well as configuration of security policies, routing, and Global Protect.
  • Experience with GCP-specific advanced architecture like Shared VPC topologies, Cloud Router BGP configurations, and Network Connectivity Center (NCC).

Additional requirements

  • This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://job-boards.greenhouse.io/okta/jobs/7674807