# Senior DevOps Lead - Cloud & Autonomous System

**Company**: Cyngn
**Location**: Mountain View
**Work arrangement**: remote
**Experience**: senior
**Job type**: full-time
**Salary**: $198,000-$225,000 per year
**Category**: Engineering
**Industry**: Technology

**Apply**: https://jobs.lever.co/cyngn/1c31b7d8-cf85-472f-9358-1e10189cf815
**Canonical**: https://yubhub.co/jobs/job_8e582153-6af

## Description

About Cyngn

Cyngn is a publicly traded autonomous technology company that deploys self-driving industrial vehicles to factories, warehouses, and other facilities throughout North America.

We are a small company with fewer than 100 employees, operating with the energy of a startup. However, we're also publicly traded, which means our employees get access to the liquidity of our publicly traded equity.

As a Senior DevOps Lead at Cyngn, you will play a vital role in architecting and managing infrastructure across cloud and autonomous vehicle systems. This position combines traditional cloud DevOps leadership with specialized expertise in robotics and autonomous systems infrastructure.

Responsibilities

* Lead and architect cloud and vehicle infrastructure initiatives across AWS and ROS/Linux environments
* Design and implement scalable solutions for both cloud services and autonomous vehicle systems
* Establish and maintain DevOps best practices, CI/CD pipelines, and infrastructure as code
* Drive observability, monitoring, and incident response strategies
* Optimize performance and cost efficiency of cloud and edge computing resources
* Mentor team members and foster a developer-friendly environment
* Manage on-call rotations and incident response processes
* Architect solutions for processing and storing large-scale vehicle telemetry data
* Lead security initiatives and compliance efforts across infrastructure

Requirements

* 10+ years of relevant DevOps/Infrastructure experience
* Proven track record as a technical lead in platform or infrastructure teams
* Advanced expertise in AWS services, infrastructure as code (Terraform), and Kubernetes
* Strong experience with service mesh (Istio) and Helm/Kustomize
* Deep understanding of ROS/ROS2 and Linux kernel configurations
* Experience with GPU configurations and ML infrastructure
* Expertise in ARM and NVIDIA CUDA platform configurations
* Strong programming skills in Python and shell scripting
* Experience with infrastructure automation (Ansible)
* Expertise in CI/CD tools (Jenkins, GitHub Actions)
* Strong system architecture and design skills
* Excellence in technical documentation
* Outstanding problem-solving abilities
* Strong leadership and mentoring capabilities

Nice to haves

* Experience with autonomous vehicle systems
* Track record of optimizing GPU-based ML infrastructure
* Experience with large-scale IoT deployments
* Contributions to open-source projects
* Experience with real-time systems and low-latency requirements
* Expertise in security implementations including SSO, IdP, and AWS Cognito
* Experience with JFrog Artifactory and container registry management
* Proficiency in AWS IoT Greengrass
* Experience with container resource management on edge devices
* Understanding of CPU affinity and priority scheduling
* Track record of implementing cost optimization strategies
* Experience with scaling systems both horizontally and vertically

Benefits & Perks

* Health benefits (Medical, Dental, Vision, HSA and FSA (Health & Dependent Daycare), Employee Assistance Program, 1:1 Health Concierge)
* Life, short-term, and long-term disability insurance (Cyngn funds 100% of premiums)
* Company 401(k)
* Commuter Benefits
* Flexible vacation policy
* Sabbatical leave opportunity after five years with the company
* Paid Parental Leave
* Daily lunches for in-office employees
* Monthly meal and tech allowances for remote employees

## Skills

### Required
- AWS services
- infrastructure as code (Terraform)
- Kubernetes
- service mesh (Istio)
- Helm/Kustomize
- ROS/ROS2
- Linux kernel configurations
- GPU configurations
- ML infrastructure
- ARM
- NVIDIA CUDA platform configurations
- Python
- shell scripting
- infrastructure automation (Ansible)
- CI/CD tools (Jenkins, GitHub Actions)
- system architecture and design skills
- technical documentation
- problem-solving abilities
- leadership and mentoring capabilities

### Nice to have
- autonomous vehicle systems
- optimizing GPU-based ML infrastructure
- large-scale IoT deployments
- contributions to open-source projects
- real-time systems and low-latency requirements
- security implementations including SSO, IdP, and AWS Cognito
- JFrog Artifactory and container registry management
- AWS IoT Greengrass
- container resource management on edge devices
- CPU affinity and priority scheduling
- cost optimization strategies
- scaling systems both horizontally and vertically
