Description
Yuno is seeking an experienced Infrastructure Engineer to own and evolve the infrastructure of its AI agent platform, which provisions, deploys, and manages AI agents at scale on AWS.
The platform is in production and growing, and the ideal candidate will drive architectural decisions, design event-driven communication, improve streaming reliability, build observability, and shape the platform's infrastructure as it grows.
Key responsibilities include:
- Designing and implementing messaging layers for inter-service communication using event-driven architecture
- Owning and automating cloud infrastructure with Infrastructure as Code (IaC)
- Building monitoring, tracing, and alerting systems for platform health
- Evaluating and driving architectural decisions as the platform matures
The successful candidate will have experience with:
- Event-driven architecture and messaging systems (e.g., Kafka, NATS, RabbitMQ)
- AWS (EC2, VPC, IAM, S3, RDS)
- Databases (SQL and NoSQL, e.g., PostgreSQL, MongoDB, Redis)
- Docker and container lifecycle management
- Distributed systems debugging and observability tools (e.g., Datadog)
- Infrastructure as Code (Terraform or Pulumi)
Nice to have skills include experience with AI/MLOps infrastructure, multi-tenant container platforms, Kubernetes, and data pipelines and orchestration.
In return, Yuno offers competitive compensation, remote work options, a home office bonus, stock options, health plans, flexible days off, and opportunities for language, professional, and personal growth.