Description

About the Role:

As a Backend Engineer - API at xAI, you will build the xAI API that serves our models to developers worldwide. You will own the end-to-end system responsible for high-throughput inference, handling billions of tokens per minute with low latency and high availability. This includes model serving infrastructure, request routing, SDK development, rate limiting, observability, and efficient scaling.

Responsibilities:

Build the xAI API that serves our models to developers worldwide
Own the end-to-end system responsible for high-throughput inference, handling billions of tokens per minute with low latency and high availability, including model serving infrastructure, request routing, SDK development, rate limiting, observability, and efficient scaling

Basic Qualifications:

Expert knowledge of either Rust or C++
Experience in designing, implementing, and maintaining reliable and horizontally scalable distributed systems
Knowledge of service observability and reliability best practices
Experience in operating commonly used databases such as PostgreSQL, Clickhouse, and MongoDB

Preferred Skills and Experience:

Experience with LLM inference engines and serving frameworks (e.g., SGLang, TensorRT, vLLM)
Experience designing or building with agent SDKs and agent orchestration frameworks
Experience with Docker, Kubernetes, and containerized applications
Expert knowledge of gRPC (unary, response streaming, bi-directional streaming, REST mapping)

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://job-boards.greenhouse.io/xai/jobs/5120536007