Description
About the Role:
As a Backend Engineer - API at xAI, you will build the xAI API that serves our models to developers worldwide. You will own the end-to-end system responsible for high-throughput inference, handling billions of tokens per minute with low latency and high availability. This includes model serving infrastructure, request routing, SDK development, rate limiting, observability, and efficient scaling.
Responsibilities:
- Build the xAI API that serves our models to developers worldwide
- Own the end-to-end system responsible for high-throughput inference, handling billions of tokens per minute with low latency and high availability, including model serving infrastructure, request routing, SDK development, rate limiting, observability, and efficient scaling
Basic Qualifications:
- Expert knowledge of either Rust or C++
- Experience in designing, implementing, and maintaining reliable and horizontally scalable distributed systems
- Knowledge of service observability and reliability best practices
- Experience in operating commonly used databases such as PostgreSQL, Clickhouse, and MongoDB
Preferred Skills and Experience:
- Experience with LLM inference engines and serving frameworks (e.g., SGLang, TensorRT, vLLM)
- Experience designing or building with agent SDKs and agent orchestration frameworks
- Experience with Docker, Kubernetes, and containerized applications
- Expert knowledge of gRPC (unary, response streaming, bi-directional streaming, REST mapping)
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://job-boards.greenhouse.io/xai/jobs/5120536007