Description
Elastic is building Agent Builder, a conversational platform that connects production agents to real customer business data in Elasticsearch. As a Principal Engineer, you will set technical direction and drive the Kibana backend architecture for the agentic platform: streaming APIs, secure tool execution, session and memory persistence, retrieval and citations contracts, and evaluation telemetry.
Your influence will extend beyond a single feature, shaping service boundaries, reliability posture, and standards that other solutions build on.
Responsibilities
- Own the architecture for chat back-end services (Node/TypeScript), defining service boundaries, data contracts, and scalability targets
- Lead cross-team design reviews; author ADRs and RFCs that become reference standards for AI-chat and ingestion work
- Build and harden event-driven pipelines that capture chat telemetry, evaluation traces, and LLM feedback loops; expose them via self-service analytics endpoints
- Champion reliability,define error budgets, introduce testing strategy, and steer incident-response playbooks for conversational workloads
- Mentor senior and Junior engineers; grow their system-design skills and foster a high-trust, low-ego culture
- Partner with Product, Design, and Data Science to translate ambiguous goals (e.g., “multi-step reasoning with tool calling”) into incremental, testable action items
- Represent Elastic in open-source AI communities (LangGraph/LangChain, MCP/A2A) through design proposals, blog posts, and conference talks
What You Bring
- 10 + years building distributed, production SaaS services,at least 5 years leading large-scale Node/TypeScript or similar back-end stacks
- Deep expertise in distributed systems fundamentals,shard routing, consensus, eventual consistency, back-pressure, and circuit-breaker patterns
- Demonstrated success designing high-throughput, low-latency APIs (gRPC / REST / WebSocket),including streaming responses and resumable sessions
- Hands-on experience with observability: OpenTelemetry, log/metric pipelines, synthetic checks, and SLO dashboards
- Exposure to LLM tooling (LangChain/LangGraph, OpenAI function calls, vector-search, RAG orchestration) and enthusiasm for advancing GenAI architectures
- Clear, persuasive written communication,your ADRs and RFCs set the standard others emulate
Benefits
- Competitive pay based on the work you do here and not your previous salary
- Health coverage for you and your family in many locations
- Ability to craft your calendar with flexible locations and schedules for many roles
- Generous number of vacation days each year
- Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service
- Up to 40 hours each year to use toward volunteer projects you love
- Embracing parenthood with minimum of 16 weeks of parental leave
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://job-boards.greenhouse.io/elastic/jobs/7815865