Description
Join NVIDIA as a Solutions Architect to own the evolution of Agentic AI for the enterprise. You will collaborate with top-tier enterprise software companies to build and deploy sophisticated AI-native systems, focusing on multi-agent coordination, RAG-integrated workflows, and accelerated inference.
As a trusted advisor, you'll transform raw LLM capabilities into high-performance, industry-focused enterprise agents.
Key Responsibilities:
- Build complex agentic systems featuring multi-agent coordination, long-horizon reasoning, and advanced planning frameworks.
- Develop full-scale solutions, including domain-specific enterprise agents and high-performance retrieval pipelines (RAG) spanning various data sources.
- Optimize inference performance by bringing to bear GPU-accelerated frameworks and the full NVIDIA AI infrastructure stack.
- Build hands-on PoCs and reference architectures that serve as the blueprint for production-grade generative AI pipelines.
- Collaborate alongside Enterprise ISVs to integrate NVIDIA software into native platforms, accelerating the deployment of production workloads.
- Collaborate with diverse internal teams to improve NVIDIA software through feedback from real-world implementations.
- Empower partner engineering teams through technical workshops, deep-dive architecture reviews, and developer enablement.
- Scale global expertise by crafting reusable assets and documentation that help field teams deploy agentic AI at scale.
Requirements:
- BS/MS/PhD in Computer Science, Electrical Engineering, AI/ML, or equivalent experience.
- More than 5 years of experience in deep learning, machine learning, or distributed AI systems.
- Strong programming and debugging experience in Python, C/C++, and Linux environments.
- Background in using deep learning libraries like PyTorch or TensorFlow.
- Hands-on experience building LLM and generative AI applications.
- Experience working with agentic or multi-agent AI systems employing frameworks such as LangGraph, LlamaIndex, CrewAI, LangChain, or OpenAI Agents SDK.
- Experience building tool-using AI agents that interact with APIs, databases, and enterprise systems.
- Ability to rapidly prototype AI applications and build scalable GPU-accelerated architectures.
- Excellent interpersonal skills and the ability to collaborate with engineering teams, partners, and executive collaborators.
Nice to Have:
- Experience working with NVIDIA GPUs and AI software, such as NVIDIA NIM, NeMo Framework, NeMo Retriever, and NeMo Agent Toolkit.
- Experience with LLM evaluation frameworks, benchmarking systems, and safety guardrails for agentic workflows.
- Experience optimizing reasoning-focused LLMs through timely engineering, quantization, or benchmarking.
- Familiarity with Kubernetes/OpenShift, CI/CD automation, and cloud-native deployment patterns for AI systems.
- Experience with parallel or distributed computing environments and AI workloads optimized for GPUs.
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Solutions-Architect--Agentic-AI_JR2014517