Description
We are looking for a Solutions Architect with experience in GenAI (LLM) model building. This is a highly technical role that requires deep expertise in generative AI, large language models (LLMs), and scalable software engineering practices.
A Solutions Architect is the first line of technical expertise between NVIDIA and our customers and partners. Your duties will range from solution design, training and workshops, and troubleshooting to project coordination, industry and marketing speaking engagements, customer relationship management, and more. You will primarily support Singapore, but will need to support the wider South East Asia region if required.
Key responsibilities include:
- Serve as an expert who helps customers customize GenAI (LLM) models and optimize training performance at scale.
- Lead workshops and training sessions on NVIDIA's technologies.
- Closely partner with other Solutions Architects, engineering, product and business teams at NVIDIA to build GenAI full stack solutions for industry vertical and enterprise use cases.
- Work with business managers to craft the vision and actionable, effective strategies for the group.
- Inspire industry leaders by articulating the business value of the state of the art in Generative AI.
Requirements include:
- BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)
- 8+ years of overall work-related experience in deep learning, data science, or software development, with knowledge of parallel computing with GPUs and a specific focus on generative AI, with emphasis on training Large Language Models (LLMs) at scale.
- Clear written and oral communication skills, with the ability to collaborate with management and engineering and to share knowledge with clients, partners, and co-workers.
- Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.
- Professional or native language proficiency in English and Mandarin.
- Ability to travel up to 30% of the time to support customers in South East Asia and beyond.
Preferred qualifications include:
- Hands-on experience with NVIDIA's NeMo SDKs or Megatron.
- Demonstrable ability to customize LLMs with new capabilities and to optimize them for training speed, memory efficiency, and resource utilization.
- Familiarity with containerization technologies (e.g., Docker or enroot/pyxis) and orchestration tools (e.g., Slurm or Kubernetes) for scalable and efficient model building.