New The Skills of Tomorrow: how AI-exposed is every skill in 2026? See the data →
NVIDIA

PhD Research Intern, Generative AI

NVIDIA
Apply →
entry internship $30 - $94 per hour Santa Clara

First indexed 28 May 2026

Description

We are building Cosmos world foundation models and generative AI systems for Physical AI across robotics, autonomous driving, smart spaces, and embodied agents. The NVIDIA Cosmos Platform enables multimodal world understanding, simulation, synthetic data generation, and embodied reasoning. We are looking for outstanding PhD interns to help advance the frontier of Physical AI and world models.

Responsibilities

  • Conduct research in generative AI, multimodal foundation models, world models, and embodied AI.
  • Develop algorithms for video understanding/generation, action-conditioned simulation, multimodal reasoning, and policy learning.
  • Train and evaluate large-scale models using video, image, language, and robotics or autonomous driving data.
  • Collaborate with researchers and engineers across AI, robotics, simulation, and graphics teams.
  • Publish research at top conferences and transfer innovations into NVIDIA products.

Requirements

  • Currently pursuing a PhD in CS, EE, Robotics, or related fields.
  • Strong background in generative AI, computer vision, multimodal learning, robotics, or reinforcement learning.
  • Prior publication record and research experience.
  • Strong Python and PyTorch skills.

Ways to Stand Out

  • Experience with large-scale foundation model training.
  • Research in video models, VLMs, world models, robotics, or autonomous driving.
  • Experience with distributed training, simulation, or embodied AI.