Description

At NVIDIA, we are closing the "embodiment gap." We don't just build robots; we build digital and physical nervous systems that allow humans to teach robots. You will lead the development of DexUMI (Dexterous Universal Manipulation Interface), a framework that leverages human-worn hardware and advanced computer vision to transfer complex skills from human hands to robotic actuators.

This is a True Full-Stack role in Solutions Architecture Team: you will touch everything from the tactile sensor firmware on a wearable exoskeleton to the cloud-based data pipelines that train our diffusion policies.

Responsibilities

Hardware-Software Co-Design: Maintain and iterate on the DexUMI wearable exoskeleton. You will bridge the kinematics gap between human hands and diverse robot end-effectors (e.g., XHand, Inspire Hand).
Sensor Fusion & Integration: Integrate high-fidelity tactile sensors and IMUs into wearable interfaces. Ensure low-latency data streaming.
Vision & Perception Pipelines: Implement and optimize the "software adaptation" layer,using tools to segment human operators out of training data and robot embodiments.
Data Engineering for AI: Build robust pipelines to collect, clean, and replay dexterous manipulation data for Imitation Learning and Diffusion Policies.
Optimization: Solve bi-level optimization problems to parameterize exoskeleton designs that maximize human wearability while preserving robot-equivalent fingertip workspaces.

Requirements

A MS/PhD in Robotics, Machine Learning, Computer Science, Electrical Engineering, Mechanical Engineering, or a related field (or equivalent experience) with at least 1 year's research and engineering experience.
Proficiency in C/C++ for embedded systems and ROS2, with hands-on experience in tactile sensing, force-feedback (haptics), and motor control.
Experience with CAD (SolidWorks/Fusion360) and rapid prototyping, including 3D printing and PCB design.
Expertise in Python and deep learning frameworks (PyTorch), with familiarity in computer vision (CV) models.
Understanding of Imitation Learning or Reinforcement Learning (RL) is a strong plus.
Experience with Record3D or iPhone-based spatial tracking, enabling integration between perception and physical systems.
Experience working in Ubuntu/Linux environments with high-performance data serialization tools such as Protobuf and MQTT.

This listing is enriched and indexed by YubHub. To apply, use the employer's original posting: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/China-Shanghai/Full-Stack-Solution-Engineer---Sensorized-Human_JR2018291