# Senior Deep Learning Scientist, Speech Synthesis

**Company**: NVIDIA
**Location**: Ho Chi Minh City
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Technology

**Apply**: https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/Vietnam-Ho-Chi-Minh-City/Senior-Deep-Learning-Scientist--Speech-Synthesis_JR2016166?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_a535c03f-ff5

## Description

We are looking for a Senior Deep Learning Scientist to develop the high-impact, high-visibility Speech AI product Riva and improve the experience of millions of customers.

As a member of the Riva Product Engineering team, you will train Speech Synthesis mel-spectrogram and vocoder models; measure, benchmark, and analyze model performance, accuracy, and bias; and recommend improvements. You will also maintain the TTS model evaluation system and characterize quality metrics across platforms.

Your responsibilities will include improving processes for speech data processing, augmentation, filtering, and TTS training set preparation; building knowledge of TTS datasets for training and evaluation; collaborating with cross-functional teams on new features, improvements, and issue triage; and participating in code reviews, design reviews, use case reviews, and test plan reviews.

To succeed in this role, you will need a Master's degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, AI, Applied Math, Linguistics, or Computational Linguistics; 5+ years of experience in machine learning and AI model development; strong Python programming skills; and strong knowledge of ML/DL techniques and tools, including CNNs, RNNs/LSTMs, and Transformers.

Experience with PyTorch and familiarity with DSP and feature extraction techniques (FFT, MFCC, Mel spectrograms) are also required, as are experience with Git, Gerrit, or GitLab and strong collaboration skills.

## Skills

### Required
- Python
- PyTorch
- CNNs
- RNNs/LSTMs
- Transformers
- DSP
- feature extraction
- Git
- Gerrit
- GitLab

---

Source: [Apply at nvidia.wd5.myworkdayjobs.com](https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/Vietnam-Ho-Chi-Minh-City/Senior-Deep-Learning-Scientist--Speech-Synthesis_JR2016166?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply)
