Publications
LIMMITS’24: Multi-speaker, Multi-lingual Indic TTS with voice cloning
Abhayjeet Singh, Amala Nagireddi, G Deekshitha, Jesuraja Bandekar, R Roopa, Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich
IEEE ICASSP Workshops, 2024 [link] [speech synthesis]
A machine‐learning tool to identify bistable states from calcium imaging data
Aalok Varma, Sathvik Udupa, Mohini Sengupta, Prasanta Kumar Ghosh, Vatsala Thirumalai
The Journal of Physiology, 2024 [link] [machine learning]
Lightweight, Multi-speaker, Multi-lingual Indic Text-To-Speech
Abhayjeet Singh, Amala Nagireddi, Anjali Jayakumar, G Deekshitha, Jesuraja Bandekar, R Roopa, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, Hema A Murthy, Heiga Zen, Pranaw Kumar, Kamal Kant, Amol Bole, Bira Chandra Singh, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich
IEEE Open Journal of Signal Processing, 2024 [link] [speech synthesis]
Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models
Sathvik Udupa, Jesuraj Bandekar, Saurabh Kumar, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Raoul Nanavati, Prasanta Kumar Ghosh
INTERSPEECH 2024 [link] [speech recognition]
IndicMOS: Multilingual MOS Prediction for 7 Indian languages
Sathvik Udupa, Soumi Maiti, Prasanta Kumar Ghosh
INTERSPEECH 2024 [link] [speech quality estimation]
Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss
Jesuraj Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh
INTERSPEECH 2024 [link] [speech production]
Gated Multi Encoders and Multitask Objectives for Dialectal Speech Recognition in Indian Languages
Sathvik Udupa, Jesuraja Bandekar, G Deekshitha, Saurabh Kumar, Prasanta Kumar Ghosh, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Raoul Nanavati
IEEE ASRU 2023 [link] [speech recognition]
Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models
Sathvik Udupa, C Siddarth, Prasanta Kumar Ghosh
IEEE ICASSP 2023 [link] [speech production]
Real-Time MRI Video synthesis from time aligned phonemes with sequence-to-sequence networks
Sathvik Udupa, Prasanta Kumar Ghosh
IEEE ICASSP 2023 [link] [speech production]
Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion
Jesuraj Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh
INTERSPEECH 2023 [link] [speech production]
Streaming model for Acoustic to Articulatory Inversion with transformer networks
Sathvik Udupa, Aravind Illa, Prasanta Kumar Ghosh
INTERSPEECH 2022 [link] [speech production]
Estimating articulatory movements in speech production with transformer networks
Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh
INTERSPEECH 2021 [link] [speech production]