Publications

LIMMITS’24: Multi-speaker, Multi-lingual Indic TTS with voice cloning

Abhayjeet Singh, Amala Nagireddi, G Deekshitha, Jesuraja Bandekar, R Roopa, Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich

IEEE ICASSP Workshops, 2024 [link] [speech synthesis]


A machine‐learning tool to identify bistable states from calcium imaging data

Aalok Varma, Sathvik Udupa, Mohini Sengupta, Prasanta Kumar Ghosh, Vatsala Thirumalai

The Journal of Physiology, 2024 [link] [machine learning]


Lightweight, Multi-speaker, Multi-lingual Indic Text-To-Speech

Abhayjeet Singh, Amala Nagireddi, Anjali Jayakumar, G Deekshitha, Jesuraja Bandekar, R Roopa, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, Hema A Murthy, Heiga Zen, Pranaw Kumar, Kamal Kant, Amol Bole, Bira Chandra Singh, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich

IEEE Open Journal of Signal Processing, 2024 [link] [speech synthesis]


Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models

Sathvik Udupa, Jesuraj Bandekar, Saurabh Kumar, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Raoul Nanavati, Prasanta Kumar Ghosh

INTERSPEECH 2024 [link] [speech recognition]


IndicMOS: Multilingual MOS Prediction for 7 Indian languages

Sathvik Udupa, Soumi Maiti, Prasanta Kumar Ghosh

INTERSPEECH 2024 [link] [speech quality estimation]


Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss

Jesuraj Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh

INTERSPEECH 2024 [link] [speech production]


Gated Multi Encoders and Multitask Objectives for Dialectal Speech Recognition in Indian Languages

Sathvik Udupa, Jesuraja Bandekar, G Deekshitha, Saurabh Kumar, Prasanta Kumar Ghosh, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan, Raoul Nanavati

IEEE ASRU 2023 [link] [speech recognition]


Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models

Sathvik Udupa, C Siddarth, Prasanta Kumar Ghosh

IEEE ICASSP 2023 [link] [speech production]


Real-Time MRI Video synthesis from time aligned phonemes with sequence-to-sequence networks

Sathvik Udupa, Prasanta Kumar Ghosh

IEEE ICASSP 2023 [link] [speech production]


Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion

Jesuraj Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh

INTERSPEECH 2023 [link] [speech production]


Streaming model for Acoustic to Articulatory Inversion with transformer networks

Sathvik Udupa, Aravind Illa, Prasanta Kumar Ghosh

INTERSPEECH 2022 [link] [speech production]


Estimating articulatory movements in speech production with transformer networks

Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh

INTERSPEECH 2021 [link] [speech production]