publications | Saurabhchand Bhati

journal articles

Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding

Bhati, Saurabhchand, Villalba, Jesús, Żelasko, Piotr, Moro-Velazquez, Laureano, and Dehak, Najim

IEEE/ACM Transactions on Audio, Speech, and Language Processing 2022

Regularizing Contrastive Predictive Coding for speech applications

Bhati, Saurabhchand, Villalba, Jesús, Żelasko, Piotr, Moro-Velazquez, Laureano, and Dehak, Najim

In submission to IEEE/ACM TASLP 2023

Discovering phonetic inventories with crosslingual automatic speech recognition

Żelasko, Piotr, Feng, Siyuan, Moro-Velázquez, Laureano, Abavisani, Ali, Bhati, Saurabhchand, Scharenborg, Odette, Hasegawa-Johnson, Mark, and Dehak, Najim

Computer Speech & Language 2022
Unsupervised speech signal-to-symbol transformation for language identification

Bhati, Saurabhchand, Nayak, Shekhar, and Kodukula, Sri Rama Murty

Circuits, Systems, and Signal Processing 2020

conference articles & others

Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning

Bhati, Saurabhchand, Villalba, Jesús, Moro-Velazquez, Laureano, Thebaud, Thomas, and Dehak, Najim

In submission 2023

Virtual phone discovery for speech synthesis without text

Nayak, Shekhar, Kumar, C Shiva, Ramesh, G, Bhati, Saurabhchand, and Murty, K Sri Rama

In 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
Unsupervised segmentation of speech signals using kernel-gram matrices

Bhati, Saurabhchand, Nayak, Shekhar, and Murty, K Sri Rama

In National Conference on Computer Vision, Pattern Recognition, Image Processing, and Graphics 2017
Modeling sparse spatio-temporal representations for no-reference video quality assessment

Shabeer, P Muhammed, Bhati, Saurabhchand, and Channappayya, Sumohana S

In 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2017
Zero resource speaking rate estimation from change point detection of syllable-like units

Nayak, Shekhar, Bhati, Saurabhchand, and Murty, K Sri Rama

In 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
LSTM Siamese network for Parkinson’s disease detection from speech

Bhati, Saurabhchand, Moro-Velazquez, Laureano, Villalba, Jesús, and Dehak, Najim

In 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
Speaker Embedding Extraction with Virtual Phonetic Information

Sreekanth, S, Murty, K Sri Rama, Bhati, Saurabhchand, and others

In 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
Unsupervised Acoustic Segmentation and Clustering Using Siamese Network Embeddings

Bhati, Saurabhchand, Nayak, Shekhar, Murty, K Sri Rama, and Dehak, Najim

In INTERSPEECH 2019
Segmental contrastive predictive coding for unsupervised word segmentation

Bhati, Saurabhchand, Villalba, Jesús, Żelasko, Piotr, Moro-Velazquez, Laureano, and Dehak, Najim

In INTERSPEECH 2021

Unsupervised Speech Signal to Symbol Transformation for Zero Resource Speech Applications

Bhati, Saurabhchand, Nayak, Shekhar, and Murty, K Sri Rama

In INTERSPEECH 2017
Self-expressing autoencoders for unsupervised spoken term discovery

Bhati, Saurabhchand, Villalba, Jesús, Żelasko, Piotr, and Dehak, Najim

In INTERSPEECH 2020
Bottom-Up Unsupervised Word Discovery via Acoustic Units

Bhati, Saurabhchand, Liu, Chunxi, Villalba, Jesús, Trmal, Jan, Khudanpur, Sanjeev, and Dehak, Najim

In 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
An investigation into instantaneous frequency estimation methods for improved speech recognition features

Nayak, Shekhar, Bhati, Saurabhchand, and Murty, K Sri Rama

In 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2017
Instantaneous frequency features for noise robust speech recognition

Nayak, Shekhar, Shashank, Dhar B, Bhati, Saurabhchand, Bramhendra, Koilakuntla, and Murty, K Sri Rama

In 2019 National Conference on Communications (NCC) 2019
Phoneme based embedded segmental k-means for unsupervised term discovery

Bhati, Saurabhchand, Kamper, Herman, and Murty, K Sri Rama

In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018