publications
journal articles
- Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive CodingIEEE/ACM Transactions on Audio, Speech, and Language Processing 2022
- Regularizing Contrastive Predictive Coding for speech applicationsIn submission to IEEE/ACM TASLP 2023
- Discovering phonetic inventories with crosslingual automatic speech recognitionComputer Speech & Language 2022
- Unsupervised speech signal-to-symbol transformation for language identificationCircuits, Systems, and Signal Processing 2020
conference articles & others
- Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual LearningIn submission 2023
- Virtual phone discovery for speech synthesis without textIn 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
- Unsupervised segmentation of speech signals using kernel-gram matricesIn National Conference on Computer Vision, Pattern Recognition, Image Processing, and Graphics 2017
- Modeling sparse spatio-temporal representations for no-reference video quality assessmentIn 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2017
- Zero resource speaking rate estimation from change point detection of syllable-like unitsIn ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
- LSTM Siamese network for Parkinson’s disease detection from speechIn 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
- Speaker Embedding Extraction with Virtual Phonetic InformationIn 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
- Unsupervised Acoustic Segmentation and Clustering Using Siamese Network Embeddings.In INTERSPEECH 2019
-
- Unsupervised Speech Signal to Symbol Transformation for Zero Resource Speech Applications.In INTERSPEECH 2017
- Self-expressing autoencoders for unsupervised spoken term discoveryIn INTERSPEECH 2020
- Bottom-Up Unsupervised Word Discovery via Acoustic UnitsIn 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
- An investigation into instantaneous frequency estimation methods for improved speech recognition featuresIn 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2017
- Instantaneous frequency features for noise robust speech recognitionIn 2019 National Conference on Communications (NCC) 2019
- Phoneme based embedded segmental k-means for unsupervised term discoveryIn 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018