Fake Audio Detection in Resource-Constrained Settings Using Microfeatures. H Dhamyal, A Ali, IA Qazi, AA Raza Interspeech, 4149-4153, 2021 | 13 | 2021 |
Loft: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ... arXiv preprint arXiv:2310.04445, 2023 | 11 | 2023 |
Describing emotions with acoustic property prompts for speech emotion recognition H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh arXiv preprint arXiv:2211.07737, 2022 | 9 | 2022 |
The phonetic bases of vocal expressed emotion: natural versus acted H Dhamyal, SA Memon, B Raj, R Singh arXiv preprint arXiv:1911.05733, 2019 | 7 | 2019 |
Detecting gender differences in perception of emotion in crowdsourced data SA Memon, H Dhamyal, O Wright, D Justice, V Palat, W Boler, B Raj, ... arXiv preprint arXiv:1910.11386, 2019 | 5 | 2019 |
Using self attention dnns to discover phonemic features for audio deep fake detection H Dhamyal, A Ali, IA Qazi, AA Raza 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 4 | 2021 |
On the Evaluation of Speech Foundation Models for Spoken Language Understanding S Arora, A Pasad, CM Chien, J Han, R Sharma, J Jung, H Dhamyal, ... arXiv preprint arXiv:2406.10083, 2024 | 3 | 2024 |
Unifying the discrete and continuous emotion labels for speech emotion recognition R Sharma, H Dhamyal, B Raj, R Singh arXiv preprint arXiv:2210.16642, 2022 | 3 | 2022 |
Self-supervision and learnable strfs for age, emotion, and country prediction R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj arXiv preprint arXiv:2206.12568, 2022 | 3 | 2022 |
Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection H Dhamyal, B Raj, R Singh Proc. Interspeech 2022, 166-170, 2022 | 3 | 2022 |
Optimizing neural network embeddings using a pair-wise loss for text-independent speaker verification H Dhamyal, T Zhou, B Raj, R Singh 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 3 | 2019 |
Prompting Audios Using Acoustic Properties for Emotion Representation H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech D Alharthi, R Sharma, H Dhamyal, S Maiti, B Raj, R Singh arXiv preprint arXiv:2310.00706, 2023 | 2 | 2023 |
Masked proxy loss for text-independent speaker verification J Lian, AV Kumar, H Dhamyal, B Raj, R Singh arXiv preprint arXiv:2011.04491, 2020 | 2 | 2020 |
SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios H Bukhari, S Deshmukh, H Dhamyal, B Raj, R Singh arXiv preprint arXiv:2407.15300, 2024 | 1 | 2024 |
On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice A Shah, H Dhamyal, Y Gao, D Arancibia, M Arancibia, B Raj, R Singh arXiv preprint arXiv:2204.04802, 2022 | 1 | 2022 |
An overview of techniques for biomarker discovery in voice signal R Singh, A Shah, H Dhamyal arXiv preprint arXiv:2110.04678, 2021 | 1 | 2021 |
PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification M Baali, A Aldoobi, H Dhamyal, R Singh, B Raj arXiv preprint arXiv:2409.05799, 2024 | | 2024 |
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? R Sharma, S Shon, M Lindsey, H Dhamyal, R Singh, B Raj arXiv preprint arXiv:2408.07277, 2024 | | 2024 |
R-BASS: Relevance-aided Block-wise Adaptation for Speech Summarization R Sharma, R Sharma, H Dhamyal, R Singh, B Raj Findings of the Association for Computational Linguistics: NAACL 2024, 848-857, 2024 | | 2024 |