Follow
Roshan Sharma
Roshan Sharma
Research Scientist, Google
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
End-to-end speech summarization using restricted self-attention
R Sharma, S Palaskar, AW Black, F Metze
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
24*2022
SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks
S Shon, S Arora, CJ Lin, A Pasad, F Wu, R Sharma, WL Wu, HY Lee, ...
arXiv preprint arXiv:2212.10525, 2022
162022
Speech recognition in Kannada using HTK and julius: a comparative study
RS Sharma, SH Paladugu, KJ Priya, D Gupta
2019 international conference on communication and signal processing (iccsp …, 2019
142019
A summary of the first workshop on language technology for language documentation and revitalization
G Neubig, S Rijhwani, A Palmer, J MacKenzie, H Cruz, X Li, M Lee, ...
arXiv preprint arXiv:2004.13203, 2020
132020
Reproducing whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
112023
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
72024
Loft: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model
MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ...
arXiv preprint arXiv:2310.04445, 2023
72023
Speech summarization of long spoken document: Improving memory efficiency of speech/text encoders
T Kano, A Ogawa, M Delcroix, R Sharma, K Matsuura, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
52023
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Xnor-former: Learning accurate approximations in long speech transformers
R Sharma, B Raj
arXiv preprint arXiv:2210.16643, 2022
42022
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction
R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj
Proceedings of the 39th International Conference on Machine Learning 2022 …, 2022
32022
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network
S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ...
arXiv preprint arXiv:2310.02973, 2023
22023
BASS: Block-wise Adaptation for Speech Summarization
R Sharma, S Arora, K Zheng, S Watanabe, R Singh, B Raj
Proc. INTERSPEECH 2023, 1454--1458, 2023
22023
Unifying the discrete and continuous emotion labels for speech emotion recognition
R Sharma, H Dhamyal, B Raj, R Singh
arXiv preprint arXiv:2210.16642, 2022
22022
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems
R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
12023
Evaluating speech synthesis by training recognizers on synthetic speech
D Alharthi, R Sharma, H Dhamyal, S Maiti, B Raj, R Singh
arXiv preprint arXiv:2310.00706, 2023
12023
Augmenting text for spoken language understanding with Large Language Models
R Sharma, S Kim, D Lazar, T Le, A Shrivastava, K Ahn, P Kansal, L Sari, ...
arXiv preprint arXiv:2309.09390, 2023
12023
Egocentric audio-visual noise suppression
R Sharma, W He, J Lin, E Lakomkin, Y Liu, K Kalgaonkar
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
12023
Cross-utterance context for multimodal video transcription
R Sharma, B Raj
2022 56th Asilomar Conference on Signals, Systems, and Computers, 1321-1325, 2022
12022
End-to-End Modeling for Abstractive Speech Summarization
R Sharma
Carnegie Mellon University, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20