Roshan Sharma

Cited by

	All	Since 2019
Citations	119	119
h-index	7	7
i10-index	5	5

202020212022202320244 4 18 45 48

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Bhiksha RajCarnegie Mellon UniversityVerified email at cs.cmu.edu
Siddhant AroraGraduate Student, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Jee-weon JungCarnegie Mellon UniversityVerified email at ieee.org
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Hira DhamyalCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Soumi MaitiCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Yifan PengCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Xinjian LiGoogleVerified email at google.com
William ChenCarnegie Mellon UniversityVerified email at cmu.edu
Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Shruti PalaskarAppleVerified email at apple.com
Alan W BlackProfessor, Language Technologies Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Hung-yi LeeNational Taiwan UniversityVerified email at ntu.edu.tw
Xuankai ChangCarnegie Mellon University, StudentVerified email at andrew.cmu.edu
Dareen AlharthiResearcher, carnegie mellon universityVerified email at andrew.cmu.edu
Felix WuCharacter AIVerified email at character.ai
Ankita PasadToyota Technological Institute at ChicagoVerified email at ttic.edu
Karen LivescuTTI-ChicagoVerified email at ttic.edu
Suwon ShonASAPPVerified email at csail.mit.edu

Roshan Sharma

Research Scientist, Google

Verified email at google.com - Homepage

Speech Recognition Speech Processing Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
End-to-end speech summarization using restricted self-attention R Sharma, S Palaskar, AW Black, F Metze ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	24*	2022
SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks S Shon, S Arora, CJ Lin, A Pasad, F Wu, R Sharma, WL Wu, HY Lee, ... arXiv preprint arXiv:2212.10525, 2022	16	2022
Speech recognition in Kannada using HTK and julius: a comparative study RS Sharma, SH Paladugu, KJ Priya, D Gupta 2019 international conference on communication and signal processing (iccsp …, 2019	14	2019
A summary of the first workshop on language technology for language documentation and revitalization G Neubig, S Rijhwani, A Palmer, J MacKenzie, H Cruz, X Li, M Lee, ... arXiv preprint arXiv:2004.13203, 2020	13	2020
Reproducing whisper-style training using an open-source toolkit and publicly available data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	11	2023
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	7	2024
Loft: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ... arXiv preprint arXiv:2310.04445, 2023	7	2023
Speech summarization of long spoken document: Improving memory efficiency of speech/text encoders T Kano, A Ogawa, M Delcroix, R Sharma, K Matsuura, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	5	2023
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	4	2024
Xnor-former: Learning accurate approximations in long speech transformers R Sharma, B Raj arXiv preprint arXiv:2210.16643, 2022	4	2022
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj Proceedings of the 39th International Conference on Machine Learning 2022 …, 2022	3	2022
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ... arXiv preprint arXiv:2310.02973, 2023	2	2023
BASS: Block-wise Adaptation for Speech Summarization R Sharma, S Arora, K Zheng, S Watanabe, R Singh, B Raj Proc. INTERSPEECH 2023, 1454--1458, 2023	2	2023
Unifying the discrete and continuous emotion labels for speech emotion recognition R Sharma, H Dhamyal, B Raj, R Singh arXiv preprint arXiv:2210.16642, 2022	2	2022
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	1	2023
Evaluating speech synthesis by training recognizers on synthetic speech D Alharthi, R Sharma, H Dhamyal, S Maiti, B Raj, R Singh arXiv preprint arXiv:2310.00706, 2023	1	2023
Augmenting text for spoken language understanding with Large Language Models R Sharma, S Kim, D Lazar, T Le, A Shrivastava, K Ahn, P Kansal, L Sari, ... arXiv preprint arXiv:2309.09390, 2023	1	2023
Egocentric audio-visual noise suppression R Sharma, W He, J Lin, E Lakomkin, Y Liu, K Kalgaonkar ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	1	2023
Cross-utterance context for multimodal video transcription R Sharma, B Raj 2022 56th Asilomar Conference on Signals, Systems, and Computers, 1321-1325, 2022	1	2022
End-to-End Modeling for Abstractive Speech Summarization R Sharma Carnegie Mellon University, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors