Shruti Palaskar
Shruti Palaskar
Graduate Student at Carnegie Mellon University
在 cs.cmu.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
How2: a large-scale dataset for multimodal language understanding
R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ...
arXiv preprint arXiv:1811.00347, 2018
1122018
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking rosetta” JSALT 2017 workshop
O Scharenborg, L Besacier, A Black, M Hasegawa-Johnson, F Metze, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
32*2018
Multimodal abstractive summarization for how2 videos
S Palaskar, J Libovický, S Gella, F Metze
arXiv preprint arXiv:1906.07901, 2019
282019
Combining LSTM and latent topic modeling for mortality prediction
Y Jo, L Lee, S Palaskar
arXiv preprint arXiv:1709.02842, 2017
282017
Cmu sinbad’s submission for the dstc7 avsd challenge
R Sanabria, S Palaskar, F Metze
DSTC7 at AAAI2019 workshop 6, 2019
252019
End-to-end multimodal speech recognition
S Palaskar, R Sanabria, F Metze
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
252018
ASR error correction and domain adaptation using machine translation
A Mani, S Palaskar, NV Meripo, S Konam, F Metze
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
232020
Building an asr system for a low-resource language through the adaptation of a high-resource language asr system: Preliminary results
O Scharenborg, F Ciannella, S Palaskar, A Black, F Metze, L Ondel, ...
Proceedings of ICNLSSP, Casablanca, Morocco, 2017
222017
Acoustic-to-word recognition with sequence-to-sequence models
S Palaskar, F Metze
2018 IEEE Spoken Language Technology Workshop (SLT), 397-404, 2018
182018
Multimodal grounding for sequence-to-sequence speech recognition
O Caglayan, R Sanabria, S Palaskar, L Barraul, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
152019
Multimodal abstractive summarization for open-domain videos
J Libovický, S Palaskar, S Gella, F Metze
Proceedings of the Workshop on Visually Grounded Interaction and Language …, 2018
142018
Learned in speech recognition: Contextual acoustic word embeddings
S Palaskar, V Raunak, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
132019
Learning from multiview correlations in open-domain videos
N Holzenberger, S Palaskar, P Madhyastha, F Metze, R Arora
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
122019
Towards understanding ASR error correction for medical conversations
A Mani, S Palaskar, S Konam
Proceedings of the First Workshop on Natural Language Processing for Medical …, 2020
92020
How2Sign: a large-scale multimodal dataset for continuous American sign language
A Duarte, S Palaskar, L Ventura, D Ghadiyaram, K DeHaan, F Metze, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
32021
Transfer learning for multimodal dialog
S Palaskar, R Sanabria, F Metze
Computer Speech & Language 64, 101093, 2020
32020
Grounded Sequence to Sequence Transduction
L Specia, L Barrault, O Caglayan, A Duarte, D Elliott, S Gella, ...
IEEE journal of selected topics in signal processing 14 (3), 577-591, 2020
32020
Multimodal Speech Summarization Through Semantic Concept Learning}}
S Palaskar, R Salakhutdinov, AW Black, F Metze
Proc. Interspeech 2021, 791-795, 2021
12021
Speech Summarization using Restricted Self-Attention
R Sharma, S Palaskar, AW Black, F Metze
arXiv preprint arXiv:2110.06263, 2021
2021
Multimodal Learning from Videos
S Palaskar
2021
系统目前无法执行此操作,请稍后再试。
文章 1–20