Multimodal routing: Improving local and global interpretability of multimodal language analysis YHH Tsai, MQ Ma, M Yang, R Salakhutdinov, LP Morency Proceedings of the Conference on Empirical Methods in Natural Language …, 2020 | 84 | 2020 |
M2lens: Visualizing and explaining multimodal models for sentiment analysis X Wang, J He, Z Jin, M Yang, Y Wang, H Qu IEEE Transactions on Visualization and Computer Graphics 28 (1), 802-812, 2021 | 67 | 2021 |
Self-supervised representation learning with relative predictive coding YHH Tsai, MQ Ma, M Yang, H Zhao, LP Morency, R Salakhutdinov ICLR 2021, 2021 | 36 | 2021 |
Improving lesion segmentation for diabetic retinopathy using adversarial learning Q Xiao, J Zou, M Yang, A Gaudio, K Kitani, A Smailagic, P Costa, M Xu International Conference on Image Analysis and Recognition, 333-344, 2019 | 35 | 2019 |
Complex transformer: A framework for modeling complex-valued sequence M Yang, MQ Ma, D Li, YHH Tsai, R Salakhutdinov ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 32 | 2020 |
Online Continual Learning of End-to-End Speech Recognition Models M Yang, I Lane, S Watanabe Interspeech 2022, 2022 | 20 | 2022 |
Signal transformer: Complex-valued attention and meta-learning for signal recognition Y Peng, Y Dong, M Yang, S Lu, Q Shi ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 8 | 2024 |
Improving Speech Enhancement through Fine-Grained Speech Characteristics M Yang, J Konan, D Bick, A Kumar, S Watanabe, B Raj Interspeech 2022, 2022 | 8 | 2022 |
Paaploss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement M Yang, J Konan, D Bick, Y Zeng, S Han, A Kumar, S Watanabe, B Raj ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Towards noise-tolerant speech-referring video object segmentation: Bridging speech and text X Li, J Wang, X Xu, M Yang, F Yang, Y Zhao, R Singh, B Raj Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 5 | 2023 |
Backdoor attacks with input-unique triggers in nlp X Zhou, J Li, T Zhang, L Lyu, M Yang, J He arXiv preprint arXiv:2303.14325, 2023 | 5 | 2023 |
Simulating realistic speech overlaps improves multi-talker ASR M Yang, N Kanda, X Wang, J Wu, S Sivasankaran, Z Chen, J Li, ... ICASSP 2023, 2022 | 5 | 2022 |
Storing and querying large-scale spatio-temporal graphs with high-throughput edge insertions M Ding, M Yang, S Chen arXiv preprint arXiv:1904.09610, 2019 | 4 | 2019 |
Sequence-level knowledge distillation for class-incremental end-to-end spoken language understanding U Cappellazzo, M Yang, D Falavigna, A Brutti arXiv preprint arXiv:2305.13899, 2023 | 3 | 2023 |
Rethinking Voice-Face Correlation: A Geometry View X Li, Y Wen, M Yang, J Wang, R Singh, B Raj Proceedings of the 31st ACM International Conference on Multimedia, 2458-2467, 2023 | 2 | 2023 |
Taploss: A temporal acoustic parameter loss for speech enhancement BR Y Zeng, J Konan, S Han, D Bick, M Yang, A Kumar, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2* | 2023 |
uSee: Unified Speech Enhancement And Editing with Conditional Diffusion Models M Yang, C Zhang, Y Xu, Z Xu, H Wang, B Raj, D Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Continual Contrastive Spoken Language Understanding U Cappellazzo, E Fini, M Yang, D Falavigna, A Brutti, B Raj arXiv preprint arXiv:2310.02699, 2023 | 1 | 2023 |
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions H Wang, M Yu, H Zhang, C Zhang, Z Xu, M Yang, Y Zhang, D Yu arXiv preprint arXiv:2309.09028, 2023 | 1 | 2023 |
Combining Programmable Potentials and Neural Networks for Materials Problems. R Mohr, AM Avila, S Ghosh, A Bhattarai, M Yang, X Feng, M Head-Gordon, ... AAAI Spring Symposium: MLPS, 2021 | 1 | 2021 |