How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval SC Lin, A Asai, M Li, B Oguz, J Lin, Y Mehdad, W Yih, X Chen Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023 | 62 | 2023 |
Simple and effective unsupervised redundancy elimination to compress dense vectors for passage retrieval X Ma, M Li, K Sun, J Xin, J Lin Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 26 | 2021 |
Aggretriever: A simple approach to aggregate textual representations for robust dense passage retrieval SC Lin, M Li, J Lin Transactions of the Association for Computational Linguistics 11, 436-452, 2023 | 24 | 2023 |
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval M Li, SC Lin, B Oguz, A Ghoshal, J Lin, Y Mehdad, W Yih, X Chen Proceedings of the 61st Annual Meeting of the Association for Computational …, 2022 | 22 | 2022 |
Another look at DPR: reproduction of training and replication of retrieval X Ma, K Sun, R Pradeep, M Li, J Lin European Conference on Information Retrieval, 613-626, 2022 | 22 | 2022 |
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes M Li, SC Lin, X Ma, J Lin Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023 | 16 | 2023 |
Multi-task dense retrieval via model uncertainty fusion for open-domain question answering M Li, M Li, K Xiong, J Lin Findings of the Association for Computational Linguistics: EMNLP 2021, 274-287, 2021 | 13 | 2021 |
Query expansion using contextual clue sampling with language models L Liu, M Li, J Lin, S Riedel, P Stenetorp arXiv preprint arXiv:2210.07093, 2022 | 12 | 2022 |
Certified error control of candidate set pruning for two-stage relevance ranking M Li, X Zhang, J Xin, H Zhang, J Lin Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 8 | 2022 |
Encoder adaptation of dense passage retrieval for open-domain question answering M Li, J Lin arXiv preprint arXiv:2110.01599, 2021 | 8 | 2021 |
An encoder attribution analysis for dense passage retriever in open-domain question answering M Li, X Ma, J Lin Proceedings of the 2nd Workshop on Trustworthy Natural Language Processing …, 2022 | 7 | 2022 |
Generate, filter, and fuse: Query expansion via multi-step keyword generation for zero-shot neural rankers M Li, H Zhuang, K Hui, Z Qin, J Lin, R Jagerman, X Wang, M Bendersky arXiv preprint arXiv:2311.09175, 2023 | 4 | 2023 |
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution M Li, X Chen, A Holtzman, B Chen, J Lin, W Yih, XV Lin arXiv preprint arXiv:2405.19325, 2024 | 3 | 2024 |
Accelerating large scale knowledge distillation via dynamic importance sampling M Li, T Zuo, R Li, M White, W Zheng arXiv preprint arXiv:1812.00914, 2018 | 3 | 2018 |
Unifying Multimodal Retrieval via Document Screenshot Embedding X Ma, SC Lin, M Li, W Chen, J Lin arXiv preprint arXiv:2406.11251, 2024 | 2 | 2024 |
Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction X Zhang, M Li, J Lin arXiv preprint arXiv:2302.06589, 2023 | 2 | 2023 |
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures M Li, H Bai, L Tan, K Xiong, J Lin arXiv preprint arXiv:2010.11351, 2020 | 1 | 2020 |
Pretrained Transformers for Efficient and Robust Information Retrieval M Li University of Waterloo, 2024 | | 2024 |
CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders. C Zhang, M Li, J Lin Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 | | 2024 |