Sparseadapter: An easy approach for improving the parameter-efficiency of adapters S He, L Ding, D Dong, M Zhang, D Tao arXiv preprint arXiv:2210.04284, 2022 | 49 | 2022 |
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training LME Team https://github.com/pjlab-sys4nlp/llama-moe, 2023 | 3 | 2023 |
PAD-net: An efficient framework for dynamic networks S He, L Ding, D Dong, B Liu, F Yu, D Tao arXiv preprint arXiv:2211.05528, 2022 | 3 | 2022 |
SD-Conv: Towards the Parameter-Efficiency of Dynamic Convolution S He, C Jiang, D Dong, L Ding IEEE/CVF Winter Conference on Applications of Computer Vision, 2023 (WACV 2023)., 2022 | 2 | 2022 |
iDAT: inverse Distillation Adapter-Tuning J Ruan, J Gao, M Xie, D Dong, S Xiang, T Liu, Y Fu arXiv preprint arXiv:2403.15750, 2024 | | 2024 |
A Graph is Worth Words: Euclideanizing Graph using Pure Transformer Z Gao, D Dong, C Tan, J Xia, B Hu, SZ Li arXiv preprint arXiv:2402.02464, 2024 | | 2024 |