Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning Q Zhang, M Chen, A Bukharin, P He, Y Cheng, W Chen, T Zhao International Conference on Learning Representations, 2023 | 273* | 2023 |
Efficient approximation of deep ReLU networks for functions on low dimensional manifolds M Chen, H Jiang, W Liao, T Zhao Advances in Neural Information Processing Systems 32, 2019 | 117 | 2019 |
Differentiable top-k with optimal transport Y Xie, H Dai, M Chen, B Dai, T Zhao, H Zha, W Wei, T Pfister Advances in Neural Information Processing Systems 33, 20520-20531, 2020 | 115 | 2020 |
Nonparametric regression on low-dimensional manifolds using deep ReLU networks: Function approximation and statistical recovery M Chen, H Jiang, W Liao, T Zhao Information and Inference: A Journal of the IMA 11 (4), 1203-1253, 2022 | 101 | 2022 |
Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data M Chen, K Huang, T Zhao, M Wang International Conference on Machine Learning, 4672-4712, 2023 | 89 | 2023 |
How important is the train-validation split in meta-learning? Y Bai, M Chen, P Zhou, T Zhao, J Lee, S Kakade, H Wang, C Xiong International Conference on Machine Learning, 543-553, 2021 | 82 | 2021 |
On generalization bounds of a family of recurrent neural networks M Chen, X Li, T Zhao arXiv preprint arXiv:1910.12947, 2019 | 70 | 2019 |
Towards understanding the importance of shortcut connections in residual networks T Liu, M Chen, M Zhou, SS Du, E Zhou, T Zhao Advances in Neural Information Processing Systems 32, 2019 | 63 | 2019 |
Super tickets in pre-trained language models: From model compression to improving generalization C Liang, S Zuo, M Chen, H Jiang, X Liu, P He, T Zhao, W Chen arXiv preprint arXiv:2105.12002, 2021 | 56 | 2021 |
Distribution approximation and statistical estimation guarantees of generative adversarial networks M Chen, W Liao, H Zha, T Zhao arXiv preprint arXiv:2002.03938, 2020 | 50* | 2020 |
Towards understanding hierarchical learning: Benefits of neural representations M Chen, Y Bai, JD Lee, T Zhao, H Wang, C Xiong, R Socher Advances in Neural Information Processing Systems 33, 22134-22145, 2020 | 50 | 2020 |
On computation and generalization of generative adversarial imitation learning M Chen, Y Wang, T Liu, Z Yang, X Li, Z Wang, T Zhao arXiv preprint arXiv:2001.02792, 2020 | 48 | 2020 |
Large learning rate tames homogeneity: Convergence and balancing effect Y Wang, M Chen, T Zhao, M Tao arXiv preprint arXiv:2110.03677, 2021 | 41 | 2021 |
Besov function approximation and binary classification on low-dimensional manifolds using convolutional residual networks H Liu, M Chen, T Zhao, W Liao International Conference on Machine Learning, 6770-6780, 2021 | 35 | 2021 |
Deep nonparametric estimation of operators between infinite dimensional spaces H Liu, H Yang, M Chen, T Zhao, W Liao Journal of Machine Learning Research 25 (24), 1-67, 2024 | 31 | 2024 |
On computation and generalization of generative adversarial networks under spectrum control H Jiang, Z Chen, M Chen, F Liu, D Wang, T Zhao International Conference on Learning Representations, 2019 | 30* | 2019 |
Diffusion model for data-driven black-box optimization Z Li, H Yuan, K Huang, C Ni, Y Ye, M Chen, M Wang arXiv preprint arXiv:2403.13219, 2024 | 27* | 2024 |
On scalable and efficient computation of large scale optimal transport Y Xie, M Chen, H Jiang, T Zhao, H Zha International Conference on Machine Learning, 6882-6892, 2019 | 26 | 2019 |
An overview of diffusion models: Applications, guided generation, statistical rates and optimization M Chen, S Mei, J Fan, M Wang arXiv preprint arXiv:2404.07771, 2024 | 19 | 2024 |
Sample complexity of nonparametric off-policy evaluation on low-dimensional manifolds using deep networks X Ji, M Chen, M Wang, T Zhao arXiv preprint arXiv:2206.02887, 2022 | 19 | 2022 |