关注
Deli Chen
Deli Chen
DeepSeek AI
在 deepseek.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Measuring and relieving the over-smoothing problem for graph neural networks from the topological view
D Chen, Y Lin, W Li, P Li, J Zhou, X Sun
Proceedings of the AAAI conference on artificial intelligence 34 (04), 3438-3445, 2020
11572020
Modeling the stock relation with graph network for overnight stock movement prediction
W Li, R Bao, K Harimoto, D Chen, J Xu, Q Su
Proceedings of the twenty-ninth international conference on international …, 2021
1812021
Deepseek llm: Scaling open-source language models with longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
1502024
Label words are anchors: An information flow perspective for understanding in-context learning
L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun
arXiv preprint arXiv:2305.14160, 2023
952023
Deepseekmoe: Towards ultimate expert specialization in mixture-of-experts language models
D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ...
arXiv preprint arXiv:2401.06066, 2024
912024
Topology-imbalance learning for semi-supervised node classification
D Chen, Y Lin, G Zhao, X Ren, P Li, J Zhou, X Sun
Advances in Neural Information Processing Systems 34, 29885-29897, 2021
862021
Math-shepherd: Verify and reinforce llms step-by-step without human annotations
P Wang, L Li, Z Shao, R Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
77*2024
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
L Li, Y Lin, D Chen, S Ren, P Li, J Zhou, X Sun
Findings of EMNLP 2021, 2020
45*2020
Towards codable text watermarking for large language models
L Wang, W Yang, D Chen, H Zhou, Y Lin, F Meng, J Zhou, X Sun
arXiv preprint arXiv:2307.15992, 2023
44*2023
Incorporating fine-grained events in stock movement prediction
D Chen, Y Zou, K Harimoto, R Bao, X Ren, X Sun
ECONLP 2019, 2019
442019
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ...
arXiv preprint arXiv:2406.11931, 2024
422024
Deepseek-v2: A strong, economical, and efficient mixture-of-experts language model
A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, C Ruan, D Dai, ...
arXiv preprint arXiv:2405.04434, 2024
302024
Group, extract and aggregate: Summarizing a large amount of finance news for forex movement prediction
D Chen, K Harimoto, R Bao, Q Su, X Sun
ECONLP 2019, 2019
272019
Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification
D Chen, Y Lin, L Li, X Ren, P Li, J Zhou, X Sun
IJCAI 2022, 2020
15*2020
Leveraging word-formation knowledge for Chinese word sense disambiguation
H Zheng, L Li, D Dai, D Chen, T Liu, X Sun, Y Liu
Findings of the Association for Computational Linguistics: EMNLP 2021, 918-923, 2021
132021
Integrating local real data with global gradient prototypes for classifier re-balancing in federated long-tailed learning
W Yang, D Chen, H Zhou, F Meng, J Zhou, X Sun
arXiv preprint arXiv:2301.10394, 2023
72023
Diffusion theory as a scalpel: Detecting and purifying poisonous dimensions in pre-trained language models caused by backdoor or bias
Z Zhang, D Chen, H Zhou, F Meng, J Zhou, X Sun
arXiv preprint arXiv:2305.04547, 2023
52023
HighwayGraph: Modelling long-distance node relations for improving general graph neural network
D Chen, X Liu, Y Lin, P Li, J Zhou, Q Su, X Sun
arXiv preprint arXiv:1911.03904, 2019
42019
Predicting Popular News Comments Based on Multi-Target Text Matching Model
D Chen, S Ma, P Yang, Q Su
Natural Language Processing and Chinese Computing: 8th CCF International …, 2019
3*2019
Fed-FA: theoretically modeling client data divergence for federated language backdoor defense
Z Zhang, D Chen, H Zhou, F Meng, J Zhou, X Sun
Advances in Neural Information Processing Systems 36, 2024
22024
系统目前无法执行此操作,请稍后再试。
文章 1–20