RoFormer: Enhanced Transformer with Rotary Position Embedding J Su, Y Lu, S Pan, B Wen, Y Liu arXiv preprint arXiv:2104.09864, 2021 | 2397 | 2021 |
A novel cascade binary tagging framework for relational triple extraction Z Wei, J Su, Y Wang, Y Tian, Y Chang arXiv preprint arXiv:1909.03227, 2019 | 710 | 2019 |
Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images R Li, S Zheng, C Zhang, C Duan, J Su, L Wang, PM Atkinson IEEE Transactions on Geoscience and Remote Sensing, 2021 | 423 | 2021 |
Whitening sentence representations for better semantics and faster retrieval J Su, J Cao, W Liu, Y Ou arXiv preprint arXiv:2103.15316, 2021 | 375 | 2021 |
Multistage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images R Li, S Zheng, C Duan, J Su, C Zhang IEEE Geoscience and Remote Sensing Letters, 2021 | 320 | 2021 |
Global Pointer: Novel Efficient Span-based Approach for Named Entity Recognition J Su, A Murtadha, S Pan, J Hou, J Sun, W Huang, B Wen, Y Liu arXiv preprint arXiv:2208.03054, 2022 | 164 | 2022 |
Kimi k1. 5: Scaling Reinforcement Learning with LLMs K Team, A Du, B Gao, B Xing, C Jiang, C Chen, C Li, C Xiao, C Du, C Liao, ... arXiv preprint arXiv:2501.12599, 2025 | 117 | 2025 |
ZLPR: A Novel Loss for Multi-label Classification J Su, M Zhu, A Murtadha, S Pan, B Wen, Y Liu arXiv preprint arXiv:2208.02955, 2022 | 86 | 2022 |
A batch normalized inference network keeps the kl vanishing away Q Zhu, J Su, W Bi, X Liu, X Ma, X Li, D Wu arXiv preprint arXiv:2004.12585, 2020 | 80 | 2020 |
Linear attention mechanism: An efficient attention for semantic segmentation R Li, J Su, C Duan, S Zheng arXiv preprint arXiv:2007.14902, 2020 | 51 | 2020 |
Rectified exponential units for convolutional neural networks Y Ying, J Su, P Shan, L Miao, X Wang, S Peng IEEE Access 7, 101633-101640, 2019 | 49 | 2019 |
Dual-discriminative graph neural network for imbalanced graph-level anomaly detection G Zhang, Z Yang, J Wu, J Yang, S Xue, H Peng, J Su, C Zhou, QZ Sheng, ... Advances in Neural Information Processing Systems, 2022 | 43 | 2022 |
Graph entropy guided node embedding dimension selection for graph neural networks G Luo, J Li, J Su, H Peng, C Yang, L Sun, PS Yu, L He arXiv preprint arXiv:2105.03178, 2021 | 43 | 2021 |
ZARTS: On Zero-order Optimization for Neural Architecture Search X Wang, W Guo, J Yan, J Su, X Yang arXiv preprint arXiv:2110.04743, 2021 | 39 | 2021 |
Elucidating the exposure bias in diffusion models M Ning, M Li, J Su, AA Salah, IO Ertugrul arXiv preprint arXiv:2308.15321, 2023 | 37 | 2023 |
SimBERT: integrating retrieval and generation into BERT J Su Technical report, 2020 | 29 | 2020 |
Gan-qp: A novel gan framework without gradient vanishing and lipschitz constraint J Su arXiv preprint arXiv:1811.07296, 2018 | 29 | 2018 |
RoFormer: enhanced transformer with rotary position embedding. arXiv (2021) J Su, Y Lu, S Pan, A Murtadha, B Wen, Y Liu arXiv preprint arXiv:2104.09864, 0 | 29 | |
Evaluating Generalization Ability of Convolutional Neural Networks and Capsule Networks for Image Classification via Top-2 Classification H Ren, J Su, H Lu arXiv preprint arXiv:1901.10112, 2019 | 28 | 2019 |
A Novel Hierarchical Binary Tagging Framework for Joint Extraction of Entities and Relations Z Wei, J Su, Y Wang, Y Tian, Y Chang arXiv preprint arXiv:1909.03227, 2019 | 27 | 2019 |