Yikang Shen

Cited by

	All	Since 2020
Citations	4118	3879
h-index	26	26
i10-index	41	41

1900

950

475

1425

20172018201920202021202220232024202519 52 154 250 357 382 664 1814 410

Public access

View all

9 articles

4 articles

available

not available

Based on funding mandates

Co-authors

Chuang GanUMass Amherst | MIT-IBM Watson AI LabVerified email at csail.mit.edu
Aaron CourvilleProfessor, DIRO, Université de Montréal, Mila, Cifar CAI chairVerified email at umontreal.ca
Zhenfang ChenMIT-IBM Watson AI LabVerified email at cs.hku.hk
Shawn TanMontreal Institute of Learning AlgorithmsVerified email at mila.quebec
Wenge RongBeihang UniversityVerified email at buaa.edu.cn
Zhiqing SunOpenAIVerified email at openai.com
Alessandro SordoniMicrosoft ResearchVerified email at microsoft.com
Shun ZhangVerified email at umich.edu
Yi TayResearch Scientist, Google DeepMindVerified email at google.com
Zhouhan Lin（林洲汉）Shanghai Jiao Tong University; Mila Lab; Facebook AI ResearchVerified email at umontreal.ca
Donald MetzlerGoogle DeepMindVerified email at google.com
Lu YuchenMILA, University of MontrealVerified email at mila.quebec
Chin-Wei HuangMicrosoft ResearchVerified email at microsoft.com
Athul Paul JacobMassachusetts Institute of TechnologyVerified email at mit.edu
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Yue DongUniversity of California RiversideVerified email at ucr.edu
Jackie Chi Kit CheungMcGill UniversityVerified email at cs.mcgill.ca

Yikang Shen

MIT-IBM Watson Lab

Verified email at ibm.com

Deep Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Long range arena: A benchmark for efficient transformers Y Tay, M Dehghani, S Abnar, Y Shen, D Bahri, P Pham, J Rao, L Yang, ... ICLR 2021, 2020	756	2020
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks Y Shen, S Tan, A Sordoni, A Courville ICLR 2019, 2019	412	2019
Principle-driven self-alignment of language models from scratch with minimal human supervision Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan Advances in Neural Information Processing Systems 36, 2511-2565, 2023	339	2023
Aligning large multimodal models with factually augmented rlhf Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ... arXiv preprint arXiv:2309.14525, 2023	286	2023
Banditsum: Extractive summarization as a contextual bandit Y Dong, Y Shen, E Crawford, H van Hoof, JCK Cheung EMNLP 2018, 2018	243	2018
Neural language modeling by jointly learning syntax and lexicon Y Shen, Z Lin, CW Huang, A Courville ICLR 2018, 2017	206	2017
Planning with large language models for code generation S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan arXiv preprint arXiv:2303.05510, 2023	166	2023
Prompting decision transformer for few-shot policy generalization M Xu, Y Shen, S Zhang, Y Lu, D Zhao, J Tenenbaum, C Gan international conference on machine learning, 24631-24645, 2022	162	2022
Transformer-patcher: One mistake worth one neuron Z Huang, Y Shen, X Zhang, J Zhou, W Rong, Z Xiong arXiv preprint arXiv:2301.09785, 2023	159	2023
Gated linear attention transformers with hardware-efficient training S Yang, B Wang, Y Shen, R Panda, Y Kim arXiv preprint arXiv:2312.06635, 2023	135	2023
Straight to the tree: Constituency parsing with neural syntactic distance Y Shen, Z Lin, AP Jacob, A Sordoni, A Courville, Y Bengio ACL 2018, 2018	97	2018
Mod-squad: Designing mixtures of experts as modular multi-task learners Z Chen, Y Shen, M Ding, Z Chen, H Zhao, EG Learned-Miller, C Gan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	96	2023
Graphtext: Graph reasoning in text space J Zhao, L Zhuo, Y Shen, M Qu, K Liu, M Bronstein, Z Zhu, J Tang arXiv preprint arXiv:2310.01089, 2023	79	2023
Question/answer matching for CQA system via combining lexical and sequential information Y Shen, W Rong, Z Sun, Y Ouyang, Z Xiong AAAI 2015, 2015	70	2015
SALMON: Self-alignment with instructable reward models Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, D Cox, Y Yang, C Gan arXiv preprint arXiv:2310.05910, 2023	68*	2023
Convolutional neural network based sentiment analysis using Adaboost combination Y Gao, W Rong, Y Shen, Z Xiong 2016 International Joint Conference on Neural Networks (IJCNN), 1333-1338, 2016	65	2016
Word embedding based correlation model for question/answer matching Y Shen, W Rong, N Jiang, B Peng, J Tang, Z Xiong AAAI 2017 31 (1), 2017	57	2017
Easy-to-hard generalization: Scalable alignment beyond human supervision Z Sun, L Yu, Y Shen, W Liu, Y Yang, S Welleck, C Gan arXiv preprint arXiv:2403.09472, 2024	48	2024
Structformer: Joint unsupervised induction of dependency and constituency structure from masked language modeling Y Shen, Y Tay, C Zheng, D Bahri, D Metzler, A Courville ACL 2021, 2020	48	2020
Granite code models: A family of open foundation models for code intelligence M Mishra, M Stallone, G Zhang, Y Shen, A Prasad, AM Soria, M Merler, ... arXiv preprint arXiv:2405.04324, 2024	46	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors