关注
Yikang Shen
Yikang Shen
MIT-IBM Watson Lab
在 ibm.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Long range arena: A benchmark for efficient transformers
Y Tay, M Dehghani, S Abnar, Y Shen, D Bahri, P Pham, J Rao, L Yang, ...
ICLR 2021, 2020
6642020
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Y Shen, S Tan, A Sordoni, A Courville
ICLR 2019, 2019
4062019
Principle-driven self-alignment of language models from scratch with minimal human supervision
Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan
Advances in Neural Information Processing Systems 36, 2024
2912024
Banditsum: Extractive summarization as a contextual bandit
Y Dong, Y Shen, E Crawford, H van Hoof, JCK Cheung
EMNLP 2018, 2018
2362018
Neural language modeling by jointly learning syntax and lexicon
Y Shen, Z Lin, CW Huang, A Courville
ICLR 2018, 2017
2052017
Aligning large multimodal models with factually augmented rlhf
Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ...
arXiv preprint arXiv:2309.14525, 2023
1952023
Prompting decision transformer for few-shot policy generalization
M Xu, Y Shen, S Zhang, Y Lu, D Zhao, J Tenenbaum, C Gan
international conference on machine learning, 24631-24645, 2022
1332022
Transformer-patcher: One mistake worth one neuron
Z Huang, Y Shen, X Zhang, J Zhou, W Rong, Z Xiong
arXiv preprint arXiv:2301.09785, 2023
1302023
Planning with large language models for code generation
S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan
arXiv preprint arXiv:2303.05510, 2023
1242023
Straight to the tree: Constituency parsing with neural syntactic distance
Y Shen, Z Lin, AP Jacob, A Sordoni, A Courville, Y Bengio
ACL 2018, 2018
982018
Gated linear attention transformers with hardware-efficient training
S Yang, B Wang, Y Shen, R Panda, Y Kim
arXiv preprint arXiv:2312.06635, 2023
812023
Mod-squad: Designing mixtures of experts as modular multi-task learners
Z Chen, Y Shen, M Ding, Z Chen, H Zhao, EG Learned-Miller, C Gan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
752023
Question/answer matching for CQA system via combining lexical and sequential information
Y Shen, W Rong, Z Sun, Y Ouyang, Z Xiong
AAAI 2015, 2015
722015
Convolutional neural network based sentiment analysis using Adaboost combination
Y Gao, W Rong, Y Shen, Z Xiong
2016 International Joint Conference on Neural Networks (IJCNN), 1333-1338, 2016
652016
SALMON: Self-Alignment with Instructable Reward Models
Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, DD Cox, Y Yang, C Gan
The Twelfth International Conference on Learning Representations, 2024
60*2024
Word embedding based correlation model for question/answer matching
Y Shen, W Rong, N Jiang, B Peng, J Tang, Z Xiong
AAAI 2017 31 (1), 2017
602017
Graphtext: Graph reasoning in text space
J Zhao, L Zhuo, Y Shen, M Qu, K Liu, M Bronstein, Z Zhu, J Tang
arXiv preprint arXiv:2310.01089, 2023
522023
Structformer: Joint unsupervised induction of dependency and constituency structure from masked language modeling
Y Shen, Y Tay, C Zheng, D Bahri, D Metzler, A Courville
ACL 2021, 2020
462020
Hyper-decision transformer for efficient online policy adaptation
M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan
arXiv preprint arXiv:2304.08487, 2023
422023
Mixture of attention heads: Selecting attention heads per token
X Zhang, Y Shen, Z Huang, J Zhou, W Rong, Z Xiong
arXiv preprint arXiv:2210.05144, 2022
332022
系统目前无法执行此操作,请稍后再试。
文章 1–20