Sheng Shen

Cited by

	All	Since 2019
Citations	5349	5327
h-index	25	25
i10-index	33	33

2700

1350

675

2025

201820192020202120222023202418 43 156 348 862 2692 1215

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Kurt KeutzerProfessor of the Graduate School, EECS, University of California, BerkeleyVerified email at berkeley.edu
Zhewei YaoSnowflakeVerified email at snowflake.com
Michael MahoneyProfessor of Statistics, UC BerkeleyVerified email at stat.berkeley.edu
Amir GholamiResearch Scientist, University of California, BerkeleyVerified email at eecs.berkeley.edu
Trevor DarrellProfessor of Computer Science, U.C. BerkeleyVerified email at eecs.berkeley.edu
Chunyuan LiMicrosoft Research, RedmondVerified email at microsoft.com
Xuanzhe LiuBoya Professor of Computer Science, Peking University, ACM Distinguished ScientistVerified email at pku.edu.cn
Joseph E. GonzalezProfessor of Computer Science, UC BerkeleyVerified email at berkeley.edu
Qiaozhu MeiProfessor, University of MichiganVerified email at umich.edu
Iz BeltagyAllen Institute for Artificial IntelligenceVerified email at beltagy.net
Le HouGoogleVerified email at google.com
Denny ZhouResearch Scientist, Google DeepMindVerified email at google.com
Douwe KielaContextual AI, Stanford UniversityVerified email at stanford.edu
Yaliang LiAlibaba GroupVerified email at alibaba-inc.com
Dan KleinUC Berkeley

Sheng Shen

UC Berkeley

Verified email at berkeley.edu - Homepage

Machine Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Multitask prompted training enables zero-shot task generalization V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... ICLR 2022, 2021	1223	2021
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1150	2023
Q-bert: Hessian based ultra low precision quantization of bert S Shen, Z Dong, J Ye, L Ma, Z Yao, A Gholami, MW Mahoney, K Keutzer AAAI 2020, 2019	503	2019
Crosslingual generalization through multitask finetuning N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ... ACL 2023, 2022	370	2022
How Much Can CLIP Benefit Vision-and-Language Tasks? S Shen, LH Li, H Tan, M Bansal, A Rohrbach, KW Chang, Z Yao, ... ICLR 2022, 2021	353	2021
Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers Z Li, E Wallace, S Shen, K Lin, K Keutzer, D Klein, JE Gonzalez ICML 2020, 2020	252	2020
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning Z Yao, A Gholami, S Shen, K Keutzer, MW Mahoney AAAI 2021, 2020	212	2020
Agentbench: Evaluating llms as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023	143*	2023
An annotated dataset of literary entities D Bamman, S Popat, S Shen NAACL 2019, 2019	93	2019
Learned token pruning for transformers S Kim, S Shen, D Thorsley, A Gholami, W Kwon, J Hassoun, K Keutzer KDD 2022, 2021	88	2021
Ermes: Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification Z Chen, S Shen, Z Hu, X Lu, Q Mei, X Liu WWW 2019, 2018	80*	2018
What Language Model to Train if You Have One Million GPU Hours? T Le Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ... EMNLP 2022, 2022	78	2022
Powernorm: Rethinking batch normalization in transformers S Shen, Z Yao, A Gholami, M Mahoney, K Keutzer ICML 2020, 2020	78	2020
Through a gender lens: An empirical study of emoji usage over large-scale android users Z Chen, X Lu, S Shen, W Ai, X Liu, Q Mei arXiv preprint arXiv:1705.05546 10 (3178876.3186157), 2017	71	2017
K-lite: Learning transferable visual models with external knowledge S Shen, C Li, X Hu, Y Xie, J Yang, P Zhang, A Rohrbach, Z Gan, L Wang, ... NeurIPS 2022, 2022	65	2022
Pragmatically Informative Text Generation S Shen, D Fried, J Andreas, D Klein NAACL 2019, 2019	65	2019
Aligning large multimodal models with factually augmented rlhf Z Sun, S Shen, S Cao*, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ... arXiv preprint arXiv:2309.14525, 2023	60	2023
Poisoning Language Models During Instruction Tuning A Wan, E Wallace, S Shen, D Klein ICML 2023, 2023	58	2023
SqueezeLLM: Dense-and-Sparse Quantization S Kim, C Hooper, A Gholami*, Z Dong, X Li, S Shen, MW Mahoney, ... arXiv preprint arXiv:2306.07629, 2023	54	2023
An Effective Framework for Weakly-Supervised Phrase Grounding Q Wang, H Tan, S Shen, M Mahoney, Z Yao EMNLP 2020, 2020	38*	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors