Xin Wang
Title
Cited by
Year
Large language models with controllable working memory
D Li, AS Rawat, M Zaheer, X Wang, M Lukasik, A Veit, F Yu, S Kumar
arXiv preprint arXiv:2211.05110, 2022
Cited by 63, 2022
One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks
Atish Agarwala, Abhimanyu Das, Brendan Juba, Rina Panigrahy, Vatsal Sharan ...
International Conference on Learning Representations, 2021
Cited by 14*, 2021
A unified cascaded encoder ASR model for dynamic model sizes
S Ding, W Wang, D Zhao, TN Sainath, Y He, R David, R Botros, X Wang, ...
arXiv preprint arXiv:2204.06164, 2022
Cited by 13, 2022
Sketch based memory for neural networks
R Panigrahy, X Wang, M Zaheer
International Conference on Artificial Intelligence and Statistics, 3169-3177, 2021
Cited by 10, 2021
A theoretical view on sparsely activated networks
C Baykal, N Dikkala, R Panigrahy, C Rashtchian, X Wang
Advances in Neural Information Processing Systems 35, 30071-30084, 2022
Cited by 9, 2022
Back and forth error compensation and correction method for linear hyperbolic systems with application to the Maxwell's equations
X Wang, Y Liu
Journal of Computational Physics: X 1, 100014, 2019
Cited by 7, 2019
On the benefits of learning to route in mixture-of-experts models
N Dikkala, N Ghosh, R Meka, R Panigrahy, N Vyas, X Wang
The 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Cited by 5, 2023
Improving sampling accuracy of stochastic gradient MCMC methods via non-uniform subsampling of gradients
R Li, X Wang, H Zha, M Tao
arXiv preprint arXiv:2002.08949, 2020
Cited by 4, 2020
JaxPruner: A concise library for sparsity research
JH Lee, W Park, NE Mitchell, J Pilault, JSO Ceron, HB Kim, N Lee, ...
Conference on Parsimony and Learning, 515-528, 2024
Cited by 3, 2024
Alternating updates for efficient transformers
C Baykal, D Cutler, N Dikkala, N Ghosh, R Panigrahy, X Wang
Advances in Neural Information Processing Systems 36, 2024
Cited by 2, 2024
LayerNAS: Neural architecture search in polynomial complexity
Y Fan, D Alon, J Shen, D Peng, K Kumar, Y Long, X Wang, F Iliopoulos, ...
arXiv preprint arXiv:2304.11517, 2023
Cited by 2, 2023
Provable hierarchical lifelong learning with a sketch-based modular architecture
Z Deng, Z Fryer, B Juba, R Panigrahy, X Wang
arXiv preprint arXiv:2112.10919, 2021
Cited by 2, 2021
Sketching based Representations for Robust Image Classification with Provable Guarantees
N Dikkala, SR Karingula, R Meka, J Nelson, R Panigrahy, X Wang
Advances in Neural Information Processing Systems 35, 5459-5470, 2022
Cited by 1, 2022
Unified Cascaded Encoder ASR model for Dynamic Model Sizes
S Ding, Y He, X Wang, W Wang, T Strohman, TN Sainath, ...
US Patent US20230326461A1, 2023
2023
The Power of External Memory in Increasing Predictive Model Capacity
C Baykal, DJ Cutler, N Dikkala, N Ghosh, R Panigrahy, X Wang
arXiv preprint arXiv:2302.00003, 2023
2023
JAXPruner: A Modular Library for Sparsity Research
ICLR 2023 Workshop on Sparsity in Neural Networks, 2023
2023
One network fits all? Modular versus monolithic task formulations in neural networks
A Das, A Agarwala, B Juba, R Zhang, R Panigrahy, X Wang
2021
The back and forth error compensation and correction method for linear hyperbolic systems and a conservative BFECC limiter
X Wang
Georgia Institute of Technology, 2018
2018
Understanding the Capabilities and Limitations of Neural Networks for Multi-task Learning
V Sharan, X Wang, B Juba, R Panigrahy