Deformable convolutional networks J Dai*, H Qi*, Y Xiong*, Y Li*, G Zhang*, H Hu, Y Wei (* co-first author) International Conference on Computer Vision, 2017 | 3996 | 2017 |
Picking Winning Tickets Before Training by Preserving Gradient Flow C Wang, G Zhang, R Grosse International Conference on Learning Representations, 2020 | 334 | 2020 |
Benchmarking Model-Based Reinforcement Learning T Wang, X Bao, I Clavera, J Hoang, Y Wen, E Langlois, S Zhang, G Zhang, ... | 303* | 2019 |
Functional Variational Bayesian Neural Networks S Sun*, G Zhang*, J Shi*, R Grosse (* indicates co-first author) International Conference on Learning Representations, 2019 | 210 | 2019 |
Noisy Natural Gradient as Variational Inference G Zhang*, S Sun*, D Duvenaud, R Grosse (* indicates co-first author) International Conference on Machine Learning, 2018 | 189 | 2018 |
Three Mechanisms of Weight Decay Regularization G Zhang, C Wang, B Xu, R Grosse International Conference on Learning Representations, 2019 | 187 | 2019 |
Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks G Zhang, J Martens, R Grosse Advances in Neural Information Processing Systems, 2019 | 90 | 2019 |
Which algorithmic choices matter at which batch sizes? insights from a noisy quadratic model G Zhang, L Li, Z Nado, J Martens, S Sachdeva, G Dahl, C Shallue, ... Advances in neural information processing systems, 2019 | 90 | 2019 |
On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach Y Wang*, G Zhang*, J Ba (* indicates co-first author) International Conference on Learning Representations, 2020 | 89 | 2020 |
Differentiable Compositional Kernel Learning for Gaussian Processes S Sun, G Zhang, C Wang, W Zeng, J Li, R Grosse International Conference on Machine Learning, 2018 | 75 | 2018 |
EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis C Wang, R Grosse, S Fidler, G Zhang International Conference on Machine Learning, 2019 | 74 | 2019 |
An empirical study of stochastic gradient descent with structured covariance noise Y Wen, K Luk, M Gazeau, G Zhang, H Chan, J Ba International Conference on Artificial Intelligence and Statistics, 3621-3631, 2020 | 40* | 2020 |
Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization G Zhang, Y Wang, L Lessard, R Grosse International Conference on Artificial Intelligence and Statistics (AISTATS), 2022 | 25 | 2022 |
A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints G Zhang, X Bao, L Lessard, R Grosse Journal of Machine Learning Research, 2021 | 20 | 2021 |
Eigenvalue Corrected Noisy Natural Gradient J Bae, G Zhang, R Grosse Neural Information Processing Systems (Bayesian Deep Learning Workshop), 2018 | 19 | 2018 |
On the suboptimality of negative momentum for minimax optimization G Zhang, Y Wang International Conference on Artificial Intelligence and Statistics, 2021 | 13 | 2021 |
Differentiable Annealed Importance Sampling and the Perils of Gradient Noise G Zhang, K Hsu, J Li, C Finn, R Grosse Advances in Neural Information Processing Systems, 2021 | 12 | 2021 |
Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers G Zhang, A Botev, J Martens International Conference on Learning Representations, 2022 | 10 | 2022 |
Learning to give checkable answers with prover-verifier games C Anil, G Zhang, Y Wu, R Grosse arXiv preprint arXiv:2108.12099, 2021 | 5 | 2021 |
Nonnegative matrix cofactorization for weakly supervised image parsing G Zhang, X Gong IEEE Signal Processing Letters 23 (11), 1682-1686, 2016 | 2 | 2016 |