Follow
Ziyu Wang
Ziyu Wang
Deepmind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Taking the human out of the loop: A review of Bayesian optimization
B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas
Proceedings of the IEEE 104 (1), 148-175, 2015
49102015
Dueling network architectures for deep reinforcement learning
Z Wang, T Schaul, M Hessel, H Hasselt, M Lanctot, N Freitas
International conference on machine learning, 1995-2003, 2016
46412016
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
40112019
Emergence of locomotion behaviours in rich environments
N Heess, D Tb, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
10572017
Sample efficient actor-critic with experience replay
Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ...
arXiv preprint arXiv:1611.01224, 2016
9422016
Bayesian optimization in a billion dimensions via random embeddings
Z Wang, F Hutter, M Zoghi, D Matheson, N De Feitas
Journal of Artificial Intelligence Research 55, 361-387, 2016
7852016
Alphastar: Mastering the real-time strategy game starcraft ii
O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ...
DeepMind blog 2, 20, 2019
5352019
Deep fried convnets
Z Yang, M Moczulski, M Denil, N De Freitas, A Smola, L Song, Z Wang
Proceedings of the IEEE international conference on computer vision, 1476-1483, 2015
3442015
Reinforcement and imitation learning for diverse visuomotor skills
Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ...
arXiv preprint arXiv:1802.09564, 2018
3402018
Learning an embedding space for transferable robot skills
K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller
International Conference on Learning Representations, 2018
3272018
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
3202020
Playing hard exploration games by watching youtube
Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas
Advances in neural information processing systems 31, 2018
2942018
Critic regularized regression
Z Wang, A Novikov, K Zolna, JS Merel, JT Springenberg, SE Reed, ...
Advances in Neural Information Processing Systems 33, 7768-7778, 2020
2672020
Robust imitation of diverse behaviors
Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess
Advances in Neural Information Processing Systems 30, 2017
2352017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International conference on machine learning, 2912-2921, 2017
2292017
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2252020
Learning human behaviors from motion capture by adversarial imitation
J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ...
arXiv preprint arXiv:1707.02201, 2017
2242017
Adaptive hamiltonian and riemann manifold monte carlo
Z Wang, S Mohamed, N Freitas
International conference on machine learning, 1462-1470, 2013
1532013
Bayesian optimization in alphago
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
1452018
Hyperparameter selection for offline reinforcement learning
TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ...
arXiv preprint arXiv:2007.09055, 2020
1412020
The system can't perform the operation now. Try again later.
Articles 1–20