Yonathan Efroni

Cited by

	All	Since 2019
Citations	1442	1431
h-index	20	19
i10-index	25	25

Citations per year
Year	2018	2019	2020	2021	2022	2023	2024
Citations	6	38	126	243	317	429	277

Public access

8 articles available
0 articles not available

Based on funding mandates

Co-authors

Co-author	Affiliation	Verified email
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia	technion.ac.il
Lior Shani	Google Research	google.com
Akshay Krishnamurthy	University of Massachusetts Amherst	cs.umass.edu
John Langford	Microsoft Research New York	hunch.net
Mohammad Ghavamzadeh	Amazon	amazon.com
Dipendra Misra	Microsoft Research New York	microsoft.com
Constantine Caramanis	Professor of Electrical and Computer Engineering, UT Austin	utexas.edu
Nadav Merlis	Postdoctoral Fellow @ CREST, ENSAE Paris	ensae.fr
Manan Tomar	PhD student at University of Alberta	ualberta.ca
Alex Lamb	Microsoft Research (NYC), Université de Montréal, Google Brain, Amazon, Twitch PhD Fellow	microsoft.com
Riashat Islam	Research Scientist	mail.mcgill.ca
Gal Dalal	Sr. Research Scientist, Nvidia	nvidia.com
Aviv Rosenberg	Google Research	google.com
Chen Tessler	Research Scientist, NVIDIA Research	nvidia.com
Dylan J. Foster	Principal Researcher, Microsoft Research	microsoft.com
Matteo Pirotta	Research Scientist, Meta (FAIR)	fb.com
Alekh Agarwal	Google	google.com
Guy Tennenholtz	Research Scientist, Google Research	google.com
Uri Shalit	Associate Professor, Technion - Israel Institute of Technology	technion.ac.il
Sobhan Miryoosefi	Google Research	google.com

Yonathan Efroni

Meta, New York

Verified email at fb.com

Reinforcement Learning, Machine Learning


Title	Cited by	Year
Action Robust Reinforcement Learning and Applications in Continuous Control. C Tessler, Y Efroni, S Mannor. arXiv preprint arXiv:1901.09184, 2019	227	2019
Adaptive trust region policy optimization: Global convergence and faster rates for regularized MDPs. L Shani, Y Efroni, S Mannor. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5668-5675, 2020	185	2020
Exploration-exploitation in constrained MDPs. Y Efroni, S Mannor, M Pirotta. arXiv preprint arXiv:2003.02189, 2020	161	2020
Optimistic policy optimization with bandit feedback. Y Efroni, L Shani, A Rosenberg, S Mannor. arXiv preprint arXiv:2002.08243, 2020	95*	2020
Tight regret bounds for model-based reinforcement learning with greedy policies. Y Efroni, N Merlis, M Ghavamzadeh, S Mannor. Advances in Neural Information Processing Systems, 12203-12213, 2019	78	2019
RL for latent MDPs: Regret guarantees and a lower bound. J Kwon, Y Efroni, C Caramanis, S Mannor. Advances in Neural Information Processing Systems 34, 24523-24534, 2021	73	2021
Mirror descent policy optimization. M Tomar, L Shani, Y Efroni, M Ghavamzadeh. arXiv preprint arXiv:2005.09814, 2020	64	2020
Universality of local weak interactions and its application for interferometric alignment. J Dziewior, L Knips, D Farfurnik, K Senkalla, N Benshalom, J Efroni, ... Proceedings of the National Academy of Sciences 116 (8), 2881-2890, 2019	55	2019
Provably filtering exogenous distractors using multistep inverse dynamics. Y Efroni, D Misra, A Krishnamurthy, A Agarwal, J Langford. International Conference on Learning Representations, 2021	49*	2021
Beyond the one-step greedy approach in reinforcement learning. Y Efroni, G Dalal, B Scherrer, S Mannor. International Conference on Machine Learning, 1387-1396, 2018	45	2018
Reinforcement learning with trajectory feedback. Y Efroni, N Merlis, S Mannor. Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7288-7295, 2021	40	2021
Multiple-step greedy policies in approximate and online reinforcement learning. Y Efroni, G Dalal, B Scherrer, S Mannor. Advances in Neural Information Processing Systems 31, 2018	40	2018
Provable reinforcement learning with a short-term memory. Y Efroni, C Jin, A Krishnamurthy, S Miryoosefi. International Conference on Machine Learning, 5832-5850, 2022	37	2022
How to combine tree-search methods in reinforcement learning. Y Efroni, G Dalal, B Scherrer, S Mannor. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3494-3501, 2019	35	2019
Guaranteed discovery of control-endogenous latent states with multi-step inverse models. A Lamb, R Islam, Y Efroni, A Didolkar, D Misra, D Foster, L Molu, R Chari, ... arXiv preprint arXiv:2207.08229, 2022	33*	2022
Bandits with partially observable offline data. G Tennenholtz, U Shalit, S Mannor, Y Efroni. arXiv preprint arXiv:2006.06731, 2020	27*	2020
Minimax regret for stochastic shortest path. A Cohen, Y Efroni, Y Mansour, A Rosenberg. Advances in Neural Information Processing Systems 34, 28350-28361, 2021	26	2021
Sample-efficient reinforcement learning in the presence of exogenous information. Y Efroni, DJ Foster, D Misra, A Krishnamurthy, J Langford. Conference on Learning Theory, 5062-5127, 2022	22	2022
Online planning with lookahead policies. Y Efroni, M Ghavamzadeh, S Mannor. Advances in Neural Information Processing Systems 33, 14024-14033, 2020	21*	2020
Exploration conscious reinforcement learning revisited. L Shani, Y Efroni, S Mannor. International Conference on Machine Learning, 5680-5689, 2019	21	2019
