Rémi Munos

Zitiert von

	Alle	Seit 2019
Zitate	40997	32274
h-index	89	79
i10-index	194	158

8000

4000

2000

6000

200720082009201020112012201320142015201620172018201920202021202220232024170 245 225 354 470 531 583 767 802 893 1120 2005 3037 4077 5640 6879 7770 4847

Öffentlicher Zugriff

Alle anzeigen

20 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindBestätigte E-Mail-Adresse bei meta.com
Mohammad Gheshlaghi AzarCohereBestätigte E-Mail-Adresse bei google.com
Marc G. BellemareReliant AI, prev. Google Brain, DeepMindBestätigte E-Mail-Adresse bei reliant.ai
Csaba SzepesvariDeepMind & University of AlbertaBestätigte E-Mail-Adresse bei cs.ualberta.ca
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchBestätigte E-Mail-Adresse bei inria.fr
koray kavukcuogluDeepMindBestätigte E-Mail-Adresse bei kavukcuoglu.org
Odalric-Ambrym MaillardInria Lille - Nord EuropeBestätigte E-Mail-Adresse bei inria.fr
Sebastien BubeckVP GenAI Research, Microsoft AIBestätigte E-Mail-Adresse bei microsoft.com
Andrew MooreDean, School of Computer Science, Carnegie MellonBestätigte E-Mail-Adresse bei cs.cmu.edu
Anna HarutyunyanDeepMindBestätigte E-Mail-Adresse bei google.com
Marc LanctotResearch Scientist, Google DeepMindBestätigte E-Mail-Adresse bei google.com
Tom SchaulSenior Staff Scientist, DeepMindBestätigte E-Mail-Adresse bei nyu.edu
András AntosBudapest University of Technology and EconomicsBestätigte E-Mail-Adresse bei cs.bme.hu
Volodymyr MnihDeepMindBestätigte E-Mail-Adresse bei cs.toronto.edu
Hilbert Johan KappenRadboud UniversityBestätigte E-Mail-Adresse bei science.ru.nl
David SilverDeepMind, UCLBestätigte E-Mail-Adresse bei google.com
Lucian BusoniuProfessor and Group Lead, Automation Department, Technical University of Cluj-NapocaBestätigte E-Mail-Adresse bei aut.utcluj.ro
Andre BarretoResearch Scientist, Google DeepMindBestätigte E-Mail-Adresse bei google.com
Olivier TeytaudfacebookBestätigte E-Mail-Adresse bei fb.com
Sylvain GellyGoogle Brain ZurichBestätigte E-Mail-Adresse bei m4x.org

Folgen

Rémi Munos

Google DeepMind

Bestätigte E-Mail-Adresse bei inria.fr - Startseite

Reinforcement learning RLHF MCTS bandit theory statistical learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Bootstrap your own latent-a new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Advances in neural information processing systems 33, 21271-21284, 2020	6108	2020
A distributional perspective on reinforcement learning MG Bellemare, W Dabney, R Munos International conference on machine learning, 449-458, 2017	1748	2017
Unifying count-based exploration and intrinsic motivation M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos Advances in neural information processing systems 29, 2016	1665	2016
Impala: Scalable distributed deep-rl with importance weighted actor-learner architectures L Espeholt, H Soyer, R Munos, K Simonyan, V Mnih, T Ward, Y Doron, ... International conference on machine learning, 1407-1416, 2018	1601	2018
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1038	2016
Sample efficient actor-critic with experience replay Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ... arXiv preprint arXiv:1611.01224, 2016	992	2016
Best arm identification in multi-armed bandits JY Audibert, S Bubeck COLT-23th Conference on learning theory-2010, 13 p., 2010	927	2010
Minimax regret bounds for reinforcement learning MG Azar, I Osband, R Munos International conference on machine learning, 263-272, 2017	825	2017
Distributional reinforcement learning with quantile regression W Dabney, M Rowland, M Bellemare, R Munos Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	808	2018
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits JY Audibert, R Munos, C Szepesvári Theoretical Computer Science 410 (19), 1876-1902, 2009	778	2009
Thompson sampling: An asymptotically optimal finite-time analysis E Kaufmann, N Korda, R Munos International conference on algorithmic learning theory, 199-213, 2012	770	2012
Safe and efficient off-policy reinforcement learning R Munos, T Stepleton, A Harutyunyan, M Bellemare Advances in neural information processing systems 29, 2016	718	2016
Count-based exploration with neural density models G Ostrovski, MG Bellemare, A Oord, R Munos International conference on machine learning, 2721-2730, 2017	709	2017
Finite-Time Bounds for Fitted Value Iteration. R Munos, C Szepesvári Journal of Machine Learning Research 9 (5), 2008	631	2008
Successor features for transfer in reinforcement learning A Barreto, W Dabney, R Munos, JJ Hunt, T Schaul, HP van Hasselt, ... Advances in neural information processing systems 30, 2017	616	2017
Automated curriculum learning for neural networks A Graves, MG Bellemare, J Menick, R Munos, K Kavukcuoglu international conference on machine learning, 1311-1320, 2017	615	2017
Pure exploration in multi-armed bandits problems S Bubeck, R Munos, G Stoltz Algorithmic Learning Theory: 20th International Conference, ALT 2009, Porto …, 2009	607	2009
Implicit quantile networks for distributional reinforcement learning W Dabney, G Ostrovski, D Silver, R Munos International conference on machine learning, 1096-1105, 2018	578	2018
Modiﬁcation of UCT with Patterns in Monte-Carlo Go S Gelly, Y Wang, R Munos, O Teytaud INRIA, 2006	540	2006
Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney International conference on learning representations, 2018	539	2018

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren