Joshua Romoff

Zitiert von

	Alle	Seit 2019
Zitate	1582	1504
h-index	12	12
i10-index	13	13

480

240

120

360

2017201820192020202120222023202421 53 100 148 223 335 464 231

Koautoren

Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaBestätigte E-Mail-Adresse bei cs.mcgill.ca
Peter HendersonPrinceton UniversityBestätigte E-Mail-Adresse bei princeton.edu
Romain LarocheMicrosoft ResearchBestätigte E-Mail-Adresse bei polytechnique.org
Harm van SeijenSony AIBestätigte E-Mail-Adresse bei sony.com
Mehdi FatemiWand.aiBestätigte E-Mail-Adresse bei wand.ai
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityBestätigte E-Mail-Adresse bei cs.stanford.edu
Michael RabbatResearch Scientist at FacebookBestätigte E-Mail-Adresse bei fb.com
Ahmed TouatiMeta AIBestätigte E-Mail-Adresse bei umontreal.ca
Dan JurafskyProfessor of Linguistics and Computer Science, Stanford UniversityBestätigte E-Mail-Adresse bei stanford.edu
Jieru HuDeepMindBestätigte E-Mail-Adresse bei deepmind.com
Devi ParikhPreviously: FAIR and GenAI @ Meta. Georgia TechBestätigte E-Mail-Adresse bei gatech.edu
Dhruv BatraGeorgia Tech | Prev: FAIR (Meta AI)Bestätigte E-Mail-Adresse bei gatech.edu
Theophile GervetCarnegie Mellon UniversityBestätigte E-Mail-Adresse bei andrew.cmu.edu
Jeffrey TsangGoogleBestätigte E-Mail-Adresse bei google.com
Tavian BarnesPhD Student, University of WaterlooBestätigte E-Mail-Adresse bei uwaterloo.ca
Pierre-Luc BaconUniversity of MontrealBestätigte E-Mail-Adresse bei mila.quebec
Maxim PeterResearch engineerBestätigte E-Mail-Adresse bei polytechnique.edu
Emmanuel BengioMcGill UniversityBestätigte E-Mail-Adresse bei mail.mcgill.ca
Alexandre PichéServiceNow Research, MilaBestätigte E-Mail-Adresse bei mail.mcgill.ca
Vincent François-LavetVU AmsterdamBestätigte E-Mail-Adresse bei vu.nl

Folgen

Joshua Romoff

Ubisoft La Forge

Bestätigte E-Mail-Adresse bei ubisoft.com

Reinforcement Learning Deep Learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Towards the systematic reporting of the energy and carbon footprints of machine learning P Henderson, J Hu, J Romoff, E Brunskill, D Jurafsky, J Pineau Journal of Machine Learning Research 21 (248), 1-43, 2020	482	2020
Tarmac: Targeted multi-agent communication A Das, T Gervet, J Romoff, D Batra, D Parikh, M Rabbat, J Pineau International Conference on machine learning, 1538-1546, 2019	404	2019
Hybrid reward architecture for reinforcement learning H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang Advances in Neural Information Processing Systems 30, 2017	335*	2017
Reward estimation for variance reduction in deep reinforcement learning J Romoff, P Henderson, A Piché, V Francois-Lavet, J Pineau Conference on Robot Learning, 674-699, 2018	51	2018
Where did my optimum go?: An empirical analysis of gradient descent optimization in policy gradient methods P Henderson, J Romoff, J Pineau arXiv preprint arXiv:1810.02525, 2018	45*	2018
Deep reinforcement learning for navigation in aaa video games E Alonso, M Peter, D Goumard, J Romoff Proceedings of the Thirtieth International Joint Conference on Artificial …, 2021	43	2021
Randomized value functions via multiplicative normalizing flows A Touati, H Satija, J Romoff, J Pineau, P Vincent Uncertainty in Artificial Intelligence, 422-432, 2020	42	2020
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning M Assran, J Romoff, N Ballas, J Pineau, M Rabbat Advances in Neural Information Processing Systems, 13299-13309, 2019	34	2019
Direct behavior specification via constrained reinforcement learning J Roy, R Girgis, J Romoff, PL Bacon, C Pal arXiv preprint arXiv:2112.12228, 2021	32	2021
Separation of concerns in reinforcement learning H van Seijen, M Fatemi, J Romoff, R Laroche arXiv preprint arXiv:1612.05159, 2016	29*	2016
Multi-advisor reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen arXiv preprint arXiv:1704.00756, 2017	28	2017
Separating value functions across time-scales J Romoff, P Henderson, A Touati, E Brunskill, J Pineau, Y Ollivier International Conference on Machine Learning, 5468-5477, 2019	26	2019
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau Proceedings of the 20th International Conference on Autonomous Agents and …, 2021	10*	2021
Graph augmented deep reinforcement learning in the gamerland3d environment E Beeching, M Peter, P Marcotte, J Debangoye, O Simonin, J Romoff, ... arXiv preprint arXiv:2112.11731, 2021	9	2021
Deep conditional multi-task learning in atari J Romoff, E Bengio, J Pineau Workshop on Abstraction in Reinforcement Learning at ICML, 2016	5	2016
Learning Computational Efficient Bots with Costly Features A Kobanda, CA Valliappan, J Romoff, L Denoyer 2023 IEEE Conference on Games (CoG), 1-8, 2023	3	2023
Improving Intrinsic Exploration by Creating Stationary Objectives RC Castanyer, J Romoff, G Berseth arXiv preprint arXiv:2310.18144, 2023	2	2023
Decomposing the Bellman Equation in Reinforcement Learning J Romoff McGill University (Canada), 2021	2	2021
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play D Bairamian, P Marcotte, J Romoff, G Robert, D Nowrouzezahrai arXiv preprint arXiv:2311.17190, 2023		2023
About the attractor phenomenon in decomposed reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren