Folgen
Joshua Romoff
Joshua Romoff
Ubisoft La Forge
Bestätigte E-Mail-Adresse bei ubisoft.com
Titel
Zitiert von
Zitiert von
Jahr
Towards the systematic reporting of the energy and carbon footprints of machine learning
P Henderson, J Hu, J Romoff, E Brunskill, D Jurafsky, J Pineau
The Journal of Machine Learning Research 21 (1), 10039-10081, 2020
3902020
Tarmac: Targeted multi-agent communication
A Das, T Gervet, J Romoff, D Batra, D Parikh, M Rabbat, J Pineau
International Conference on Machine Learning, 1538-1546, 2019
3692019
Hybrid reward architecture for reinforcement learning
H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang
Advances in Neural Information Processing Systems 30, 2017
2712017
Reward estimation for variance reduction in deep reinforcement learning
J Romoff, P Henderson, A Piché, V Francois-Lavet, J Pineau
Conference on Robot Learning, 674-699, 2018
482018
Where did my optimum go?: An empirical analysis of gradient descent optimization in policy gradient methods
P Henderson, J Romoff, J Pineau
arXiv preprint arXiv:1810.02525, 2018
42*2018
Deep reinforcement learning for navigation in aaa video games
E Alonso, M Peter, D Goumard, J Romoff
Proceedings of the Thirtieth International Joint Conference on Artificial …, 2021
402021
Randomized value functions via multiplicative normalizing flows
A Touati, H Satija, J Romoff, J Pineau, P Vincent
Uncertainty in Artificial Intelligence, 422-432, 2020
362020
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
M Assran, J Romoff, N Ballas, J Pineau, M Rabbat
Advances in Neural Information Processing Systems, 13299-13309, 2019
302019
Separating value functions across time-scales
J Romoff, P Henderson, A Touati, E Brunskill, J Pineau, Y Ollivier
International Conference on Machine Learning, 5468-5477, 2019
252019
Multi-advisor reinforcement learning
R Laroche, M Fatemi, J Romoff, H van Seijen
arXiv preprint arXiv:1704.00756, 2017
242017
Separation of concerns in reinforcement learning
H van Seijen, M Fatemi, J Romoff, R Laroche
arXiv preprint arXiv:1612.05159, 2016
21*2016
Direct behavior specification via constrained reinforcement learning
J Roy, R Girgis, J Romoff, PL Bacon, C Pal
arXiv preprint arXiv:2112.12228, 2021
192021
Graph augmented deep reinforcement learning in the GameRLand3D environment
E Beeching, M Peter, P Marcotte, J Debangoye, O Simonin, J Romoff, ...
arXiv preprint arXiv:2112.11731, 2021
92021
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?
J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
9*2021
Deep conditional multi-task learning in atari
J Romoff, E Bengio, J Pineau
Workshop on Abstraction in Reinforcement Learning at ICML, 2016
52016
Decomposing the Bellman Equation in Reinforcement Learning
J Romoff
McGill University (Canada), 2021
22021
Improving Intrinsic Exploration by Creating Stationary Objectives
RC Castanyer, J Romoff, G Berseth
arXiv preprint arXiv:2310.18144, 2023
12023
Learning Computational Efficient Bots with Costly Features
A Kobanda, CA Valliappan, J Romoff, L Denoyer
2023 IEEE Conference on Games (CoG), 1-8, 2023
12023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
D Bairamian, P Marcotte, J Romoff, G Robert, D Nowrouzezahrai
arXiv preprint arXiv:2311.17190, 2023
2023
About the attractor phenomenon in decomposed reinforcement learning
R Laroche, M Fatemi, J Romoff, H van Seijen
2018
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20