Trevor McInroe
Title
Cited by
Year
Temporal disentanglement of representations for improved generalisation in reinforcement learning
M Dunion, T McInroe, KS Luck, JP Hanna, SV Albrecht
arXiv preprint arXiv:2207.05480, 2022
Cited by: 15, Year: 2022
Deep reinforcement learning for multi-agent interaction
IH Ahmed, C Brewitt, I Carlucho, F Christianos, M Dunion, E Fosong, ...
AI Communications 35 (4), 357-368, 2022
Cited by: 15, Year: 2022
Conditional mutual information for disentangled representations in reinforcement learning
M Dunion, T McInroe, KS Luck, J Hanna, S Albrecht
Advances in Neural Information Processing Systems 36, 2024
Cited by: 10, Year: 2024
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
D Han, T McInroe, A Jelley, SV Albrecht, P Bell, A Storkey
arXiv preprint arXiv:2404.14285, 2024
Cited by: 6, Year: 2024
Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
T McInroe, L Schäfer, SV Albrecht
arXiv preprint arXiv:2110.04935, 2021
Cited by: 5, Year: 2021
Planning to go out-of-distribution in offline-to-online reinforcement learning
T McInroe, SV Albrecht, A Storkey
arXiv preprint arXiv:2310.05723, 2023
Cited by: 2, Year: 2023
Learning representations for control with hierarchical forward models
T McInroe, L Schäfer, SV Albrecht
arXiv preprint arXiv:2206.11396, 2022
Cited by: 1, Year: 2022
Analyzing the Hidden Activations of Deep Policy Networks: Why Representation Matters
TA McInroe, M Spurrier, J Sieber, S Conneely
arXiv preprint arXiv:2103.06398, 2021
Cited by: 1, Year: 2021
Sample Efficiency in Sparse Reinforcement Learning: Or Your Money Back
TA McInroe
arXiv preprint arXiv:2008.12693, 2020
Cited by: 1, Year: 2020
Efficient Offline Reinforcement Learning: The Critic is Critical
A Jelley, T McInroe, S Devlin, A Storkey
arXiv preprint arXiv:2406.13376, 2024
Year: 2024
Safe and Efficient Offline Reinforcement Learning: The Critic is Critical
A Jelley, T McInroe, S Devlin, A Storkey
First Reinforcement Learning Safety Workshop, 0