Folgen
Kamil Ciosek
Kamil Ciosek
Spotify
Bestätigte E-Mail-Adresse bei spotify.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Generalization in reinforcement learning with selective noise injection and information bottleneck
M Igl, K Ciosek, Y Li, S Tschiatschek, C Zhang, S Devlin, K Hofmann
Advances in neural information processing systems 32, 2019
1762019
Better exploration with optimistic actor critic
K Ciosek, Q Vuong, R Loftin, K Hofmann
Advances in Neural Information Processing Systems 32, 2019
1562019
Expected policy gradients
K Ciosek, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
842018
Compositional planning using optimal option models
D Silver, K Ciosek
arXiv preprint arXiv:1206.6473, 2012
692012
Discount factor as a regularizer in reinforcement learning
R Amit, R Meir, K Ciosek
International conference on machine learning, 269-278, 2020
652020
Conservative uncertainty estimation by fitting prior networks
K Ciosek, V Fortuin, R Tomioka, K Hofmann, R Turner
International Conference on Learning Representations, 2019
622019
Offer: Off-environment reinforcement learning
K Ciosek, S Whiteson
Proceedings of the aaai conference on artificial intelligence 31 (1), 2017
602017
Multi-task batch reinforcement learning with metric learning
J Li, Q Vuong, S Liu, M Liu, K Ciosek, H Christensen, H Su
Advances in Neural Information Processing Systems 33, 6197-6210, 2020
492020
Expected policy gradients for reinforcement learning
K Ciosek, S Whiteson
Journal of Machine Learning Research 21 (52), 1-51, 2020
472020
Deep interactive bayesian reinforcement learning via meta-learning
L Zintgraf, S Devlin, K Ciosek, S Whiteson, K Hofmann
arXiv preprint arXiv:2101.03864, 2021
382021
Evaluating the robustness of collaborative agents
P Knott, M Carroll, S Devlin, K Ciosek, K Hofmann, AD Dragan, R Shah
arXiv preprint arXiv:2101.05507, 2021
272021
Imitation learning by reinforcement learning
K Ciosek
arXiv preprint arXiv:2108.04763, 2021
232021
Alternating optimisation and quadrature for robust control
S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, M Osborne, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
232018
Amrl: Aggregated memory for reinforcement learning
J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann
International Conference on Learning Representations, 2019
202019
Regularized policies are reward robust
H Husain, K Ciosek, R Tomioka
International Conference on Artificial Intelligence and Statistics, 64-72, 2021
182021
Information directed reward learning for reinforcement learning
D Lindner, M Turchetta, S Tschiatschek, K Ciosek, A Krause
Advances in Neural Information Processing Systems 34, 3850-3862, 2021
152021
Fourier policy gradients
M Fellows, K Ciosek, S Whiteson
International Conference on Machine Learning, 1486-1495, 2018
152018
Drift: Deep reinforcement learning for functional software testing
L Harries, RS Clarke, T Chapman, SV Nallamalli, L Ozgur, S Jain, ...
arXiv preprint arXiv:2007.08220, 2020
142020
Value iteration with options and state aggregation
K Ciosek, D Silver
arXiv preprint arXiv:1501.03959, 2015
142015
Estimating α-rank by maximizing information gain
T Rashid, C Zhang, K Ciosek
Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5673-5681, 2021
82021
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20