Prashanth L.A.

Zitiert von

	Alle	Seit 2019
Zitate	2267	1505
h-index	18	17
i10-index	31	28

380

190

285

201120122013201420152016201720182019202020212022202320249 15 50 64 81 99 76 127 180 207 272 292 380 173

Öffentlicher Zugriff

Alle anzeigen

19 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Shalabh BhatnagarProfessor in the Department of Computer Science and Automation, Indian Institute of ScienceBestätigte E-Mail-Adresse bei iisc.ac.in
Michael C. FuUniversity of MarylandBestätigte E-Mail-Adresse bei umd.edu
Mohammad GhavamzadehAmazonBestätigte E-Mail-Adresse bei amazon.com
Krishna JagannathanProfessor, Department of Electrical Engineering, IIT MadrasBestätigte E-Mail-Adresse bei ee.iitm.ac.in
H L PrasadChairman and CTO at Astrome TechnologiesBestätigte E-Mail-Adresse bei csa.iisc.ernet.in
Rémi MunosGoogle DeepMindBestätigte E-Mail-Adresse bei inria.fr
Ravi Kumar KollaIIT MadrasBestätigte E-Mail-Adresse bei ee.iitm.ac.in
Csaba SzepesvariDeepMind & University of AlbertaBestätigte E-Mail-Adresse bei cs.ualberta.ca
Sanjay P. BhatTata Consultancy Services LimitedBestätigte E-Mail-Adresse bei tcs.com
Cheng JiePinterest LLC, University of Maryland, College Park, Walmart Global TechBestätigte E-Mail-Adresse bei pinterest.com
Nirmit DesaiIBM ResearchBestätigte E-Mail-Adresse bei us.ibm.com
Nirav BhavsarM.S. Scholar in the Department of Computer Science and Engineering, Indian Institute of TechnologyBestätigte E-Mail-Adresse bei cse.iitm.ac.in
Nithia VijayanResearch Fellow, School of Computing, National University of SingaporeBestätigte E-Mail-Adresse bei comp.nus.edu.sg
Aditya GopalanIndian Institute of Science, BangaloreBestätigte E-Mail-Adresse bei iisc.ac.in
Doina PrecupDeepMind and McGill UniversityBestätigte E-Mail-Adresse bei cs.mcgill.ca
gargi dasguptaIBM Research LabBestätigte E-Mail-Adresse bei in.ibm.com
Gandharv PatilMcGill University, MilaBestätigte E-Mail-Adresse bei mail.mcgill.ca
Dheeraj NagarajResearch Scientist, GoogleBestätigte E-Mail-Adresse bei google.com
Steven I. MarcusProfessor of Electrical and Computer Engineering, University of MarylandBestätigte E-Mail-Adresse bei umd.edu
Andras GyorgyDeepMindBestätigte E-Mail-Adresse bei google.com

Folgen

Prashanth L.A.

Associate Professor, Department of Computer Science and Engg., IIT Madras

Bestätigte E-Mail-Adresse bei cse.iitm.ac.in - Startseite

Reinforcement learning simulation optimization multi-armed bandits


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods S Bhatnagar, HL Prasad, LA Prashanth Springer 434, 302, 2013	425*	2013
Reinforcement Learning With Function Approximation for Traffic Signal Control P LA, S Bhatnagar Intelligent Transportation Systems, IEEE Transactions on, 1-10, 2011	386	2011
Actor-critic algorithms for risk-sensitive MDPs P La, M Ghavamzadeh Advances in neural information processing systems 26, 2013	309	2013
Reinforcement learning with average cost for adaptive control of traffic lights at intersections LA Prashanth, S Bhatnagar 2011 14th International IEEE Conference on Intelligent Transportation …, 2011	90	2011
Cumulative prospect theory meets reinforcement learning: Prediction and control LA Prashanth, C Jie, M Fu, S Marcus, C Szepesvári International Conference on Machine Learning, 1406-1415, 2016	88	2016
Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs LA Prashanth, M Ghavamzadeh arXiv preprint arXiv:1403.6530, 2014	82	2014
Policy gradients for CVaR-constrained MDPs LA Prashanth International Conference on Algorithmic Learning Theory, 155-169, 2014	72	2014
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games HL Prasad, P LA, S Bhatnagar Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	71	2015
Concentration of risk measures: A Wasserstein distance approach SP Bhat, P LA Advances in neural information processing systems 32, 2019	56	2019
Concentration bounds for empirical conditional value-at-risk: The unbounded case RK Kolla, LA Prashanth, SP Bhat, K Jagannathan Operations Research Letters 47 (1), 16-20, 2019	56	2019
Threshold tuning using stochastic optimization for graded signal control LA Prashanth, S Bhatnagar IEEE Transactions on Vehicular Technology 61 (9), 3865-3880, 2012	54	2012
Stochastic optimization in a cumulative prospect theory framework C Jie, LA Prashanth, M Fu, S Marcus, C Szepesvári IEEE Transactions on Automatic Control 63 (9), 2867-2882, 2018	52	2018
On TD (0) with function approximation: Concentration bounds and a centered variant with exponential convergence N Korda, P La International conference on machine learning, 626-634, 2015	52	2015
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions LA Prashanth, K Jagannathan, RK Kolla Proceedings of the 37th International Conference on Machine Learning, 5577-5586, 2020	51	2020
Adaptive system optimization using random directions stochastic approximation LA Prashanth, S Bhatnagar, M Fu, S Marcus IEEE Transactions on Automatic Control 62 (5), 2223-2238, 2017	37	2017
Risk-sensitive reinforcement learning: A constrained optimization viewpoint LA Prashanth, M Fu arXiv 2018, 2018	35	2018
Risk-sensitive reinforcement learning via policy gradient search LA Prashanth, MC Fu Foundations and Trends® in Machine Learning 15 (5), 537-693, 2022	29	2022
Analysis of stochastic approximation for efficient least squares regression and LSTD LA Prashanth, N Korda, R Munos arXiv preprint arXiv:1306.2557, 2013	26*	2013
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles X Hu, LA Prashanth, A György, C Szepesvári International Conference on Artificial Intelligence and Statistics (AISTATS …, 2016	18	2016
Simultaneous perturbation Newton algorithms for simulation optimization S Bhatnagar, LA Prashanth Journal of Optimization Theory and Applications 164, 621-643, 2015	18	2015

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren