Description |
1 online resource (xiii, 344 pages) |
Series |
Lecture notes in artificial intelligence ; 7188 |
|
Lecture notes in computer science, 0302-9743 |
|
LNCS sublibrary. SL 7, Artificial intelligence |
|
Lecture notes in computer science. Lecture notes in artificial intelligence ; 7188
|
|
Lecture notes in computer science.
|
|
LNCS sublibrary. SL 7, Artificial intelligence.
|
Contents |
Invited Talk: UCRL and Autonomous Exploration / Peter Auer -- Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning / Kristian Kersting -- Invited Talk: PRISM -- Practical RL: Representation, Interaction, Synthesis, and Mortality / Peter Stone -- Invited Talk: Towards Robust Reinforcement Learning Algorithms / Csaba Szepesvári -- Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits / Francis Maes, Louis Wehenkel and Damien Ernst -- Goal-Directed Online Learning of Predictive Models / Sylvie C.W. Ong, Yuri Grinberg and Joelle Pineau -- Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control / Matthew Robards and Peter Sunehag -- Active Learning of MDP Models / Mauricio Araya-López, Olivier Buffet, Vincent Thomas and François Charpillet -- Handling Ambiguous Effects in Action Learning / Boris Lesner and Bruno Zanuttini -- Feature Reinforcement Learning in Practice / Phuong Nguyen, Peter Sunehag and Marcus Hutter -- Reinforcement Learning with a Bilinear Q Function / Charles Elkan -- l1-Penalized Projected Bellman Residual / Matthieu Geist and Bruno Scherrer |
|
Regularized Least Squares Temporal Difference Learning with Nested l2 and l1 Penalization / Matthew W. Hoffman, Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos -- Recursive Least-Squares Learning with Eligibility Traces / Bruno Scherrer and Matthieu Geist -- Value Function Approximation through Sparse Bayesian Modeling / Nikolaos Tziortziotis and Konstantinos Blekas -- Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics / Pablo Samuel Castro and Doina Precup -- Unified Inter and Intra Options Learning Using Policy Gradient Methods / Kfir Y. Levy and Nahum Shimkin -- Options with Exceptions / Munu Sairamesh and Balaraman Ravindran -- Robust Bayesian Reinforcement Learning through Tight Lower Bounds / Christos Dimitrakakis -- Optimized Look-ahead Tree Search Policies / Francis Maes, Louis Wehenkel and Damien Ernst -- A Framework for Computing Bounds for the Return of a Policy / Cosmin Păduraru, Doina Precup and Joelle Pineau -- Transferring Evolved Reservoir Features in Reinforcement Learning Tasks / Kyriakos C. Chatzidimitriou, Ioannis Partalas, Pericles A. Mitkas and Ioannis Vlahavas |
|
Transfer Learning via Multiple Inter-task Mappings / Anestis Fachantidis, Ioannis Partalas, Matthew E. Taylor and Ioannis Vlahavas -- Multi-Task Reinforcement Learning: Shaping and Feature Selection / Matthijs Snel and Shimon Whiteson -- Transfer Learning in Multi-Agent Reinforcement Learning Domains / Georgios Boutsioukis, Ioannis Partalas and Ioannis Vlahavas -- An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings / Ioannis Lambrou, Vassilis Vassiliades and Chris Christodoulou -- Bayesian Multitask Inverse Reinforcement Learning / Christos Dimitrakakis and Constantin A. Rothkopf -- Batch, Off-Policy and Model-Free Apprenticeship Learning / Edouard Klein, Matthieu Geist and Olivier Pietquin -- Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot / Seiya Kuroda, Kazuteru Miyazaki and Hiroaki Kobayashi -- MapReduce for Parallel Reinforcement Learning / Yuxi Li and Dale Schuurmans -- Compound Reinforcement Learning: Theory and an Application to Finance / Tohgoroh Matsui, Takashi Goto, Kiyoshi Izumi and Yu Chen -- Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning / Kazuteru Miyazaki and Masaaki Ida |
Summary |
This book constitutes revised and selected papers of the 9th European Workshop on Reinforcement Learning, EWRL 2011, which took place in Athens, Greece in September 2011. The papers presented were carefully reviewed and selected from 40 submissions. The papers are organized in topical sections online reinforcement learning, learning and exploring MDPs, function approximation methods for reinforcement learning, macro-actions in reinforcement learning, policy search and bounds, multi-task and transfer reinforcement learning, multi-agent reinforcement learning, apprenticeship and inverse reinforcement learning and real-world reinforcement learning |
Analysis |
Computer science |
|
Computer software |
|
Database management |
|
Artificial intelligence |
|
Computation by Abstract Devices |
|
Algorithm Analysis and Problem Complexity |
|
Information Systems Applications (incl. Internet) |
|
Probability and Statistics in Computer Science |
Bibliography |
Includes bibliographical references and author index |
Notes |
English |
Subject |
Reinforcement learning -- Congresses
|
|
Informatique.
|
|
Reinforcement learning
|
Genre/Form |
proceedings (reports)
|
|
Conference papers and proceedings
|
|
Conference papers and proceedings.
|
|
Actes de congrès.
|
Form |
Electronic book
|
Author |
Sanner, Scott.
|
|
Hutter, Marcus.
|
ISBN |
9783642299469 |
|
3642299466 |
|
3642299458 |
|
9783642299452 |
|