|
![]() |
|||
|
||||
OverviewWhile deep reinforcement learning (RL) has fueled multiple high-profile successes in machine learning, it is held back from more widespread adoption by its often poor data efficiency and the limited generality of the policies it produces. A promising approach for alleviating these limitations is to cast the development of better RL algorithms as a machine learning problem itself in a process called meta-RL. Meta-RL considers a family of machine learning (ML) methods that learn to reinforcement learn. That is, meta-RL methods use sample-inefficient ML to learn sample-efficient RL algorithms, or components thereof. Meta-RL is most commonly studied in a problem setting where, given a distribution of tasks, the goal is to learn a policy that is capable of adapting to any new task from the task distribution with as little data as possible. In this monograph, the meta-RL problem setting is described in detail as well as its major variations. At a high level the book discusses how meta-RL research can be clustered based on the presence of a task distribution and the learning budget available for each individual task. Using these clusters, the meta-RL algorithms and applications are surveyed. The monograph concludes by presenting the open problems on the path to making meta-RL part of the standard toolbox for a deep RL practitioner. Full Product DetailsAuthor: Jacob Beck , Risto Vuorio , Evan Zheran Liu , Zheng XiongPublisher: now publishers Inc Imprint: now publishers Inc Weight: 0.257kg ISBN: 9781638285403ISBN 10: 1638285403 Pages: 176 Publication Date: 03 April 2025 Audience: Professional and scholarly , Professional & Vocational Format: Paperback Publisher's Status: Active Availability: In Print ![]() This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us. Table of Contents1. Introduction 2. Background 3. Few-shot Meta-RL 4. Many-shot Meta-RL 5. Applications 6. Open Problems 7. Conclusion Appendix ReferencesReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |