Full Product Details

Author: Maxim Lapan
Publisher: Packt Publishing Limited
Imprint: Packt Publishing Limited
Edition: 2nd Revised edition
ISBN: 9781838826994
ISBN 10: 1838826998
Pages: 826
Publication Date: 31 January 2020
Audience: Professional and scholarly, Professional & Vocational
Format: Paperback
Publisher's Status: Active
Availability: Available To Order. We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Table of Contents

- What Is Reinforcement Learning?
- OpenAI Gym
- Deep Learning with PyTorch
- The Cross-Entropy Method
- Tabular Learning and the Bellman Equation
- Deep Q-Networks
- Higher-Level RL Libraries
- DQN Extensions
- Ways to Speed up RL
- Stocks Trading Using RL
- Policy Gradients - an Alternative
- The Actor-Critic Method
- Asynchronous Advantage Actor-Critic
- Training Chatbots with RL
- The TextWorld Environment
- Web Navigation
- Continuous Action Space
- RL in Robotics
- Trust Regions - PPO, TRPO, ACKTR, and SAC
- Black-Box Optimization in RL
- Advanced Exploration
- Beyond Model-Free - Imagination
- AlphaGo Zero
- RL in Discrete Optimisation
- Multi-agent RL

Author Information

Maxim Lapan is a deep learning enthusiast and independent researcher. His background and 15 years of experience as a software developer and systems architect range from low-level Linux kernel driver development to performance optimization and the design of distributed applications running on thousands of servers. With extensive experience in big data, machine learning, and large parallel distributed HPC and non-HPC systems, he is able to explain complicated concepts in simple words and vivid examples. His current areas of interest lie in practical applications of deep learning, such as deep natural language processing and deep reinforcement learning. Maxim lives in Moscow, Russian Federation, with his family.

Countries Available: All regions