
Deep Reinforcement Learning Hands-On
Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
Publisher:Packt Publishing Limited
By: Oleg Vasilev, Maxim Lapan and Martijn van Otterlo
Paid access
|Jun 2024Table of Contents
- What is Reinforcement Learning?
- OpenAI Gym
- Deep Learning with PyTorch
- The Cross-Entropy Method
- Tabular Learning and the Bellman Equation
- Deep Q-Networks
- DQN Extensions
- Stocks Trading Using RL
- Policy Gradients – An Alternative
- The Actor-Critic Method
- Asynchronous Advantage Actor-Critic
- Chatbots Training with RL
- Web Navigation
- Continuous Action Space
- Trust Regions – TRPO, PPO, and ACKTR
- Black-Box Optimization in RL
- Beyond Model-Free – Imagination
- AlphaGo Zero
PDF ISBN: 978-1-78883-930-3
Publisher: Packt Publishing Limited
Copyright owner: © 2018 Packt Publishing Limited
Publication date: 2024
Language: English
Pages: 546
Related subjects:
