Code for:
Below are links to a variety of software related to examples and exercises in the book,
organized by chapters (some files appear in multiple places). See particularly the
Mountain Car code.
Most of the rest of the code is written in Common Lisp and requires utility
routines available here. For the graphics, you will need the
the packages for G
and in some cases my graphing tool.
Even if you can not run this code, it still may clarify
some of the details of the experiments. However, there is no guarantee that
the examples in the book were run using exactly the software given.
This code also has not been extensively tested or documented and is being made available "as is".
If you have corrections, extensions, additions or improvements of any kind, please send
them to me at rich@richsutton.com for inclusion here.
- Chapter 1: Introduction
- Chapter 2: Evaluative Feedback
- Chapter 3: The Reinforcement Learning Problem
- Chapter 4: Dynamic Programming
- Chapter 5: Monte Carlo Methods
- Chapter 6: Temporal-Difference Learning
- Chapter 7: Eligibility Traces
- Chapter 8: Generalization and Function Approximation
- Chapter 9: Planning and Learning
- Chapter 10: Dimensions of Reinforcement Learning
- Chapter 11: Case Studies
For other RL software see the
Reinforcement Learning Repository at Michigan State University and
here.