Workshop
in Reinforcement Learning
(0368-3500-07)
Workshop project:
- The project will be done in groups of 2-3 students.
- Each group will implement a learning algorithm for
a board game.
- The background material needed would be covered
during the lectures.
- Requirements
document
Suggested Projects
More Challenging Projects
Workshop Outline
Week 1: Min Max
Trees
Week 2: Introduction
to Reinforcement Learning: Model and Planning.
Week 3: Reinforcement
learning: Learning (small state space)
Week 4: Reinforcement
learning: Learning (large state space)
Week 5: Simple
Graphics (GUI)
Teams and Games
- Nira Amit and Assaf Shtilman
- Aner Mazursky and Amit Ben-David
- Chen Frenkel and Ron Frenkel
- Ori Lahav and Ariel Lvantel
- Shelly
Machleb and Benny Davidovich
- Michal
Samuel and Yuval Kalev
(email Elad with any spelling
mistakes)
Sample Code
Basic Tic Toe
implemented in C++.
Basic Tic
Toe implemented in Java.
Bibliography [for background]
- A.G.
Barto and R.S., Reinforcement
Learning, MIT Press, 1998.
- Bertsekas,
D. P. and Tsitsiklis, J. N. (1996). Neural Dynamic Programming.
Athena Scientific, Belmont,
MA.
- Gardner (1981).
Samuel's checkers player. In Barr, A. and Feigenbaum, E. A., editors, The
Handbook of Artificial Intelligence, I, pages 84--108. William
Kaufmann, Los Altos, CA.
- Samuel,
A. L. (1967). Some studies in machine learning using the game of checkers.
II---Recent progress. IBM Journal on Research and Development,
pages 601--617.
- Tesauro,
G. J. (1994). TD--gammon, a self-teaching backgammon program, achieves
master-level play. Neural Computation, 6(2):215--219.
- Tesauro,
G. J. (1995). Temporal difference learning and TD-Gammon. Communications
of the ACM, 38:58--68.
- Tsitsiklis,
J. N. and Van Roy, B. (1996). Feature-based methods for large scale
dynamic programming. Machine Learning, 22:59--94.
Previous
Workshops: 1 2 3