GitHub - AnubhavGupta3377/Reinforcement-Learning: Reinforcement Learning Algorithms implementations and Notes based on Sutton and Barto, and David Silver's online course

Overview

This Repository contains my notes from the course "Introduction to Reinforcement Learning" taught by David Silver at DeepMind. Implementation of some of the popular reinforcement learning algorithms is also available.

Lecture notes are primarily based on the course videos, slides and Reinforcemnt Learning textbook by Sutton and Barto mentioned below. All algorithms are implemented in Python3.7.

Resources

Online Course: David Silver's Reinforcement Learning Course
Textbook: Reinforcement Learning: An Introduction (2nd Edition)

Lecture Notes details

Introduction to reinforcement learning
Markov decision processes and Bellman equations
Dynamic programming methods for prediction and control
Model free prediction - Monte-Carlo and temporal-difference prediction
Model free control, Sarsa, Q-learning
Value function approximation methods, deep Q-learning (DQN)
Policy gradient methods
Planning and Learning

List of Implemented Algorithms

Following algorithms are currently implemented:

Dynamic Programming (DP)
- Policy evaluation
- Policy improvement and policy iteration
- Value iteration
Monte Carlo (MC)
- Incremental every-visit MC policy evaluation
- On-policy control using epsilon-greedy policy evaluation
- Off-policy control using weighted importance sampling
Temporal-Difference (TD)
- Sarsa
- Sarsa(lambda) using eligibility traces
- Q-Learning
Function Approximation Methods
- Semi-gradient Q-learning
Deep Q-Networks (DQN)
DQN with Double Q-learning (Double DQN)
Policy Gradient Methods
- REINFORCE-with-baseline: Monte-Carlo Policy Gradient
Dyna-Q algorithm (Planning and Learning)

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
Algorithms Implementations		Algorithms Implementations
Lecture Notes		Lecture Notes
images		images
LICENSE		LICENSE
Readme.md		Readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Resources

Lecture Notes details

List of Implemented Algorithms

About

Releases

Packages

Languages

License

AnubhavGupta3377/Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Overview

Resources

Lecture Notes details

List of Implemented Algorithms

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages