A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Pulkit Khandelwal

Last update: Dec 28, 2022

Related tags

Deep Learning Reinforcement-Learning-Notebooks

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website.

These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Disclaimer: The code is a little messy. I'd written this when I was not a Pythonista. If you would like to clean them up and want to make it into a nice interface, feel free to contact me. I will be very pleased to collaborate. If you use them then please cite the source and also mention the credits as listed below. Also, email me with ways to improve, let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

Comments

Book and reference information

You are referring to a book... Which exactly? Maybe answered in reference #5, however, it appears the link for your reference #5 is currently broken.

opened by marcoshaw 6

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Algo-ScriptML Python implementations of some of the fundamental Machine Learning models and algorithms from scratch. The goal of this project is not t

81 Nov 26, 2022

Megaverse is a new 3D simulation platform for reinforcement learning and embodied AI research

Megaverse Megaverse is a new 3D simulation platform for reinforcement learning and embodied AI research. The efficient design of the engine enables ph

191 Dec 23, 2022

SenseNet is a sensorimotor and touch simulator for deep reinforcement learning research

59 Feb 25, 2022

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

338 Dec 29, 2022

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Light Gradient Boosting Machine LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed a

14.5k Jan 8, 2023

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Off-Policy Multi-Agent Reinforcement Learning (MARL) Algorithms This repository contains implementations of various off-policy multi-agent reinforceme

183 Dec 28, 2022

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Related tags

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

You might also like...

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Megaverse is a new 3D simulation platform for reinforcement learning and embodied AI research

SenseNet is a sensorimotor and touch simulator for deep reinforcement learning research

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

A toolkit for developing and comparing reinforcement learning algorithms.

PyTorch implementations of deep reinforcement learning algorithms and environments

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Comments

Book and reference information

Owner

Pulkit Khandelwal

Reinforcement learning framework and algorithms implemented in PyTorch.

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

arxiv-sanity, but very lite, simply providing the core value proposition of the ability to tag arxiv papers of interest and have the program recommend similar papers.

Research on Tabular Deep Learning (Python package & papers)

Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

A list of papers regarding generalization in (deep) reinforcement learning

A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

Automatic voice-synthetised summaries of latest research papers on arXiv

Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning