Transfer-Learning-in-Reinforcement-Learning
Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations
Final Report
Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations
Cite this work
Nathan Beck, Abhiramon Rajasekharan, Hieu Tran, "Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations", 2021
Project description
Transfer learning approaches in reinforcement learning aim to assist agents in learning their target domains by leveraging the knowledge learned from other agents that have been trained on similar source domains. For example, recent research focus within this space has been placed on knowledge transfer between tasks that have different transition dynamics and reward functions; however, little focus has been placed on knowledge transfer between tasks that have different action spaces.
In this paper, we approach the task of transfer learning between domains that differ in action spaces. We present a reward shaping method based on source embedding similarity that is applicable to domains with both discrete and continuous action spaces. The efficacy of our approach is evaluated on transfer to restricted action spaces in the Acrobot-v1 and Pendulum-v0 domains (Brockman et al. 2016).
Our presentations
Our Google Colab
https://colab.research.google.com/drive/1cQCV9Ko-prpB8sH6FlB4oj781On-ut_w?usp=sharing
Setup
- Clone our repository
- Install Gym
Using pip:
pip install gym
Or Building from Source
git clone https://github.com/openai/gym
cd gym
pip install -e .
How to run?
Run with python IDE
- Open
main.py
ormain_multiple_run.py
- Modify
env_name
andalgorithm
that you want to run - Modify parameters in
transfer_execute
function if needed - Log will be printed out to the terminal and the plotting result will be shown on the new windows.
Run with Google Colab
Follow our sample in file Reward_Shaping_TL.ipynb
to run your own colab.
Implemented Algorithms in Stable-Baseline3
Name | Recurrent | Box |
Discrete |
MultiDiscrete |
MultiBinary |
Multi Processing |
---|---|---|---|---|---|---|
A2C | |
|
|
|
|
|
DDPG | |
|
|
|
|
|
DQN | |
|
|
|
|
|
HER | |
|
|
|
|
|
PPO | |
|
|
|
|
|
SAC | |
|
|
|
|
|
TD3 | |
|
|
|
|
|
QR-DQN1 | |
|
|
|
|
|
TQC1 | |
|
|
|
|
|
Maskable PPO1 | |
|
|
|
|
|
1: Implemented in SB3 Contrib GitHub repository.
Actions gym.spaces
:
Box
: A N-dimensional box that containes every point in the action space.Discrete
: A list of possible actions, where each timestep only one of the actions can be used.MultiDiscrete
: A list of possible actions, where each timestep only one action of each discrete set can be used.MultiBinary
: A list of possible actions, where each timestep any of the actions can be used in any combination.
Refercences
- OpenAI Gym repo
- OpenAI Gym website
- Stable Baselines 3 repo
- Robotschool repo
- Gyem extension repos - This python package is an extension to OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement learning, etc.)
- Example code of TL in DL repo
- Retro Contest - a transfer learning contest that measures a reinforcement learning algorithm’s ability to generalize from previous experience (hosted by OpenAI) link
- Rainbow: Combining Improvements in Deep Reinforcement Learning (repo), (paper)
- Experience replay (link)
- Solving RL classic control (link)
Related papers
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation (paper), (repo)
- Deep Transfer Reinforcement Learning for Text Summarization (paper),(repo)
- Using Transfer Learning Between Games to Improve Deep Reinforcement Learning Performance and Stability (paper), (poster)
- Multi-Source Policy Aggregation for Transfer Reinforcement Learning between Diverse Environmental Dynamics (IJCAI 2020) (paper), (repo)
- Using Transfer Learning Between Games to Improve Deep Reinforcement Learning Performance and Stability (paper), (poster)
- Deep Reinforcement Learning and Transfer Learning with Flappy Bird (paper), (poster)
- Decoupling Dynamics and Reward for Transfer Learning (paper), (repo)
- Progressive Neural Networks (paper)
- Deep Learning for Video Game Playing (paper)
- Disentangled Skill Embeddings for Reinforcement Learning (paper)
- Playing Atari with Deep Reinforcement Learning (paper)
- Dueling Network Architectures for Deep Reinforcement Learning (paper)
- ACTOR-MIMIC DEEP MULTITASK AND TRANSFER REINFORCEMENT LEARNING (paper)
- DDPG (link)
Contributors
- Nathan Beck [email protected]
- Abhiramon Rajasekharan [email protected]
- Trung Hieu Tran [email protected]