TorchGRL is the source code for our paper Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments for IV 2022.

XXQQ

Last update: Dec 9, 2022

Related tags

Deep Learning TorchGRL

Overview

TorchGRL

TorchGRL is the source code for our paper Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments for IV 2022.TorchGRL is a modular simulation framework that integrates different GRL algorithms and SUMO simulation platform to realize the simulation of multi-agents decision-making algorithms in mixed traffic environment. You can adjust the test scenarios and the implemented GRL algorithm according to your needs.

Preparation
Installation
Instruction
Tutorial

Preparation

Before starting to carry out some relevant works on our framework, some preparations are required to be done.

Hardware

Our framework is developed based on a laptop, and the specific configuration is as follows:

Operating system: Ubuntu 20.04
RAM: 32 GB
CPU: Intel (R) Core (TM) i9-10980HK CPU @ 2.40GHz
GPU: RTX 2070

It should be noted that our program must be reproduced under the Ubuntu 20.04 operating system, and we strongly recommend using GPU for training.

Development Environment

Before compiling the code of our framework, you need to install the following development environment:

Ubuntu 20.04 with latest GPU driver
Pycharm
Anaconda
CUDA 11.1
cudnn-11.1, 8.0.5.39

Installation

Please download our GRL framework repository first:

git clone https://github.com/Jacklinkk/TorchGRL.git

Then enter the root directory of TorchGRL:

cd TorchGRL

and please be sure to run the below commands from /path/to/TorchGRL.

Installation of FLOW

The FLOW library will be firstly installed.

Firstly, enter the flow directory:

cd flow

Then, create a conda environment from flow library:

conda env create -f environment.yml

Activate conda environment:

conda activate TorchGCQ

Install flow from source code:

python setup.py develop

Installation of SUMO

SUMO simulation platform will be installed. Please make sure to run the below commands in the "TorchGRL" virtual environment.

Install via pip:

pip install eclipse-sumo

Setting in Pycharm:

In order to adopt SUMO correctly, you need to define the environment variable of SUMO_HOME in Pycharm. The specific directory is:

/home/…/.conda/envs/TorchGCQ/lib/python3.7/site-packages/sumo

Setting in Ubuntu:

At first, run:

gedit ~/.bashrc

then copy the path name of SUMO_HOME to “~/.bashrc”:

export SUMO_HOME=“/home/…/.conda/envs/TorchGCQ/lib/python3.7/site-packages/sumo”

Finally, run:

source ~/.bashrc

Installation of Pytorch and related libraries

Please make sure to run the below commands in the "TorchGRL" virtual environment.

Installation of Pytorch:

We use Pytorch version 1.9.0 for development under a specific version of CUDA and cudnn.

pip3 install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html

Installation of pytorch geometric:

Pytorch geometric is a Graph Neural Network (GNN) library upon Pytorch

pip install torch-scatter torch-sparse torch-cluster torch-spline-conv torch-geometric -f https://data.pyg.org/whl/torch-1.9.0+cu111.html

Installation of pfrl library

Please make sure to run the below commands in the "TorchGRL" virtual environment.

pfrl is a deep reinforcement learning library that implements various algorithms in Python using PyTorch.

Firstly, enter the pfrl directory:

cd pfrl

Then install from source code:

python setup.py develop

Instruction

flow folder

The flow folder is the root directory of the library after the FLOW library is installed through source code, including interface-related programs between DRL algorithms and SUMO platform.

Flow_Test folder

The Flow_Test folder includes the related programs of the test environment configuration; specifically, T_01.py is the core python program. If the program runs successfully, the environment configuration is successful.

pfrl folder

The pfrl folder is the root directory of the library after the deep reinforcement learning pfrl library is installed through source code, including all DRL related programs. The source program can be modified as needed.

GRLNet folder

The GRLNet folder contains the GRL neural network built in the Pytorch environment. You can modify the source code as needed or add your own neural network.

Pytorch_GRL.py constructs the fundamental neural network of GRL algorithms
Pytorch_GRL_Dueling.py constructs the dueling network of GRL algorithms

GRL_utils folder

The GRL_utils folder contains basic functions such as model training and testing, data storage, and curve drawing.

Train_and_Test.py contains the training and testing functions for the GRL model.
Data_Plot_Train.py is the function to plot the training data curve.
Data_Process_Test.py is the function to process the test data.
Fig folder stores the training data curve.
Logging_Training folder stores the training data generated by different GRL algorithms.
Logging_Test folder stores the testing data generated by different GRL algorithms.

GRL_Simulation folder

The GRL_Simulation folder is the core of our framework, which contains the core simulation program and some related functional programs.

main.py is the main program, containing the definition of FLOW parameters, as well as the controlling (start and end) of the simulation.
controller.py is the definition of vehicle control model based on FLOW library.
environment.py is the core program to build and initialize the simulation environment of SUMO.
network.py defines the road network.
registry_custom.py registers the simulation environment of SUMO to the gym library to realize the connection with GRL algorithms.
specific_environment.py defines the elements in MDPs, including state representation, action space and reward function.
Experiment folder is the core program of co-simulation under different GRL algorithms, including the initialization of the simulation environment, the initialization of the neural network, the training and testing of GRL algorithms, and the preservation of the training and testing results.
GRL_Trained_Models folder stores the trained GRL model when the training process ends.

Tutorial

You can simply run "main.py" in Pycharm to simulate the GRL algorithm, and observe the simulation process in SUMO platform. You can generate training plot such as Reward curve:

Verification of other algorithms

If you want to verify other algorithms, you can develop the source code as needed under the "Experiment folder", and don't forget to change the imported python script in "main.py". In addition, you can also construct your own network in GRLNet folder.

Verification of other traffic scenario

If you want to verify other traffic scenario, you can define a new scenario in "network.py". You can refer to the documentation of SUMO for more details .

Comments

No module named 'flow'

Thank you for sharing your code

i tried to install Flow according to your instructions cd flow conda env create -f environment.yml conda activate TorchGRL python setup.py develop

But may be I had errors when installed Flow

opened by TrinhTuanHung2021 4
Questions about the exploration
Thanks for the wonderful repo and paper! I am wondering if the authors can help me with a few questions. Thanks for your time and insights!

the self. training is always True at https://github.com/Jacklinkk/TorchGRL/blob/830e41e0cba943f3834fbca7004b09fc9604cf79/pfrl/pfrl/agents/dqn.py#L534, even in the testing settings since the "training" is set as True at https://github.com/Jacklinkk/TorchGRL/blob/830e41e0cba943f3834fbca7004b09fc9604cf79/pfrl/pfrl/agent.py#L12, I am not sure why it is always set as True even in the testing.

The second question is related to the above question. In the https://github.com/Jacklinkk/TorchGRL/blob/830e41e0cba943f3834fbca7004b09fc9604cf79/pfrl/pfrl/agents/dqn.py#L527 function, if we set the self. training as False, then batch_action = batch_argmax. Then the algorithm does not work well at all.

In https://github.com/Jacklinkk/TorchGRL/blob/830e41e0cba943f3834fbca7004b09fc9604cf79/pfrl/pfrl/agents/dqn.py#L538, the actions are selected by epsilon_greedy.py, while the epsilon is always outputted as 0.3 and not changed during training if we print it here: https://github.com/Jacklinkk/TorchGRL/blob/830e41e0cba943f3834fbca7004b09fc9604cf79/pfrl/pfrl/explorers/epsilon_greedy.py#L8. Could you please give any ideas about this? (in the standard DQN, the epsilon would decay from large number to a small number)

The paper said the action space is [change to left; go straight; change to right], while the action space is defined as 2 at: https://github.com/Jacklinkk/TorchGRL/blob/830e41e0cba943f3834fbca7004b09fc9604cf79/GRL_Simulation/Experiment/DQN_experiments.py#L113.
opened by DongChen06 0
Could not reproduce the reward

hello authors, thank you so much for implementing a PyTorch version of GCQ, the code is clear and the pytorch is much up to date than the original keras-rl. Could you please help me figure out why in my training and testing periods, the reward is fluctuating in a low level (it did not improve after the train begin, no jump in reward could be observed). Thank you in advance!

opened by wangshuo1994 2

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

Deep GNN, Shallow Sampling Hanqing Zeng, Muhan Zhang, Yinglong Xia, Ajitesh Srivastava, Andrey Malevich, Rajgopal Kannan, Viktor Prasanna, Long Jin, R

117 Dec 20, 2022

PyTorch implementations of deep reinforcement learning algorithms and environments

Deep Reinforcement Learning Algorithms with PyTorch This repository contains PyTorch implementations of deep reinforcement learning algorithms and env

4.7k Jan 4, 2023

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation Prerequisite Please create and activate the following conda envrionment. To r

87 Jan 8, 2023

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

RIIT Our open-source code for RIIT: Rethinking the Importance of Implementation Tricks in Multi-AgentReinforcement Learning. We implement and standard

405 Jan 6, 2023

A library of multi-agent reinforcement learning components and systems

Mava: a research framework for distributed multi-agent reinforcement learning Table of Contents Overview Getting Started Supported Environments System

463 Dec 23, 2022

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Off-Policy Multi-Agent Reinforcement Learning (MARL) Algorithms This repository contains implementations of various off-policy multi-agent reinforceme

183 Dec 28, 2022

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

UPDeT Official Implementation of UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers (ICLR 2021 spotlight) The

96 Dec 22, 2022

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks (MAPDN)

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks (MAPDN) This is the implementation of the paper Multi-Age

83 Jan 6, 2023

CityLearn Challenge Multi-Agent Reinforcement Learning for Intelligent Energy Management, 2020, PikaPika team

Citylearn Challenge This is the PyTorch implementation for PikaPika team, CityLearn Challenge Multi-Agent Reinforcement Learning for Intelligent Energ

10 Oct 10, 2022

TorchGRL is the source code for our paper Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments for IV 2022.

Related tags

Overview

TorchGRL

Preparation

Hardware

Development Environment

Installation

Installation of FLOW

Installation of SUMO

Installation of Pytorch and related libraries

Installation of pfrl library

Instruction

flow folder

Flow_Test folder

pfrl folder

GRLNet folder

GRL_utils folder

GRL_Simulation folder

Tutorial

Verification of other algorithms

Verification of other traffic scenario

You might also like...

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

PyTorch implementations of deep reinforcement learning algorithms and environments

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

A library of multi-agent reinforcement learning components and systems

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks (MAPDN)

CityLearn Challenge Multi-Agent Reinforcement Learning for Intelligent Energy Management, 2020, PikaPika team

Comments

No module named 'flow'

Questions about the exploration

Could not reproduce the reward

Owner

XXQQ

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

DI-smartcross - Decision Intelligence Platform for Traffic Crossing Signal Control

A parallel framework for population-based multi-agent reinforcement learning.

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

Multi-objective gym environments for reinforcement learning.

Deep Reinforcement Learning based Trading Agent for Bitcoin

Urban mobility simulations with Python3, RLlib (Deep Reinforcement Learning) and Mesa (Agent-based modeling)

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning