A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning

Mathieu Godbout

Last update: Nov 19, 2021

CvarAdversarialRL

Official code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning".

Initial setup

Create a virtual environment:

python3 -m venv ${YOUR_VENVS_DIR}/cvarRL

Then activate it:

source ${YOUR_VENVS_DIR}/cvarRL/bin/activate

Install the required packages:

pip3 install -r requirements.txt

Add the repository folder to your PYTHONPATH, where YOUR_PARENT_DIR is the directory that contains your copy of CvarAdversarialRL:

export PYTHONPATH="${PYTHONPATH}:${YOUR_PARENT_DIR}/CvarAdversarialRL"
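
To confirm that the export took effect in the current shell, a quick sanity check along the following lines may help. This is only a sketch: the helper name `on_pythonpath` is hypothetical and not part of the repository, and it assumes the repository folder is named CvarAdversarialRL as in the command above.

```python
import os


def on_pythonpath(dir_name, pythonpath=None):
    """Return True if some PYTHONPATH entry ends with the folder `dir_name`.

    `pythonpath` defaults to the PYTHONPATH of the current process; passing
    a string explicitly makes the check easy to try without touching the
    environment. (Hypothetical helper, not part of the repository.)
    """
    if pythonpath is None:
        pythonpath = os.environ.get("PYTHONPATH", "")
    # Split on the platform separator (":" on Linux/macOS) and skip empty entries.
    entries = [p for p in pythonpath.split(os.pathsep) if p]
    # Compare the last path component of each entry, ignoring trailing slashes.
    return any(
        os.path.basename(os.path.normpath(p)) == dir_name for p in entries
    )


if __name__ == "__main__":
    print(on_pythonpath("CvarAdversarialRL"))
```

Run it from the same shell session in which the export was issued; if it prints False, the variable was most likely set in a different session or YOUR_PARENT_DIR was empty.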

Running the experiments and collecting figures

Scripts are provided to make our results easy to reproduce. They can be found in the scripts folder.

To run experiments:

./scripts/run_experiments.sh

To generate figures:

./scripts/generate_figures.sh
