Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Last update: Jan 11, 2022

Related tags

Deep Learning c2d

Overview

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Code & Data Appendix for Conjugated Discrete Distributions for Distributional Reinforcement Learning.

Björn Lindenberg, Jonas Nordqvist, Karl-Olof Lindahl

Citation

If you use C2D in your research we ask you to please cite the following:

@misc{lindenberg2021conjugated,
      title={Conjugated Discrete Distributions for Distributional Reinforcement Learning}, 
      author={Björn Lindenberg and Jonas Nordqvist and Karl-Olof Lindahl},
      year={2021},
      eprint={2112.07424},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Data

Agent scores are available in the data folder.
Raw experiment data for each seed is available in the folder data/supplementary.
Each seed was run on a VM Ubuntu 20.04 server with 64GB RAM, a single Nvidia Quadro P4000 GPU and TensorFlow 2.5.

Code

The C++20 source code that handles ALE and transition buffering resides in src.
The agent code, written in TensorFlow/Python (with algorithms), can be viewed in c2d.
Requires cuDNN, TensorFlow 2.X, python3, The Arcade Learning Environment, C++20 and LZ4. For a comprehensive view of dependencies, have a look at our VM setup files in install_scripts.

Atari Games

To avoid legal issues, our Atari 2600 rom file directory ale_roms is left empty. However the corresponding binaries are widely available for import from elsewhere, e.g., Breakout or breakout.bin can be extracted from the atari-py Python package.

Library

The directory ale_roms needs to be populated by the relevant binaries of different Atari games. ALE's checksum file md5.txt for checking binary compatibility is present in the root directory.
The initial library setup or any changes to settings.cmake will require compilation by
```
bash build_lib.sh
```
One can train for one iteration (1M frames) in Breakout with:
```
python3 run.py --game breakout --tag test --iterations 1
```

Figures

Performance Profile (Deep reinforcement learning at the edge of the statistical precipice, Agarwal et al. 2021)

Sampling Efficiency: Mean and Median

Training Graphs

Strong/Weak Examples

Support Evolution

Universal Probability Distributions with Optimal Transport and Convex Optimization

Sylvester normalizing flows for variational inference Pytorch implementation of Sylvester normalizing flows, based on our paper: Sylvester normalizing

172 Dec 13, 2022

Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions

Natural Posterior Network This repository provides the official implementation o

54 Dec 6, 2022

Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift

This repository contains the official code of OSTAR in "Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift" (ICLR 2022).

5 Dec 6, 2022

SurfEmb (CVPR 2022) - SurfEmb: Dense and Continuous Correspondence Distributions

SurfEmb SurfEmb: Dense and Continuous Correspondence Distributions for Object Pose Estimation with Learnt Surface Embeddings Rasmus Laurvig Haugard, A

56 Nov 19, 2022

PyTorch package for the discrete VAE used for DALL·E.

Overview [Blog] [Paper] [Model Card] [Usage] This is the official PyTorch package for the discrete VAE used for DALL·E. Installation Before running th

9.5k Jan 5, 2023

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation This project hosts the code for implementing the DCT-MASK algorithms

57 Nov 27, 2022

This is 2nd term discrete maths project done by UCU students that uses backtracking to solve various problems.

Backtracking Project Sponsors This is a project made by UCU students: Olha Liuba - crossword solver implementation Hanna Yershova - sudoku solver impl

4 Oct 17, 2021

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations Implementation of the method described in the Speech Resynthesis from Di

253 Jan 6, 2023

Auto HMM: Automatic Discrete and Continous HMM including Model selection

29 Dec 7, 2022

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Related tags

Overview

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Citation

Data

Code

Atari Games

Library

Figures

Performance Profile (Deep reinforcement learning at the edge of the statistical precipice, Agarwal et al. 2021)

Sampling Efficiency: Mean and Median

Training Graphs

Strong/Weak Examples

Support Evolution

You might also like...

Universal Probability Distributions with Optimal Transport and Convex Optimization

Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions

Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift

SurfEmb (CVPR 2022) - SurfEmb: Dense and Continuous Correspondence Distributions

PyTorch package for the discrete VAE used for DALL·E.

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

This is 2nd term discrete maths project done by UCU students that uses backtracking to solve various problems.

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Auto HMM: Automatic Discrete and Continous HMM including Model selection

Owner

Distributional Sliced-Wasserstein distance code

A Distributional Approach To Controlled Text Generation

A working implementation of the Categorical DQN (Distributional RL).

Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

Drslmarkov - Distributionally Robust Structure Learning for Discrete Pairwise Markov Networks

Pytorch implementation of Generative Models as Distributions of Functions 🌿

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

HiddenMarkovModel implements hidden Markov models with Gaussian mixtures as distributions on top of TensorFlow