PyTorch implementation of the implicit Q-learning algorithm (IQL)

Sebastian Dittert

Last update: Dec 30, 2022

Related tags

Deep Learning reinforcement-learning q-learning iql offline-rl implicit-q-learning batch-rl

Overview

Implicit-Q-Learning (IQL)

PyTorch implementation of the implicit Q-learning algorithm IQL (Paper)

Currently only online learning is implemented. An offline RL version using D4RL will be added soon.
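
As background, IQL fits its state-value function with expectile regression on the difference Q(s, a) - V(s). The sketch below is not taken from this repository. It illustrates the asymmetric squared loss described in the IQL paper in plain Python. The function names and the default expectile of 0.7 are assumptions chosen for illustration, and a real implementation would operate on PyTorch tensors rather than Python floats.

```python
def expectile_loss(diff, expectile=0.7):
    """Asymmetric squared loss |expectile - 1(diff < 0)| * diff**2.

    diff is Q(s, a) - V(s). Positive differences are weighted by
    `expectile`, negative ones by `1 - expectile`.
    """
    weight = expectile if diff >= 0 else 1.0 - expectile
    return weight * diff ** 2


def mean_expectile_loss(q_values, v_values, expectile=0.7):
    """Average the expectile loss over a batch given as two equal-length lists."""
    diffs = [q - v for q, v in zip(q_values, v_values)]
    return sum(expectile_loss(d, expectile) for d in diffs) / len(diffs)
```

With an expectile above 0.5, underestimating Q is penalized more than overestimating it, so V is pushed toward the upper range of the Q-values seen in the data.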

Run

python train.py

Results

Pendulum

Reference

Original JAX implementation: IQL

Help and issues:

I'm open to feedback, bug reports, improvements, or anything else. Just leave me a message or contact me.

Author

Sebastian Dittert

Feel free to use this code for your own projects or research.

@misc{IQL,
  author = {Dittert, Sebastian},
  title = {PyTorch Implementation of Implicit-Q-Learning (IQL)},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/BY571/Implicit-Q-Learning}},
}


Comments

offline training

Hi! Is offline training now fully supported? I am confused because I see the train_offline script but in the README I see that you say that offline training is not implemented. Maybe not with the D4RL dataset, but should it work for any dataset of experiences (s,a,r,s',d)?

Thank you!

opened by roger-creus 0
Question for runtime

Hi, Thanks for sharing the implementation code.

I have a question about IQL experimental runtime on PyTorch. Actually, I tried to re-implement it with tensorflow-keras, but the runtime is quite slow (on HalfCheetah-medium-v2 with a GTX 1080 Ti).

If you don't mind, could you share the overall runtime on that environment, or the computing resources you used? Thanks in advance.

opened by goodboychan 0
Bad results on AntMaze environment

It works well on MuJoCo environments, but not on the AntMaze environment. It did not work even after I changed the parameters according to the paper (expectile=0.9, temperature=10). Can you help me, please?

opened by dljzx 1
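
For context on the two parameters quoted in the issue above: the expectile controls the value-function loss, while the temperature scales the advantage Q(s, a) - V(s) inside an exponential weight used for policy extraction. The following is a minimal sketch of that weighting in plain Python, not this repository's code. The function names are hypothetical, and the weight cap of 100 is an assumption added here to keep the exponential bounded.

```python
import math


def advantage_weight(q, v, temperature=10.0, max_weight=100.0):
    """Return min(exp(temperature * (q - v)), max_weight)."""
    exponent = temperature * (q - v)
    # Compare in log space first so that math.exp never overflows.
    if exponent >= math.log(max_weight):
        return max_weight
    return math.exp(exponent)


def weighted_policy_loss(log_probs, q_values, v_values, temperature=10.0):
    """Negative mean of weight * log pi(a|s) over a batch of dataset actions."""
    weights = [
        advantage_weight(q, v, temperature)
        for q, v in zip(q_values, v_values)
    ]
    return -sum(w * lp for w, lp in zip(weights, log_probs)) / len(log_probs)
```

A larger temperature concentrates the weights on the actions with the highest advantage, which is why this setting interacts strongly with the expectile on sparse-reward tasks.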
scalar observation

Sebastian, thank you for this great code. I am trying to run some examples here (starting with offline training on AntMaze), but I receive an error about "assert np.isscalar(low) and np.isscalar(high)" from the Box space, which is returned from line 18 of single_precision.py. Is there something I may have missed? Thank you.

opened by amirmirzaeinia 0

Owner

Sebastian Dittert

Mechanical Engineer M.Sc. Interested in Deep Reinforcement Learning, AI and Robotics/Self-driving cars

