Deep Q-Learning Network in pytorch (not actively maintained)

Hung-Tu Chen

Last update: Jan 1, 2023

Related tags

Overview

pytoch-dqn

This project is pytorch implementation of Human-level control through deep reinforcement learning and I also plan to implement the following ones:

Credit

This project reuses most of the code in https://github.com/berkeleydeeprlcourse/homework/tree/master/hw3

Requirements

python 3.5
gym (built from source)
pytorch (built from source)

Usage

To train a model:

$ python main.py

# To train the model using ram not raw images, helpful for testing

$ python ram.py

The model is defined in dqn_model.py

The algorithm is defined in dqn_learn.py

The running script and hyper-parameters are defined in main.py

You might also like...

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate. Website • Key Features • How To Use • Docs •

21.1k Jan 8, 2023

A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory"

memory_efficient_attention.pytorch A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory" (Rabe&Staats'21). def effic

7 Dec 26, 2022

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis This is a PyTorch implementation of the Deep Streaming Linear Discriminant

41 Dec 25, 2022

Visualizer for neural network, deep learning, and machine learning models

Netron is a viewer for neural network, deep learning and machine learning models. Netron supports ONNX (.onnx, .pb, .pbtxt), Keras (.h5, .keras), Tens

21k Jan 6, 2023

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos. By adopting a unified pipeline-based API design, PyKale enforces standardization and minimalism, via reusing existing resources, reducing repetitions and redundancy, and recycling learning models across areas.

370 Dec 27, 2022

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

XDVioDet Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020. The proj

64 Dec 12, 2022

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

CVPR 2021 | Activate or Not: Learning Customized Activation. This repository contains the official Pytorch implementation of the paper Activate or Not

184 Dec 27, 2022

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral) This is the official implementat

259 Dec 25, 2022

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases. Ivy wraps the functional APIs of existing frameworks. Framework-agnostic functions, libraries and layers can then be written using Ivy, with simultaneous support for all frameworks. Ivy currently supports Jax, TensorFlow, PyTorch, MXNet and Numpy. Check out the docs for more info!

8.2k Jan 2, 2023

Comments

Take a long time to converge

I run the code,and found your dqn algorithm take such a long time to converge.Actually,I found few implementation of dqn can converge in github.They can converge in a afternoon.I use a piece of GTX1080Ti.It is appreciate that your implementation can converge.But your code take a day and a night to converge.I don't know why.

opened by Ja1r0 1
ImportError: No module named 'openai_benchmark'

ubuntu 16.04 python3.5 anaconda installed gym refered https://github.com/openai/gym instruction. but when i run your code

Traceback (most recent call last): File "/home/op/pytorch-dqn-master/main.py", line 3, in import openai_benchmark ImportError: No module named 'openai_benchmark'

i found in the search engine, someone runs dir(gym),the result contains 'benchmark_spec', 'benchmarks', but none on my pc.

opened by jixian79 4
Fixes for issues #3, #5 and #7. Agent learns better
Below is the summary of the contributions made by this PR:

[x] Fixes for issues #3, #5 and #7

[x] Overcomes the limitation of #4. This PR enables the agent to collect more rewards with minimal changes to the original scripts.

[x] Tested with Pytorch 0.2 (as requested in #5 )
opened by praveen-palanisamy 2

pytorch 0.2

I'm running pytorch 0.2,

and the code dqn_learn.py fail to work..

the error as follow

Traceback (most recent call last):
  File "ram.py", line 57, in <module>
    main(env)
  File "ram.py", line 46, in main
    target_update_freq=TARGER_UPDATE_FREQ,
  File "/auto/master05/ssarcandy/ttt/dqn_learn.py", line 213, in dqn_learing
    current_Q_values.backward(d_error.data.unsqueeze(1))
  File "/home/master/05/ssarcandy/.local/lib/python2.7/site-packages/torch/autograd/variable.py", line 156, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, retain_variables)
  File "/home/master/05/ssarcandy/.local/lib/python2.7/site-packages/torch/autograd/__init__.py", line 98, in backward
    variables, grad_variables, retain_graph)
  File "/home/master/05/ssarcandy/.local/lib/python2.7/site-packages/torch/autograd/function.py", line 91, in apply
    return self._forward_cls.backward(self, *args)
  File "/home/master/05/ssarcandy/.local/lib/python2.7/site-packages/torch/autograd/_functions/tensor.py", line 566, in backward
    return grad_input.scatter_add_(ctx.dim, index, grad_output), None, None
  File "/home/master/05/ssarcandy/.local/lib/python2.7/site-packages/torch/autograd/variable.py", line 696, in scatter_add_
    return ScatterAdd.apply(self, dim, index, source, True)
  File "/home/master/05/ssarcandy/.local/lib/python2.7/site-packages/torch/autograd/_functions/tensor.py", line 605, in forward
    return input.scatter_add_(ctx.dim, index, source)
RuntimeError: invalid argument 3: Index tensor must have same dimensions as input tensor at /pytorch/torch/lib/THC/generic/THCTensorScatterGather.cu:198

opened by SSARCandy 2

Deep Q-Learning Network in pytorch (not actively maintained)

Related tags

Overview

pytoch-dqn

Credit

Requirements

Usage

You might also like...

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory"

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

Visualizer for neural network, deep learning, and machine learning models

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

Comments

Take a long time to converge

ImportError: No module named 'openai_benchmark'

Fixes for issues #3, #5 and #7. Agent learns better

pytorch 0.2

Owner

Hung-Tu Chen

Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

Deep learning (neural network) based remote photoplethysmography: how to extract pulse signal from video using deep learning tools

Pytorch Implementation of Adversarial Deep Network Embedding for Cross-Network Node Classification

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

deep-table implements various state-of-the-art deep learning and self-supervised learning algorithms for tabular data using PyTorch.

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Deep Q-Learning Network in pytorch (not actively maintained)

Related tags

Overview

pytoch-dqn

Credit

Requirements

Usage

You might also like...

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory"

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

Visualizer for neural network, deep learning, and machine learning models

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

Comments

Take a long time to converge

ImportError: No module named 'openai_benchmark'

Fixes for issues #3, #5 and #7. Agent learns better

pytorch 0.2

Owner

Hung-Tu Chen

Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

Deep learning (neural network) based remote photoplethysmography: how to extract pulse signal from video using deep learning tools

Pytorch Implementation of Adversarial Deep Network Embedding for Cross-Network Node Classification

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

deep-table implements various state-of-the-art deep learning and self-supervised learning algorithms for tabular data using PyTorch.

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren*, Raymond A. Yeh*, Alexander G. Schwing.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.