Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces

yupei.wu

Last update: Oct 10, 2022

Related tags

Deep Learning RL_toolbox

Overview

RL_toolbox

All algorithms are meant to be run from the PyCharm IDE; run any other way, package import errors may occur.

Implemented algorithms: TRPO, A3C

a3c: for continuous action spaces; uses multiple processes. Saving the model has not been implemented yet.
trpo: for continuous and discrete action spaces.

Run

a3c: run a3c/a3c_continous.py in the PyCharm IDE
trpo: run experiment/trpo_continous.py in the PyCharm IDE
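
The import errors seen outside PyCharm are probably a search-path issue: PyCharm's default run configurations add the project's content and source roots to PYTHONPATH, while a plain `python experiment/trpo_continous.py` only puts the script's own folder on `sys.path`. The following is a minimal sketch of a workaround, not part of the toolbox. The helper name `add_project_root` and the assumption that the package folder sits one level above the script's folder are hypothetical.

```python
import os
import sys


def add_project_root(script_path, levels_up=1):
    """Insert the directory `levels_up` levels above the script's folder into sys.path.

    Hypothetical helper: the actual repository layout is an assumption.
    """
    root = os.path.dirname(os.path.abspath(script_path))
    for _ in range(levels_up):
        root = os.path.dirname(root)
    if root not in sys.path:
        sys.path.insert(0, root)
    return root


# Hypothetical usage at the top of a script, before importing the toolbox package:
# add_project_root(__file__)
```

Setting the PYTHONPATH environment variable to the project root before launching the script should have the same effect without touching the code.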

Contains some useful reinforcement learning algorithms and related tools.

Comments

Unable to train model

On executing trpo_continous.py, I get the following error:

[2017-07-01 23:52:58,375] Making new env: CartPole-v0
[TL] InputLayer continous_shared/continous_input_layer: (?, 3)
[TL] DenseLayer continous_shared/continous_fc1: 64 relu
[TL] DenseLayer continous_shared/continous_fc2: 64 relu
[TL] DenseLayer continous_shared/continous_fc3: 1 relu

********** Iteration 0 ************
Traceback (most recent call last):
  File "experiment/trpo_continous.py", line 62, in <module>
    agent.learn()
  File "/home/abhinav/Desktop/major/parallel-trpo/RL_toolbox/RLToolbox/agent/TRPO_agent.py", line 80, in learn
    stats , theta , thprev = self.train_mini_batch(linear_search=False)
  File "/home/abhinav/Desktop/major/parallel-trpo/RL_toolbox/RLToolbox/algorithm/TRPO.py", line 62, in train_mini_batch
    self.get_samples(self.pms.paths_number)
  File "/home/abhinav/Desktop/major/parallel-trpo/RL_toolbox/RLToolbox/algorithm/TRPO.py", line 29, in get_samples
    self.storage.get_single_path()
  File "/home/abhinav/Desktop/major/parallel-trpo/RL_toolbox/RLToolbox/storage/storage_continous.py", line 36, in get_single_path
    a, agent_info = self.agent.get_action(o)
  File "/home/abhinav/Desktop/major/parallel-trpo/RL_toolbox/RLToolbox/algorithm/TRPO.py", line 43, in get_action
    {self.net.obs: obs})
  File "/home/abhinav/anaconda2/envs/osim/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 710, in run
    run_metadata_ptr)
  File "/home/abhinav/anaconda2/envs/osim/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 887, in _run
    % (np_val.shape, subfeed_t.name, str(subfeed_t.get_shape())))
ValueError: Cannot feed value of shape (1, 4) for Tensor u'continous_shared/continous_obs:0', which has shape '(?, 3)'

opened by abhinavrai44 1
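
The error above reads as a mismatch between the observation size the network was built for and the size the environment emits: the log shows an input layer of shape (?, 3), while CartPole-v0 observations have 4 components. The sketch below only illustrates the compatibility rule behind the message. The function name `is_compatible` is hypothetical and this is not TensorFlow's own code; `None` stands for the unknown batch dimension that TensorFlow prints as `?`.

```python
def is_compatible(value_shape, placeholder_shape):
    """Return True if a value of `value_shape` could be fed to a placeholder.

    A None entry in `placeholder_shape` matches any size in that position.
    """
    if len(value_shape) != len(placeholder_shape):
        return False
    return all(p is None or p == v
               for v, p in zip(value_shape, placeholder_shape))


# Shape reported in the log for the observation input: (?, 3)
placeholder_shape = (None, 3)

print(is_compatible((1, 3), placeholder_shape))  # True: a 3-component observation fits
print(is_compatible((1, 4), placeholder_shape))  # False: the CartPole-v0 case from the traceback
```

A likely fix, assuming the toolbox takes the observation dimension as a parameter (the parameter itself is not shown on this page), is to set it from `env.observation_space.shape` instead of a fixed value. Note also that CartPole-v0 has a discrete action space, so the continuous script may not be the right entry point for it.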
