RRL: Resnet as representation for Reinforcement Learning

Quick Links

Website | Paper | Video

Overview

Resnet as representation for Reinforcement Learning (RRL) is a simple yet effective approach for training behaviors directly from visual inputs. We demonstrate that features learned by standard image classification models are general across different tasks and robust to visual distractors, and that, when used in conjunction with standard Imitation Learning or Reinforcement Learning pipelines, they can efficiently acquire behaviors directly from proprioceptive inputs.

Final behaviors acquired using RRL on the ADROIT benchmark tasks, left to right: (a) opening a door, (b) hammering a nail, (c) pen twirling, (d) object relocation.
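At its core, RRL amounts to feeding frozen ResNet features, alongside proprioceptive state, into an otherwise standard policy. The sketch below illustrates that idea; it is not code from this repository, and names such as RRLPolicy are illustrative.

    import torch
    import torch.nn as nn
    from torchvision import models

    class RRLPolicy(nn.Module):
        """Illustrative sketch: frozen ResNet features + proprioception -> action."""
        def __init__(self, proprio_dim, action_dim):
            super().__init__()
            resnet = models.resnet34(pretrained=True)
            # Drop the classification head; keep the 512-d pooled features.
            self.encoder = nn.Sequential(*list(resnet.children())[:-1])
            for p in self.encoder.parameters():
                p.requires_grad = False  # the representation stays frozen
            self.policy = nn.Sequential(
                nn.Linear(512 + proprio_dim, 256), nn.Tanh(),
                nn.Linear(256, action_dim),
            )

        def forward(self, image, proprio):
            with torch.no_grad():
                features = self.encoder(image).flatten(1)  # (B, 512)
            return self.policy(torch.cat([features, proprio], dim=1))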

Setup

The RRL codebase can be installed by cloning this repository. Note that it uses git submodules to resolve dependencies. Please follow the steps below to install it correctly.

  1. Clone this repository along with the submodules

    git clone --recursive https://github.com/facebookresearch/RRL.git
    
  2. Install the package using conda. The dependencies (apart from mujoco_py) are listed in env.yml.

    conda env create -f env.yml
    
    conda activate rrl
    
  3. The environments require MuJoCo as a dependency. You may need to obtain a license and follow the setup instructions for mujoco_py. Setting up mujoco_py with GPU support is highly recommended.

  4. Install the mj_envs and mjrl repositories.

    cd RRL
    pip install -e mjrl/.
    pip install -e mj_envs/.
    pip install -e .
    
  5. Additionally, RRL requires the demonstrations published by hand_dapg. A quick sanity check of the full installation is sketched below.
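
    As a sanity check of the installation, something along these lines should run. This is a hedged sketch; it assumes mj_envs registers the ADROIT tasks with gym on import, as in hand_dapg.

    import gym
    import mj_envs  # registers the ADROIT environments with gym

    e = gym.make('hammer-v0')
    e.reset()
    obs, rew, done, info = e.step(e.action_space.sample())
    print('hammer-v0 step OK, reward:', rew)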

Running Instructions

  1. The first step is to convert the observations of the demonstrations provided by hand_dapg into the encoder's feature space. An example script is provided here. Note that the script saves the converted demonstrations in .pickle format inside the rrl/demonstrations directory; a sketch of what the conversion does internally follows the commands below.

    For the mj_envs tasks:

    python convertDemos.py --env_name hammer-v0 --encoder_type resnet34 -c top -d <path/to/demo/file>
    python convertDemos.py --env_name door-v0 --encoder_type resnet34 -c top -d <path/to/demo/file>
    python convertDemos.py --env_name pen-v0 --encoder_type resnet34 -c vil_camera -d <path/to/demo/file>
    python convertDemos.py --env_name relocate-v0 --encoder_type resnet34 -c cam1 -c cam2 -c cam3 -d <path/to/demo/file>
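    Roughly, the conversion replays each demonstration, renders camera frames, and replaces the raw observations with ResNet features. Below is a hedged sketch of that loop; calls such as set_env_state and the file layout are illustrative, not the actual convertDemos.py internals.

    import pickle
    import gym, mj_envs
    import torch
    from torchvision import models, transforms

    # Frozen resnet34 trunk (classification head dropped), as in the overview sketch.
    trunk = torch.nn.Sequential(
        *list(models.resnet34(pretrained=True).children())[:-1]).eval()
    preprocess = transforms.Compose([
        transforms.ToPILImage(), transforms.Resize((224, 224)), transforms.ToTensor(),
        transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
    ])

    def encode(img):
        """Map one HxWx3 uint8 camera image to a 512-d feature vector."""
        with torch.no_grad():
            return trunk(preprocess(img).unsqueeze(0)).flatten().numpy()

    e = gym.make('hammer-v0')
    with open('hammer-v0_demos.pickle', 'rb') as f:
        demo_paths = pickle.load(f)

    for path in demo_paths:
        e.reset()
        e.env.set_env_state(path['init_state_dict'])  # restore the demo's start state
        feats = []
        for a in path['actions']:
            img = e.env.sim.render(width=256, height=256, mode='offscreen',
                                   camera_name='top')
            feats.append(encode(img))
            e.step(a)
        path['observations'] = feats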
  2. Launching RRL experiments using DAPG.

    An example launch script, job_script.py, is provided in the examples/ directory, and the configs used are stored in the examples/config/ directory. Note: Hydra configs are used; a minimal sketch of such an entry point follows the commands below.

    python job_script.py demo_file=<path/to/demo/file> --config-name hammer_dapg
    python job_script.py demo_file=<path/to/demo/file> --config-name door_dapg
    python job_script.py demo_file=<path/to/demo/file> --config-name pen_dapg
    python job_script.py demo_file=<path/to/demo/file> --config-name relocate_dapg
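    For orientation, a Hydra-driven job script has roughly this shape. This is a hedged sketch with illustrative details, not the actual examples/job_script.py.

    import hydra
    from omegaconf import DictConfig, OmegaConf

    @hydra.main(config_path='config', config_name='hammer_dapg')
    def main(cfg: DictConfig) -> None:
        # Command-line overrides such as demo_file=... arrive merged into cfg.
        print(OmegaConf.to_yaml(cfg))
        # ... build the env, load cfg.demo_file, and launch DAPG training ...

    if __name__ == '__main__':
        main()

    Because Hydra owns the command line, demo_file=... is passed as a config override rather than an argparse flag, and --config-name selects which task config in examples/config/ to load.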

Comments
  • Docker image for adroit?

    Hi! I'm just curious whether you have been using Docker containers to run experiments with Adroit. When I tried rendering in a Docker container (using an nvidia base image), the line img = e.env.sim.render() was very slow, and it is unclear why.

    If you have used Docker, it would be great if you could share any information about the container you used. Really appreciate your help!

    opened by watchernyu 2
  • Demonstration has no seed?

    Hi! I noticed that when converting the demonstrations, the script prints this warning message:

    "++++++++++++++++++++++++++++ Couldn't find the seed of the demos. Please verify."

    When I print out the keys of the loaded demonstration file (using the commands below), it seems the demonstrations don't actually have a "seed" entry (I downloaded them from the dapg repo). I'm not sure if this is the expected behavior? Thank you very much!

    import pickle
    demo_paths = pickle.load(open('relocate-v0_demos.pickle', 'rb'))
    print(len(demo_paths))
    print(demo_paths[0].keys())
    
    # output is: 
    # 25
    # dict_keys(['observations', 'init_state_dict', 'actions', 'rewards'])
    
    opened by watchernyu 2
  • Using cpu for testing?

    First of all, thank you very much for presenting this nice paper and open-sourcing the code!

    I'm wondering if it's possible to use the CPU instead of a GPU to do some testing on my local machine (which unfortunately does not have a CUDA GPU). I modified the code a bit and ran (with the demo files under ../demo):

    python convertDemos.py --env_name hammer-v0 --encoder_type resnet34 -c top -d ../demo/hammer-v0_demos.pickle

    It then fails with ERROR: GLEW initalization error: Missing GL version, which occurs at the rendering line in multicam.py: img = self._env.env.sim.render(width=self.width, height=self.height, mode='offscreen', camera_name=cam)

    Is there an easy way to ask MuJoCo to do CPU (software) rendering here? Thanks a lot for the help!

    opened by watchernyu 2
  • How to turn on visual distractions?

    Hi! I'm curious how we can turn on the visual distractions, such as light position, light direction, and object color, in the Adroit environments (as mentioned in section 7.5 of the appendix of your paper).

    Thanks a lot for your help!

    opened by watchernyu 0