Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
[Project Page] [Paper]
Wenlong Huang1, Igor Mordatch2, Pieter Abbeel1, Deepak Pathak3
1University of California, Berkeley, 2Google Brain, 3Carnegie Mellon University
This is a PyTorch implementation of our Geometry-Aware Multi-Task Policy. The codebase also includes a suite of dexterous manipulation environments with 114 diverse real-world objects built upon Gym and MuJoCo.
We show that a single generalist policy can perform in-hand manipulation of over 100 geometrically-diverse real-world objects and generalize to new objects with unseen shape or size. Interestingly, we find that multi-task learning with object point cloud representations not only generalizes better but even outperforms the single-object specialist policies on both training as well as held-out test objects.
If you find this work useful in your research, please cite using the following BibTeX:
@article{huang2021geometry,
  title={Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning},
  author={Huang, Wenlong and Mordatch, Igor and Abbeel, Pieter and Pathak, Deepak},
  journal={arXiv preprint arXiv:2111.03062},
  year={2021}
}
Setup
Requirements
- Python=3.6.9
- CUDA=10.2
- CUDNN=7.6.5
- MuJoCo=1.50 (Installation Instructions)
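The linked instructions cover installing MuJoCo itself. As a rough sketch of the standard mujoco-py 1.50 layout (an assumption about the usual setup, not something specific to this repository), the binaries and license key are typically expected under ~/.mujoco:

mkdir -p ~/.mujoco
# extract the mjpro150 archive so the binaries live in ~/.mujoco/mjpro150
# place your license key at ~/.mujoco/mjkey.txt
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$HOME/.mujoco/mjpro150/bin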
Setup Instructions
git clone https://github.com/huangwl18/geometry-dex.git
cd geometry-dex/
conda create --name geometry-dex-env python=3.6.9
conda activate geometry-dex-env
pip install --upgrade pip
pip install -r requirements.txt
bash install-baselines.sh
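To verify that the setup succeeded, a quick import check can help (this assumes requirements.txt installs mujoco-py, Gym, and PyTorch; adjust if the pinned packages differ):

python -c "import mujoco_py, gym, torch; print(torch.__version__)"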
Running Code
Below are some flags and parameters for run_ddpg.py that you may find useful for reference (an example invocation follows the table):
Flags and Parameters | Description |
---|---|
--expID <INT> | Experiment ID |
--train_names <List of STRING> | List of environments for training; separated by space |
--test_names <List of STRING> | List of environments for zero-shot testing; separated by space |
--point_cloud | Use geometry-aware policy |
--pointnet_load_path <INT> | Experiment ID from which to load the pre-trained PointNet; required for --point_cloud |
--video_count <INT> | Number of videos to generate for each env per cycle; only up to 1 is currently supported; 0 to disable |
--n_test_rollouts <INT> | Total number of collected rollouts across all train + test envs for each evaluation run; should be a multiple of len(train_names) + len(test_names) |
--num_rollouts <INT> | Total number of collected rollouts across all train envs for one training cycle; should be a multiple of len(train_names) |
--num_parallel_envs <INT> | Number of parallel envs to create for vec_env; should be a multiple of len(train_names) |
--chunk_size <INT> | Number of parallel envs assigned to each worker in SubprocChunkVecEnv; 0 to disable and use SubprocVecEnv |
--num_layers <INT> | Number of layers in the MLP for all policies |
--width <INT> | Width of each layer in the MLP for all policies |
--seed <INT> | Seed for Gym, PyTorch, and NumPy |
--eval | Perform only evaluation using the latest checkpoint |
--load_path <INT> | Experiment ID from which to load the checkpoint for DDPG; required for --eval |
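As an example, a hypothetical invocation combining several of these flags might look as follows (the environment names apple, banana, and hammer are placeholders for whichever provided environments you want to use):

python run_ddpg.py --expID 0 --train_names apple banana --test_names hammer --num_rollouts 2 --n_test_rollouts 3 --seed 0

Here --num_rollouts 2 is a multiple of len(train_names) and --n_test_rollouts 3 is a multiple of len(train_names) + len(test_names), satisfying the constraints noted in the table.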
The code also uses WandB. You may wish to run wandb login in the terminal to record results to your account, or choose to run anonymously.
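If you prefer not to sync logs during a run at all, one option (standard WandB behavior, not something documented by this repository) is to disable syncing through an environment variable:

WANDB_MODE=offline python run_ddpg.py --expID 1 --video_count 0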
WARNING: Due to the large number of total environments, generating videos during training can be slow and memory intensive. You may wish to train the policy without generating videos by passing --video_count 0. After training completes, simply run run_ddpg.py with the flags --eval and --video_count=1 to visualize the policy. See the example below.
Training
To train Vanilla Multi-Task DDPG policy:
python run_ddpg.py --expID 1 --video_count 0 --n_cycles 40000 --chunk 10
To train Geometry-Aware Multi-Task DDPG policy, first pretrain PointNet encoder:
python train_pointnet.py --expID 2
Then train the policy:
python run_ddpg.py --expID 3 --video_count 0 --n_cycles 40000 --chunk 10 --point_cloud --pointnet_load_path 2 --no_save_buffer
Note that we do not save the replay buffer here because saving it is slow, as the buffer contains sampled point clouds. If you wish to resume training in the future, do not pass --no_save_buffer above.
Evaluation / Visualization
To evaluate a trained policy and generate video visualizations, run the same command used to train the policy, but with the additional flags --eval --video_count=<VIDEO_COUNT> --load_path=<LOAD_EXPID>. Replace <VIDEO_COUNT> with 1 if you wish to enable visualization and 0 otherwise. Replace <LOAD_EXPID> with the Experiment ID of the trained policy. For a Geometry-Aware Multi-Task DDPG policy trained with the command above, run the following for evaluation and visualization:
python run_ddpg.py --expID 4 --video_count 1 --n_cycles 40000 --chunk 10 --point_cloud --pointnet_load_path 2 --no_save_buffer --eval --load_path 3
Trained Models
We will be releasing trained model files for our Geometry-Aware Policy and single-task oracle policies for each individual object. Stay tuned! Early access can be requested via email.
Provided Environments
Acknowledgement
The code is adapted from this open-sourced implementation of DDPG + HER. The object meshes are from the YCB Dataset and the ContactDB Dataset. We use SubprocChunkVecEnv
from this pull request of OpenAI Baselines to speedup vectorized environments.