We are More than Our JOints: Predicting How 3D Bodies Move

Last update: Oct 20, 2022

Related tags

Deep Learning MOJO-release

Overview

We are More than Our JOints: Predicting How 3D Bodies Move

Citation

This repo contains the official implementation of our paper MOJO:

@inproceedings{Zhang:CVPR:2021,
  title = {We are More than Our Joints: Predicting how {3D} Bodies Move},
  author = {Zhang, Yan and Black, Michael J. and Tang, Siyu},
  booktitle = {Proceedings IEEE/CVF Conf.~on Computer Vision and Pattern Recognition (CVPR)},
  month = jun,
  year = {2021},
  month_numeric = {6}
}

License

We employ CC BY-NC-SA 4.0 for the MOJO code, which covers

models/fittingop.py
experiments/utils/batch_gen_amass.py
experiments/utils/utils_canonicalize_amass.py
experiments/utils/utils_fitting_jts2mesh.py
experiments/utils/vislib.py
experiments/vis_*_amass.py

The rest part are developed based on DLow. According to their license, the implementation follows its CMU license.

Environment & code structure

Tested OS: Linux Ubuntu 18.04
Packages:
- Python >= 3.6
- PyTorch >= 1.2
- Tensorboard
- smplx
- vposer
- others
Note: All scripts should be run from the root of this repo to avoid path issues. Also, please fix some path configs in the code, otherwise errors will occur.

Training

The training is split to two steps. Provided we have a config file in experiments/cfg/amass_mojo_f9_nsamp50.yml, we can do

python experiments/train_MOJO_vae.py --cfg amass_mojo_f9_nsamp50 to train the MOJO
python experiments/train_MOJO_dlow.py --cfg amass_mojo_f9_nsamp50 to train the DLow

Evaluation

These experiments/eval_*.py files are for evaluation. For eval_*_pred.py, they can be used either to evaluate the results while predicting, or to save results to a file for further evaluation and visualization. An example is python experiments/eval_kps_pred.py --cfg amass_mojo_f9_nsamp50 --mode vis, which is to save files to the folder results/amass_mojo_f9_nsamp50.

Generation

In MOJO, the recursive projection scheme is to get 3D bodies from markers and keep the body valid. The relevant implementation is mainly in models/fittingop.py and experiments/test_recursive_proj.py. An example to run is

python experiments/test_recursive_proj.py --cfg amass_mojo_f9_nsamp50 --testdata ACCAD --gpu_index 0

Datasets

In MOJO, we have used AMASS, Human3.6M, and HumanEva.

For Human3.6M and HumanEva, we follow the same pre-processing step as in DLow, VideoPose3D, and others. Please refer to their pages, e.g. this one, for details.

For AMASS, we perform canonicalization of motion sequences with our own procedures. The details are in experiments/utils_canonicalize_amass.py. We find this sequence canonicalization procedure is important. The canonicalized AMASS used in our work can be downloaded here, which includes the random sample names of ACCAD and BMLhandball used in our experiments about motion realism.

Models

For human body modeling, we employ the SMPL-X parametric body model. You need to follow their license and download. Based on SMPL-X, we can use the body joints or a sparse set of body mesh vertices (the body markers) to represent the body.

CMU It has 41 markers, the corresponding SMPL-X mesh vertex ID can be downloaded here.
SSM2 It has 64 markers, the corresponding SMPL-X mesh vertex ID can be downloaded here.
Joints We used 22 joints. No need to download, but just obtain them from the SMPL-X body model. See details in the code.

Our CVAE model configurations are in experiments/cfg. The pre-trained checkpoints can be downloaded here.

Related projects

AMASS: It unifies diverse motion capture data with the SMPL-H model, and provides a large-scale high-quality dataset. Its official codebase and tutorials are in this github repo.
GRAB: Most mocap data only contains the body motion. GRAB, however, provides high-quality data of human-object interactions. Besides capturing the body motion, the object motion and the hand-object contact are captured simultaneously. More demonstrations are in its github repo.

Acknowledgement & disclaimer

We thank Nima Ghorbani for the advice on the body marker setting and the {\bf AMASS} dataset. We thank Yinghao Huang, Cornelia K"{o}hler, Victoria Fern'{a}ndez Abrevaya, and Qianli Ma for proofreading. We thank Xinchen Yan and Ye Yuan for discussions on baseline methods. We thank Shaofei Wang and Siwei Zhang for their help with the user study and the presentation, respectively.

MJB has received research gift funds from Adobe, Intel, Nvidia, Facebook, and Amazon. While MJB is a part-time employee of Amazon, his research was performed solely at, and funded solely by, Max Planck. MJB has financial interests in Amazon Datagen Technologies, and Meshcapade GmbH.

You might also like...

This repository contains the code used for Predicting Patient Outcomes with Graph Representation Learning (https://arxiv.org/abs/2101.03940).

Predicting Patient Outcomes with Graph Representation Learning This repository contains the code used for Predicting Patient Outcomes with Graph Repre

76 Dec 22, 2022

PAWS 🐾 Predicting View-Assignments with Support Samples

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Preference-Planning-Deep-IRL Introduction Check my portfolio post Dependencies Gym stable-baselines3 PyTorch Usage Take Demonstration python3 record.

9 Oct 26, 2022

Comments

What's your pytorch version?

Thanks for your great job, now i meet the problem about torchgeometry version and torch version when I run pythonexperiments/test_recursive_proj.py --cfg amass_mojo_f9_nsamp50 --testdata ACCAD --gpu_index 0 . Could u tell me your test env?

Traceback (most recent call last): File "experiments/test_recursive_proj.py", line 222, in pred_with_proj(n_seqs=60, n_gens=50) File "experiments/test_recursive_proj.py", line 101, in pred_with_proj prev_pose_vp, prev_handpose) File "experiments/test_recursive_proj.py", line 39, in get_prediction prev_pose_vp, prev_handpose) File "/home/zzj/project/MOJO/models/fittingop.py", line 272, in decode_with_fitting y_i.detach()) File "/home/zzj/project/MOJO/models/fittingop.py", line 209, in fitting_subloop loss, verts= self.calc_loss(betas, verts_gt, ss) File "/home/zzj/project/MOJO/models/fittingop.py", line 180, in calc_loss body_param['global_orient'] = RotConverter.rotmat2aa(RotConverter.cont2rotmat(self.glo_rot_rec)) File "/home/zzj/project/MOJO/models/fittingop.py", line 64, in rotmat2aa pose = tgm.rotation_matrix_to_angle_axis(homogen_matrot).view(-1, 3).contiguous() File "/home/zzj/anaconda3/envs/mojo/lib/python3.6/site-packages/torchgeometry/core/conversions.py", line 233, in rotation_matrix_to_angle_axis quaternion = rotation_matrix_to_quaternion(rotation_matrix) File "/home/zzj/anaconda3/envs/mojo/lib/python3.6/site-packages/torchgeometry/core/conversions.py", line 302, in rotation_matrix_to_quaternion mask_c1 = mask_d2 * (1 - mask_d0_d1) File "/home/zzj/anaconda3/envs/mojo/lib/python3.6/site-packages/torch/tensor.py", line 325, in rsub return _C._VariableFunctions.rsub(self, other) RuntimeError: Subtraction, the - operator, with a bool tensor is not supported. If you are trying to invert a mask, use the ~ or bitwise_not() operator instead.

opened by FatherPrime 4
Questions about the DCT space

Marvellous job!! Since I new in this area, I wonder where do those DCT matrics (\eg dct_45.mat) in this come from and how to obtain a dct matrix with arbitrary dimension?

opened by SDOlivia 3
how to select markers?

Hi. thanks for sharing your code.

I wonder how to select the markers. Did you choose the 41, 67 markers manaully? Or is it selected by some algorithm? If it was selected through an algorithm, could you please tell me which algorithm you used?

Thanks.

opened by asw91666 1

We are More than Our JOints: Predicting How 3D Bodies Move

Related tags

Overview

We are More than Our JOints: Predicting How 3D Bodies Move

Citation

License

Environment & code structure

Training

Evaluation

Generation

Datasets

Models

Related projects

Acknowledgement & disclaimer

You might also like...

This repository contains the code used for Predicting Patient Outcomes with Graph Representation Learning (https://arxiv.org/abs/2101.03940).

PAWS 🐾 Predicting View-Assignments with Support Samples

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks (Scientific Reports)

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

A geometric deep learning pipeline for predicting protein interface contacts.

An end-to-end regression problem of predicting the price of properties in Bangalore.

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Comments

What's your pytorch version?

Questions about the DCT space

how to select markers?

Owner

Much faster than SORT(Simple Online and Realtime Tracking), a little worse than SORT

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

Code of 3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and Faces

SIEM Logstash parsing for more than hundred technologies

Research shows Google collects 20x more data from Android than Apple collects from iOS. Block this non-consensual telemetry using pihole blocklists.

[CVPR 2021] Teachers Do More Than Teach: Compressing Image-to-Image Models (CAT)

An AI Assistant More Than a Toolkit

A python bot to move your mouse every few seconds to appear active on Skype, Teams or Zoom as you go AFK. 🐭 🤖