Official code release for 3DV 2021 paper Human Performance Capture from Monocular Video in the Wild.

Chen Guo

Last update: Dec 24, 2022

Overview

Human Performance Capture from Monocular Video in the Wild

Paper | Video | Project Page

We propose a method that captures the dynamic 3D human shape from a monocular video featuring challenging body poses, without any additional input.

If you find our code or paper useful, please cite as

@inproceedings{guo2021human,
  title={Human Performance Capture from Monocular Video in the Wild},
  author={Guo, Chen and Chen, Xu and Song, Jie and Hilliges, Otmar},
  booktitle={2021 International Conference on 3D Vision (3DV)},
  pages={889--898},
  year={2021},
  organization={IEEE}
}

Quick Start

Clone this repo:

git clone https://github.com/MoyGcc/hpcwild.git
cd hpcwild
conda env create -f environment.yml
conda activate hpcwild

Additional Dependencies:

Kaolin 0.1.0 (https://github.com/NVIDIAGameWorks/kaolin)
MPI mesh library (https://github.com/MPI-IS/mesh)
torch-mesh-isect (https://github.com/vchoutas/torch-mesh-isect)

Download SMPL models (1.0.0 for Python 2.7 (10 shape PCs)) and move them to the corresponding places:

mkdir -p smpl_rendering/smpl_model/
mv /path/to/smpl/models/basicModel_f_lbs_10_207_0_v1.0.0.pkl smpl_rendering/smpl_model/SMPL_FEMALE.pkl
mv /path/to/smpl/models/basicmodel_m_lbs_10_207_0_v1.0.0.pkl smpl_rendering/smpl_model/SMPL_MALE.pkl
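
The same renaming step can be scripted. The following is a minimal sketch, not part of the repository: install_smpl_models is a hypothetical helper, and it assumes the two .pkl files from the SMPL 1.0.0 download sit in one source directory. It copies rather than moves, so the original download stays intact.

```python
import shutil
from pathlib import Path

# File names from the SMPL 1.0.0 download -> names used by the mv commands above.
SMPL_FILES = {
    "basicModel_f_lbs_10_207_0_v1.0.0.pkl": "SMPL_FEMALE.pkl",
    "basicmodel_m_lbs_10_207_0_v1.0.0.pkl": "SMPL_MALE.pkl",
}


def install_smpl_models(src_dir, dst_dir="smpl_rendering/smpl_model"):
    """Copy the SMPL .pkl files into dst_dir under the expected names.

    Hypothetical helper (not in the repo). Returns the source file names
    that were not found in src_dir.
    """
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    dst_dir.mkdir(parents=True, exist_ok=True)
    missing = []
    for src_name, dst_name in SMPL_FILES.items():
        src = src_dir / src_name
        if src.is_file():
            shutil.copy2(src, dst_dir / dst_name)
        else:
            missing.append(src_name)
    return missing
```

Calling install_smpl_models("/path/to/smpl/models") from the repository root should return an empty list once both files are in place.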

Download checkpoints for external modules:

wget https://download.01.org/opencv/openvino_training_extensions/models/human_pose_estimation/checkpoint_iter_370000.pth
mv /path/to/checkpoint_iter_370000.pth external/lightweight-human-pose-estimation.pytorch/checkpoint_iter_370000.pth

wget https://dl.fbaipublicfiles.com/pifuhd/checkpoints/pifuhd.pt -O pifuhd.pt
mv /path/to/pifuhd.pt external/pifuhd/checkpoints/pifuhd.pt

Download IPNet weights: https://datasets.d2.mpi-inf.mpg.de/IPNet2020/IPNet_p5000_01_exp_id01.zip
unzip IPNet_p5000_01_exp_id01.zip
mv /path/to/IPNet_p5000_01_exp_id01 registration/experiments/IPNet_p5000_01_exp_id01

gdown --id 1mcr7ALciuAsHCpLnrtG_eop5-EYhbCmz -O modnet_photographic_portrait_matting.ckpt
mv /path/to/modnet_photographic_portrait_matting.ckpt external/MODNet/pretrained/modnet_photographic_portrait_matting.ckpt
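
Before running the pipeline, it may help to confirm that every checkpoint landed where the mv commands above put it. This is a small sketch under that assumption: the paths are copied from the commands in this section, and missing_checkpoints is a hypothetical helper, not something the repository ships.

```python
from pathlib import Path

# Destination paths taken from the mv commands above.
CHECKPOINTS = [
    "external/lightweight-human-pose-estimation.pytorch/checkpoint_iter_370000.pth",
    "external/pifuhd/checkpoints/pifuhd.pt",
    "registration/experiments/IPNet_p5000_01_exp_id01",
    "external/MODNet/pretrained/modnet_photographic_portrait_matting.ckpt",
]


def missing_checkpoints(repo_root="."):
    """Return the checkpoint paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [p for p in CHECKPOINTS if not (root / p).exists()]


if __name__ == "__main__":
    for path in missing_checkpoints():
        print("missing:", path)
```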

Test on 3DPW dataset

Download the 3DPW dataset.

Modify the dataset_path in test.conf.
Run bash mesh_recon.sh to obtain the rigid body shape.
Run bash registration.sh to register an SMPL+D model to the rigid human body.
Run bash tracking.sh to capture the human performance temporally.
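
The three stages depend on each other, so a failure in an early stage makes the later ones pointless. A minimal driver sketch that runs them in order and stops at the first failure could look like this. run_pipeline is a hypothetical wrapper; it assumes the scripts are invoked from the repository root and signal failure through a non-zero exit code.

```python
import subprocess

# Stage scripts in the order listed above.
STAGES = ["mesh_recon.sh", "registration.sh", "tracking.sh"]


def run_pipeline(run=subprocess.run):
    """Run each stage with bash and stop at the first non-zero exit code.

    `run` is injectable so the control flow can be exercised without the
    real scripts. Returns the names of the stages that completed.
    """
    done = []
    for script in STAGES:
        result = run(["bash", script])
        if result.returncode != 0:
            break
        done.append(script)
    return done
```

If the returned list is shorter than STAGES, the first script not in it is the one that failed.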

Test on your own video

Run OpenPose to obtain the 2D keypoints.
Run LGD to acquire the initial 3D poses.
Run MODNet to extract silhouettes.
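
For orientation, OpenPose's JSON output stores each detected person's body keypoints as a flat list of x, y, confidence triples under "pose_keypoints_2d". The sketch below unpacks one frame into triples. How this repository expects the keypoints to be stored is not stated here, so treat load_openpose_keypoints as a hypothetical inspection helper only.

```python
import json


def load_openpose_keypoints(json_text, person_index=0):
    """Return [(x, y, confidence), ...] for one person in one frame.

    Returns an empty list if the frame has no person at person_index.
    """
    frame = json.loads(json_text)
    people = frame.get("people", [])
    if person_index >= len(people):
        return []
    flat = people[person_index].get("pose_keypoints_2d", [])
    # Group the flat list into consecutive (x, y, confidence) triples.
    return [tuple(flat[i:i + 3]) for i in range(0, len(flat) - 2, 3)]
```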

Acknowledgement

We use the code in PIFuHD for the rigid body construction and adapt IPNet for human model registration. We use the off-the-shelf methods OpenPose and MODNet to extract 2D keypoints and silhouettes. We sincerely thank these authors for their awesome work.


Comments

torch-mesh-isect is not compatible with your environment.yml

Thank you for publishing this code. It is quite intriguing! Unfortunately, torch-mesh-isect does not work with the environment.yml you have published. I presume this is because of an incompatibility with CUDA versions greater than 9. I am unable to compile and install it with CUDA 10.2, and it appears to be an issue for all v10 installations.

Is it possible to eliminate or work around the torch-mesh-isect dependency? Perhaps use a similar (but more up-to-date) library?

opened by VisionaryMind 3

Multiple changes to dependencies and code required

I have been able to get your project working on my end, but I ran into several obstacles along the way and wanted to share the modifications required to make installation more generic. Here are the steps I took:

Create a test_images directory at project ROOT (it was missing).

Include additional PIP dependencies: pyhocon, ipykernel, pycocotools, chumpy, smplpytorch

Install PyMCubes with pip install git+https://github.com/pmneila/PyMCubes.git

Clone NuScenes-Devkit from https://github.com/nutonomy/nuscenes-devkit.git and install it from its setup directory.

Manually install PyTorch3d with pip install git+https://github.com/facebookresearch/[email protected]
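
To see which of the extra pip dependencies are still missing from the active environment, a quick check along these lines may help. It is only a sketch: missing_modules is a hypothetical helper, and it assumes each import name matches the pip package name listed above, which is not guaranteed for every package.

```python
import importlib.util

# Import names assumed to match the pip package names listed above.
EXTRA_MODULES = ["pyhocon", "ipykernel", "pycocotools", "chumpy", "smplpytorch"]


def missing_modules(names=EXTRA_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    print("missing:", missing_modules())
```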

Further, there are several places where the code needs to change because it points to your personal home directory (/home/chen). Here are the locations:

registration/lib

th_SMPL.py
smpl_right_hand_idx = np.load('assets/smpl_right_hand_idx.npy')
smpl_left_hand_idx = np.load('assets/smpl_left_hand_idx.npy')

smpl_layer.py
self.model_path = 'smpl_rendering/smpl_model/SMPL_FEMALE.pkl'
self.model_path = 'smpl_rendering/smpl_model/SMPL_MALE.pkl'

smpl_paths.py
fname = 'registration/faces.npy'

registration/smpl_registration

fit_SMPLD_final.py
part_labels = pkl.load(open('assets/smpl_parts_dense.pkl', 'rb'))

smpl_rendering

smpl_renderer.py
_, faces, aux = load_obj('smpl_rendering/text_uv_coor_smpl.obj', load_textures=True)
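
The relative paths above only work when scripts are launched from the project root. One way to make them independent of both the home directory and the working directory is to resolve them against an explicit root, as in this sketch. Both repo_path and the HPCWILD_ROOT environment variable are hypothetical names introduced for illustration; the project defines neither.

```python
import os
from pathlib import Path


def repo_path(relative, root=None):
    """Resolve a repo-relative path against an explicit project root.

    Falls back to the (hypothetical) HPCWILD_ROOT environment variable,
    then to the current working directory.
    """
    base = Path(root or os.environ.get("HPCWILD_ROOT", "."))
    return base / relative
```

A call such as np.load(repo_path('assets/smpl_right_hand_idx.npy')) would then replace the hard-coded path in th_SMPL.py, assuming NumPy is imported as np there.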

Next, the smpl_parts_dense.pkl file is missing from IPNet, so those assets needed to be copied over as well.

With these changes, the entire pipeline works. The only question I have for you is how you generated the sequence in your teaser with the normals overlay. This is specifically what I would like to capture (not the OBJ sequence in .pkl). The resulting front-view output is the mesh overlay. Please be so kind as to indicate where in the code I can enable normals output.

Also, one more suggestion for you (if you have the desire and time): I find the OpenPose library to be very inaccurate for many complex movements. For example, in the fencing sequence it breaks down halfway into the movement by flipping the character's direction.

If you replaced OpenPose with HuMoR, this project would be in a league of its own. Please consider it. I would help, but I have very little time to spare.
opened by VisionaryMind 1
