# Layered Neural Rendering in PyTorch
This repository contains training code for the examples in the SIGGRAPH Asia 2020 paper "Layered Neural Rendering for Retiming People in Video."
This is not an officially supported Google product.
## Prerequisites
- Linux
- Python 3.6+
- NVIDIA GPU + CUDA CuDNN
## Installation
This code has been tested with PyTorch 1.4 and Python 3.8.
- Install PyTorch 1.4 and other dependencies.
  - For pip users, please type the command `pip install -r requirements.txt`.
  - For Conda users, you can create a new Conda environment using `conda env create -f environment.yml`.
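After installing, an optional sanity check (assuming the pip or Conda environment you just created is active) is to confirm that PyTorch is importable and can see your GPU:

```bash
# Optional check that PyTorch is installed and CUDA is visible.
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```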
## Data Processing
- Download the data for a video used in our paper (e.g. "reflection"):
  ```
  bash ./datasets/download_data.sh reflection
  ```
  Or alternatively, download all the data by specifying `all`.
- Download the pretrained keypoint-to-UV model weights:
  ```
  bash ./scripts/download_kp2uv_model.sh
  ```
  The pretrained model will be saved at `./checkpoints/kp2uv/latest_net_Kp2uv.pth`.
- Generate the UV maps from the keypoints:
  ```
  bash datasets/prepare_iuv.sh ./datasets/reflection
  ```
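For example, to fetch every video from the paper at once rather than a single one, pass the `all` keyword mentioned above to the same download script:

```bash
# Download the data for all videos used in the paper.
bash ./datasets/download_data.sh all
```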
## Training
- To train a model on a video (e.g. "reflection"), run:
  ```
  python train.py --name reflection --dataroot ./datasets/reflection --gpu_ids 0,1
  ```
- To view training results and loss plots, visit the URL http://localhost:8097. Intermediate results are also at `./checkpoints/reflection/web/index.html`.

You can find more scripts in the `scripts` directory, e.g. `run_${VIDEO}.sh`, which combines data processing, training, and saving layer results for a video.
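For instance, an end-to-end run for the "reflection" video would presumably look like the following, assuming a per-video script named after the `run_${VIDEO}.sh` pattern exists (the exact script name is an assumption here):

```bash
# Hypothetical end-to-end run (data processing + training + saving layers),
# assuming scripts/run_reflection.sh exists.
bash ./scripts/run_reflection.sh
```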
Note:
- It is recommended to use >=2 GPUs, each with >=16GB memory.
- The training script first trains the low-resolution model for `--num_epochs` at `--batch_size`, and then trains the upsampling module for `--num_epochs_upsample` at `--batch_size_upsample`. If you do not need the upsampled result, pass `--num_epochs_upsample 0` (see the example after this list).
- Training the upsampling module requires roughly 2.5x the memory of the low-resolution model, so set `--batch_size_upsample` accordingly. The provided scripts set the batch sizes appropriately for 2 GPUs with 16GB memory.
- GPU memory scales linearly with the number of layers.
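For example, a run that trains only the low-resolution model and skips the upsampling stage entirely, using only the flags documented above, would look like:

```bash
# Train the low-resolution model only; skip the upsampling module.
python train.py --name reflection --dataroot ./datasets/reflection \
  --gpu_ids 0,1 --num_epochs_upsample 0
```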
## Saving layer results from a trained model
- Run the trained model:
  ```
  python test.py --name reflection --dataroot ./datasets/reflection --do_upsampling
  ```
- The results (RGBA layers, videos) will be saved to `./results/reflection/test_latest/`.
- Passing `--do_upsampling` uses the results of the upsampling module. If the upsampling module hasn't been trained (`num_epochs_upsample=0`), then remove this flag.
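If you trained with `--num_epochs_upsample 0`, the low-resolution-only invocation is simply the same command without the upsampling flag:

```bash
# Save layer results from a model trained without the upsampling module.
python test.py --name reflection --dataroot ./datasets/reflection
```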
## Custom video
To train on your own video, you will have to preprocess the data:
- Extract the frames, e.g.
  ```
  mkdir ./datasets/my_video && cd ./datasets/my_video
  mkdir rgb && ffmpeg -i video.mp4 rgb/%04d.png
  ```
- Resize the video to 256x448 and save the frames in `my_video/rgb_256`, and resize the video to 512x896 and save in `my_video/rgb_512` (one possible ffmpeg command is sketched after this list).
- Run AlphaPose and Pose Tracking on the frames. Save results as `my_video/keypoints.json`.
- Create `my_video/metadata.json` following these instructions.
- If your video has camera motion, either (1) stabilize the video, or (2) maintain the camera motion by computing homographies and saving them as `my_video/homographies.txt`. See `scripts/run_cartwheel.sh` for a training example with camera motion, and see `./datasets/cartwheel/homographies.txt` for formatting.
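The resize step above can be done with ffmpeg's scale filter. This is only a sketch: it assumes the 256x448 and 512x896 targets are height x width, and that `video.mp4` is the same source used for frame extraction; adjust filenames and scaling options to your footage.

```bash
# Hypothetical resize commands; 256x448 / 512x896 are assumed to be height x width.
cd ./datasets/my_video
mkdir -p rgb_256 rgb_512
ffmpeg -i video.mp4 -vf scale=448:256 rgb_256/%04d.png
ffmpeg -i video.mp4 -vf scale=896:512 rgb_512/%04d.png
```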
Note: Videos that are suitable for our method have the following attributes:
- Static camera or limited camera motion that can be represented with a homography.
- Limited number of people, due to GPU memory limitations. We tested up to 7 people and 7 layers. Multiple people can be grouped onto the same layer, though they cannot be individually retimed.
- People that move relative to the background (static people will be absorbed into the background layer).
- We tested a video length of up to 200 frames (~7 seconds).
## Citation
If you use this code for your research, please cite the following paper:
```
@inproceedings{lu2020,
  title={Layered Neural Rendering for Retiming People in Video},
  author={Lu, Erika and Cole, Forrester and Dekel, Tali and Xie, Weidi and Zisserman, Andrew and Salesin, David and Freeman, William T and Rubinstein, Michael},
  booktitle={SIGGRAPH Asia},
  year={2020}
}
```
## Acknowledgments
This code is based on [pytorch-CycleGAN-and-pix2pix](https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix).