Demo code for paper "Learning optical flow from still images", CVPR 2021.

Last update: Dec 25, 2022

Related tags

Deep Learning depthstillation

Overview

Depthstillation

Demo code for "Learning optical flow from still images", CVPR 2021.

[Project page] - [Paper] - [Supplementary]

This code is provided to replicate the qualitative results shown in the supplementary material, Sections 2-4. The code has been tested using Ubuntu 20.04 LTS, python 3.8 and gcc 9.3.0

Reference

If you find this code useful, please cite our work:

@inproceedings{Aleotti_CVPR_2021,
  title     = {Learning optical flow from still images},
  author    = {Aleotti, Filippo and
               Poggi, Matteo and
               Mattoccia, Stefano},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2021}
}

Introduction
Usage
Supplementary
Weights
Contacts
Acknowledgments

Introduction

This paper deals with the scarcity of data for training optical flow networks, highlighting the limitations of existing sources such as labeled synthetic datasets or unlabeled real videos. Specifically, we introduce a framework to generate accurate ground-truth optical flow annotations quickly and in large amounts from any readily available single real picture. Given an image, we use an off-the-shelf monocular depth estimation network to build a plausible point cloud for the observed scene. Then, we virtually move the camera in the reconstructed environment with known motion vectors and rotation angles, allowing us to synthesize both a novel view and the corresponding optical flow field connecting each pixel in the input image to the one in the new frame. When trained with our data, state-of-the-art optical flow networks achieve superior generalization to unseen real data compared to the same models trained either on annotated synthetic datasets or unlabeled videos, and better specialization if combined with synthetic images.

Usage

Install the project requirements in a new python 3 environment:

virtualenv -p python3 learning_flow_env
source learning_flow_env/bin/activate
pip install -r requirements.txt

Compile the forward_warping module, written in C (required to handle warping collisions):

cd external/forward_warping
bash compile.sh
cd ../..

You are now ready to run the depthstillation.py script:

python depthstillation.py

By switching some parameters you can generate all the qualitatives provided in the supplementary material.

These parameters are:

num_motions: changes the number of virtual motions
segment: enables instance segmentation (for independently moving objects)
mask_type: mask selection. Options are H' and H
num_objects: sets the number of independently moving objects (one, in this example)
no_depth: disables monocular depth and force depth to assume a constant value
no_sharp: disables depth sharpening
change_k: uses different intrinsics K
change_motion: samples a different motion (ignored if num_motions greater than 1)

For instance, to simulate a different K settings, just run:

python depthstillation.py --change_k

The results are saved in dCOCO folder, organized as follows:

depth_color: colored depth map
flow: generated flow labels (in 16bit KITTI format)
flow_color: colored flow labels
H: H mask
H': H' mask
im0: real input image
im1: generated virtual image
im1_raw: generated virtual image (pre-inpainting)
instances_color: colored instance map (if --segment is enabled)
M: M mask
M': M' mask
P: P mask

We report the list of files used to depthstill dCOCO in samples/dCOCO_file_list.txt

Supplementary

We report here the list of commands to obtain, in the same order, the Figures shown in Sections 2-4 of the Supplementary Material:

Section 2 -- the first figure is obtained with default parameters, then we use --no_depth and --no_depth --segment respectively
Section 3 -- the first figure is obtained with --no_sharp, the remaining figures with default parameters or by setting --mask_type "H".
Section 4 -- we show three times the results obtained by default parameters, followed respectively by figures generated using --change_k, --change_motion and --segment individually.

Weights

We provide RAFT models trained in our experiments. To run them and reproduce our results, please refer to RAFT repository:

Tab. 4 (C) dCOCO (D) Ch->Th->dCOCO
Tab. 5 (C) dCOCO (fine-tuned) (D) Ch->Th->dCOCO (fine-tuned)
Tab. 7 (C) dDAVIS
Tab. 8 (C) dKITTI

Contacts

m [dot] poggi [at] unibo [dot] it

Acknowledgments

Thanks to Clément Godard and Niantic for sharing monodepth2 code, used to simulate camera motion.

Our work is inspired by Jamie Watson et al., Learning Stereo from Single Images.

Comments

Png format to .flo format

Hello, Thank you for your great code. How did you train the models for optical flow where the ground truth should be in .flow but your code generates .png format?

Thank you.

opened by zhilaAI 10
Optic flow generation on custom dataset

Hello, Thank you for your great idea and code. How can we apply your code to a custom dataset? I have video frames and want to generate the optic flow for my dataset. How can I use your code? Thank you.

opened by zhilaAI 2
The result is strange in win10

Hi! Thanks for your great work.

I try to run the code in win10, and use gcc -shared -o libwarping.dll warping.c instead of compile.sh. I modified lib = cdll.LoadLibrary("external/forward_warping/libwarping.so") to lib = cdll.LoadLibrary("external/forward_warping/libwarping.dll"). Then I run depthstillation.py but obtain strange result like that (dCOCO/im1): which is quite different from the result in supply. material. Did I miss something?

opened by 07hyx06 2
warped image is strange

With command python depthstillation.py, I have tried to get warp of your sample image im0 and the first cat image in your article with code, but the result was rather strange. The cat image was a screenshot from your article and its depth map was gotten using Miads with weight "dpt_large". Here is link to the warp result of this two image im1_warped and cat_warped. Am I doing somthing wrong?

opened by wyxgoishin 1
Typo in the comment

https://github.com/mattpoggi/depthstillation/blob/a2fafcc8d78d55e77f831777c78f82e105a5f3f1/depthstillation.py#L186

I think it should be Random angles in -pi/18,pi/18, excluding -pi/36,pi/36 to avoid zeros / very small rotations.

opened by 07hyx06 1

The generation process of depth image

I use MiDaS and follow the instructions in https://pytorch.org/hub/intelisl_midas_v2/ to get the depth image, white the depth of the demo image I get is as follows: demo

It seems very different from the depth image provided in this repo: demo2

the code I use is like this:

midas = torch.hub.load("intel-isl/MiDaS", "MiDaS").cuda()
midas_transforms = torch.hub.load("intel-isl/MiDaS", "transforms")
transform = midas_transforms.default_transform

image = np.asarray(Image.open(input_image))
img = transform(image)
with torch.no_grad():
    prediction = midas(img.cuda())
    # prediction = torch.nn.functional.interpolate(
    #     prediction.unsqueeze(1),
    #     size=image.shape[:2],
    #     mode="bicubic",
    #     align_corners=False,
    # ).squeeze()
depth_image = prediction.cpu().numpy().squeeze()
depth_image = cv2.convertScaleAbs(depth_image, alpha=255/np.max(depth_image))
Image.fromarray(img, mode="L").save("demo.png")

Could you please give some details about how to get the depth image?

opened by Victarry 1

Depthstillation on real dataset

Good evening, I'd like to reproduce your depthstillation process, but on real dataset called CADDY in order to evaluate some models (pwcnet and raft). I already extracted the depth maps through MIDAS and depthstilled them, but it seems that the metrics of the models that i trained on these depthstilled data are worsening a lot... I'm attaching, from top to bottom, computed flow, depth map, original and depthstilled images.

Do you see something strange in them? Any tips about the use case? Thanks in advance!

opened by Salvatore-tech 4

Owner

GitHub

Demo code for ICCV 2021 paper "Sensor-Guided Optical Flow"

Sensor-Guided Optical Flow Demo code for "Sensor-Guided Optical Flow", ICCV 2021 This code is provided to replicate results with flow hints obtained f

10 Mar 16, 2022

git git《Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking》(CVPR 2021) GitHub:git2] 《Masksembles for Uncertainty Estimation》(CVPR 2021) GitHub:git3]

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking Ning Wang, Wengang Zhou, Jie Wang, and Houqiang Li Accepted by CVPR

236 Dec 22, 2022

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation Prerequisite Please create and activate the following conda envrionment. To r

87 Jan 8, 2023

An implementation demo of the ICLR 2021 paper Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks in PyTorch.

Neural Attention Distillation This is an implementation demo of the ICLR 2021 paper Neural Attention Distillation: Erasing Backdoor Triggers from Deep

84 Jan 4, 2023

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

InversePrompting Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting Code: The code is provided in the "chinese_ip"

101 Dec 16, 2022

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, CVPR 2021. Ayan Kumar Bhunia, Pinaki nath Chowdhury, Yongxin Yan

44 Dec 12, 2022

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

Neural Deformation Graphs Project Page | Paper | Video Neural Deformation Graphs for Globally-consistent Non-rigid Reconstruction Aljaž Božič, Pablo P

134 Dec 16, 2022

Demo code for paper "Learning optical flow from still images", CVPR 2021.

Related tags

Overview

Depthstillation

Reference

Contents

Introduction

Usage

Supplementary

Weights

Contacts

Acknowledgments

Comments

Png format to .flo format

Optic flow generation on custom dataset

The result is strange in win10

warped image is strange

Typo in the comment

The generation process of depth image

Depthstillation on real dataset

Owner

Demo code for ICCV 2021 paper "Sensor-Guided Optical Flow"

git git《Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking》(CVPR 2021) GitHub:git2] 《Masksembles for Uncertainty Estimation》(CVPR 2021) GitHub:git3]

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

An implementation demo of the ICLR 2021 paper Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks in PyTorch.

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

Code for our CVPR 2021 paper "MetaCam+DSCE"

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

Official code for the paper: Deep Graph Matching under Quadratic Constraint (CVPR 2021)

Code for CVPR 2021 paper: Anchor-Free Person Search

Code of paper "CDFI: Compression-Driven Network Design for Frame Interpolation", CVPR 2021

Official code for the CVPR 2021 paper "How Well Do Self-Supervised Models Transfer?"

Code for the upcoming CVPR 2021 paper

the code for our CVPR 2021 paper Bilateral Grid Learning for Stereo Matching Network [BGNet]

the code of the paper: Recurrent Multi-view Alignment Network for Unsupervised Surface Registration (CVPR 2021)

Code for CVPR 2021 oral paper "Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts"

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.