An implementation for the ICCV 2021 paper Deep Permutation Equivariant Structure from Motion.

Last update: Dec 27, 2022

Overview

Deep Permutation Equivariant Structure from Motion

Paper | Poster

This repository contains an implementation for the ICCV 2021 paper Deep Permutation Equivariant Structure from Motion.

The paper proposes a neural network architecture that, given a set of point tracks in multiple images of a static scene, recovers both the camera parameters and a (sparse) scene structure by minimizing an unsupervised reprojection loss. The method does not require initialization of camera parameters or 3D point locations and is implemented for two setups: (1) single scene reconstruction and (2) learning from multiple scenes.
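As a rough illustration of what a reprojection loss measures, the sketch below computes the mean distance between observed image points and the projections of estimated 3D points through estimated cameras. It is a standard pinhole reprojection error written with NumPy, not the exact loss used in the paper; the function name `reprojection_error` and the visibility `mask` argument are assumptions made for this example.

```python
import numpy as np

def reprojection_error(P, X, x_obs, mask):
    """Mean reprojection error over visible observations (illustrative only).

    P:     [m, 3, 4] camera projection matrices
    X:     [n, 3]    3D points
    x_obs: [m, n, 2] observed 2D image points
    mask:  [m, n]    True where point j is observed in image i
    """
    n = X.shape[0]
    X_h = np.hstack([X, np.ones((n, 1))])           # homogeneous points, [n, 4]
    proj = np.einsum("mij,nj->mni", P, X_h)         # projected points, [m, n, 3]
    x_proj = proj[..., :2] / proj[..., 2:3]         # perspective division, [m, n, 2]
    err = np.linalg.norm(x_proj - x_obs, axis=-1)   # per-observation distance, [m, n]
    return err[mask].mean()
```

An unsupervised method can minimize a quantity of this kind directly, because it depends only on the observed point tracks and the predicted cameras and points, not on ground-truth geometry.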

Setup
How to use
Citation

Setup

This repository is implemented in Python 3.8. Running bundle adjustment requires Linux.

Folders

The repository should contain the following folders:

Equivariant-SFM
├── bundle_adjustment
├── code
├── datasets
│   ├── Euclidean
│   └── Projective
├── environment.yml
└── results

Conda environment

Create the environment using one of the following commands:

conda create -n ESFM -c pytorch -c conda-forge -c comet_ml -c plotly  -c fvcore -c iopath -c bottler -c anaconda -c pytorch3d python=3.8 pytorch cudatoolkit=10.2 torchvision pyhocon comet_ml plotly pandas opencv openpyxl xlrd cvxpy fvcore iopath nvidiacub pytorch3d eigen cmake glog gflags suitesparse gxx_linux-64 gcc_linux-64 dask matplotlib
conda activate ESFM

Or:

conda env create -f environment.yml
conda activate ESFM

Then follow the bundle adjustment instructions.

Data

Download the data from this link.

The model supports both calibrated cameras (Euclidean reconstruction) and uncalibrated cameras (projective reconstruction).

The input for the model is an observed points matrix of size [m,n,2] where the entry [i,j] is a 2D image point that corresponds to camera (image) number i and 3D point (point track) number j.

In practice we use a correspondence matrix representation of size [2*m,n], where the entries [2*i,j] and [2*i+1,j] form the [i,j] image point.
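The relation between the two representations can be sketched with NumPy as follows. The helper names `to_correspondence_matrix` and `from_correspondence_matrix` are hypothetical and not part of the repository, and the example assumes that row 2*i holds the x coordinates and row 2*i+1 the y coordinates of camera i.

```python
import numpy as np

def to_correspondence_matrix(points):
    """Convert observed points [m, n, 2] to a correspondence matrix [2*m, n].

    Assumption: row 2*i holds x coordinates and row 2*i+1 holds y coordinates
    of the points seen by camera i.
    """
    m, n, _ = points.shape
    # [m, n, 2] -> [m, 2, n] -> [2*m, n], so the row index becomes 2*i + k
    return points.transpose(0, 2, 1).reshape(2 * m, n)

def from_correspondence_matrix(M):
    """Inverse conversion: [2*m, n] -> [m, n, 2]."""
    two_m, n = M.shape
    return M.reshape(two_m // 2, 2, n).transpose(0, 2, 1)
```

With these helpers, `to_correspondence_matrix(points)[2*i:2*i+2, j]` is the image point `points[i, j]`.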

For the calibrated setting, the input must include m calibration matrices of size [3,3].

How to use

Optimization

To optimize a calibrated scene, run:

python single_scene_optimization.py --conf Optimization_Euc.conf

To optimize an uncalibrated scene, run:

python single_scene_optimization.py --conf Optimization_Proj.conf

The following examples use the calibrated setting; they work the same way for the uncalibrated setting with the corresponding Proj config file.

You can choose which scene to optimize either by editing the 'dataset.scan' field in the config file or from the command line:

python single_scene_optimization.py --conf Optimization_Euc.conf --scan [scan_name]

Similarly, you can override any value of the config file from the command line. For example, to change the number of training epochs and the evaluation frequency use:

python single_scene_optimization.py --conf Optimization_Euc.conf --external_params "train:num_of_epochs:1e+5,train:eval_intervals:100"
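The override string is a comma-separated list of `section:key:value` triples. The sketch below shows one way such a string could be turned into nested overrides. It is a hypothetical parser written for illustration (the name `parse_external_params` and the value conversion rules are assumptions), and the repository's own parsing may differ.

```python
def _convert(raw):
    """Best-effort conversion of a value string to int, then float, else str."""
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            continue
    return raw

def parse_external_params(spec):
    """Parse 'section:key:value,section:key:value' into a nested dict."""
    overrides = {}
    for item in spec.split(","):
        # Split into at most three parts so the value itself may contain ':'
        section, key, raw = item.strip().split(":", 2)
        overrides.setdefault(section, {})[key] = _convert(raw)
    return overrides

overrides = parse_external_params("train:num_of_epochs:1e+5,train:eval_intervals:100")
```

Note that under these conversion rules `1e+5` is read as a float, so a consumer that needs an integer epoch count would have to cast it.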

Learning

To run the learning setup:

python multiple_scenes_learning.py --conf Learning_Euc.conf

Or for the uncalibrated setting:

python multiple_scenes_learning.py --conf Learning_Proj.conf

To override parameters from the config file, you can either edit the file itself or use the same --external_params option as in the optimization setting:

python multiple_scenes_learning.py --conf Learning_Euc.conf --external_params "train:num_of_epochs:1e+5,train:eval_intervals:100"

Citation

If you find this work useful, please cite:

@InProceedings{Moran_2021_ICCV,
    author    = {Moran, Dror and Koslowsky, Hodaya and Kasten, Yoni and Maron, Haggai and Galun, Meirav and Basri, Ronen},
    title     = {Deep Permutation Equivariant Structure From Motion},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {5976-5986}
}

Comments

Segment error (core dumped)

Hello author, nice work! However, when I train on your data, I always get a "Segment error (core dumped)" error after training for a while, and then the training ends. From what I found online, it may be caused by an out-of-bounds memory access or an invalid pointer access. I suspect the error most likely occurs when using Ceres for BA, but I am not very familiar with your code, so it is hard for me to locate. Have you encountered this problem? Do you have any suggestions? Because of my computer configuration (a single 3080 Ti GPU with 12 GB of memory and 16 CPU threads), I set batch_size to 2 for training and changed the number of threads used during Ceres optimization to 16.

opened by CZ-CHO 12

about results

It's a nice job, but I found that the results were bad: the point cloud couldn't even produce a normal shape. Do I need to modify any hyperparameters? Also, I ran two experiments, with and without Locus point, but the results seem the same. Thanks!

opened by ajunyo 5

Front-end / point track source

Really nice work. @drormoran, I was curious where the point tracks come from that you use in the project? (looks like the code loads them directly from disk)

opened by johnwlambert 1
