KinectFusion implemented in Python with PyTorch

Jingwen Wang

Last update: Jan 3, 2023

Related tags

Deep Learning python slam 3d-reconstruction rgbd-slam kinect-fusion iterative-closest-point pytorch-implementation dense-slam

Overview

This is a lightweight Python implementation of KinectFusion. All the core functions (TSDF volume, frame-to-model tracking, point-to-plane ICP, raycasting, TSDF fusion, etc.) are implemented using pure PyTorch, i.e. no custom CUDA kernels.
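To make the TSDF fusion step concrete, here is a minimal sketch of the running weighted average commonly used in KinectFusion-style systems, written for a single voxel in plain Python for readability. The function name, the 6 cm truncation distance and the weight cap are illustrative assumptions and are not taken from this repository; a tensor implementation would apply the same arithmetic element-wise over the whole volume.

```python
def fuse_voxel(tsdf, weight, sdf_obs, trunc=0.06, max_weight=64.0):
    """Fuse one signed-distance observation (in metres) into a voxel.

    tsdf    : current truncated signed distance, normalised to [-1, 1]
    weight  : current accumulated weight of the voxel
    sdf_obs : signed distance from the voxel to the observed surface
              (positive in front of the surface, negative behind it)
    trunc, max_weight : hypothetical values chosen for illustration only
    """
    if sdf_obs < -trunc:
        # Far behind the observed surface: the voxel is occluded, leave it unchanged.
        return tsdf, weight
    # Truncate and normalise the observation to [-1, 1].
    d = min(1.0, sdf_obs / trunc)
    # Running weighted average with a unit weight for the new observation.
    new_tsdf = (tsdf * weight + d) / (weight + 1.0)
    # Cap the weight so the model can still adapt to new measurements.
    new_weight = min(weight + 1.0, max_weight)
    return new_tsdf, new_weight


# An empty voxel sees the surface 3 cm in front, then 3 cm behind.
t, w = fuse_voxel(1.0, 0.0, 0.03)
t, w = fuse_voxel(t, w, -0.03)
print(t, w)
```

The two observations in the usage lines average out, so the voxel ends up on the estimated surface (the zero crossing of the TSDF).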

Even without custom CUDA kernels, the system runs reasonably fast: the demo reconstructs the TUM fr1_desk sequence into a 225 x 171 x 111 TSDF volume at 2 cm resolution at around 17 FPS on a single RTX 2080 GPU (~1.5 FPS in CPU mode).
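For a sense of scale, the quoted grid size and resolution can be turned into a voxel count, a physical extent and a rough memory footprint. The sketch below is plain arithmetic; the 4 bytes per voxel (one float32 grid) is an assumption for illustration, and the actual volume may store additional per-voxel data such as weights or colour.

```python
# Grid dimensions and voxel size quoted in the text above.
DIMS = (225, 171, 111)   # voxels along each axis
VOXEL_SIZE_M = 0.02      # 2 cm


def volume_stats(dims, voxel_size_m, bytes_per_voxel=4):
    """Return (voxel count, extent in metres, MiB for one grid).

    bytes_per_voxel=4 assumes a single float32 value per voxel (an assumption).
    """
    n_voxels = dims[0] * dims[1] * dims[2]
    extent_m = tuple(round(d * voxel_size_m, 3) for d in dims)
    mem_mib = n_voxels * bytes_per_voxel / 2**20
    return n_voxels, extent_m, mem_mib


n_voxels, extent_m, mem_mib = volume_stats(DIMS, VOXEL_SIZE_M)
print(n_voxels, extent_m, round(mem_mib, 1))
```

This works out to roughly 4.27 million voxels covering about 4.5 x 3.42 x 2.22 m, or around 16 MiB per float32 grid.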

Note that this project is mainly for study purposes and is not fully optimized for accurate camera tracking.

Requirements

The core functionality is implemented in PyTorch (1.10). Open3D (0.14.0) is used for visualisation. Other important dependencies include:

numpy==1.21.2
opencv-python==4.5.5
imageio==2.14.1
scikit-image==0.19.1
trimesh==3.9.43

You can create an anaconda environment called kinfu with the required dependencies by running:

conda env create -f environment.yml
conda activate kinfu

Data Preparation

The code was tested on the TUM RGB-D dataset. After downloading the raw sequences, run the pre-processing script under dataset/. For example:

python dataset/preprocess.py --config configs/fr1_desk.yaml

There are some example config files under configs/, each corresponding to a different sequence. Set data_root to your own sequence directory before running the script. After the script finishes, a new directory processed/ will appear under your sequence directory.

Run

After obtaining the processed sequence, you can simply run kinfu.py. For example:

python kinfu.py --config configs/fr1_desk.yaml --save_dir reconstruct/fr1_desk

This performs tracking and mapping headlessly and saves the results. To visualize the tracking and reconstruction process on the fly, run instead:

python kinfu_gui.py --config configs/fr1_desk.yaml

Acknowledgement

Part of the tracking code was borrowed and modified from DeepIC. Thanks also to Binbin Xu for implementing part of the TSDF volume code, which is inspired by Andy Zeng's tsdf-fusion-python.

Comments

Code runs fine, but need clarification regarding pose matrix
Can we assume that this algorithm only needs an initial pose matrix, or does it need a pose for each step of the entire camera trajectory?

If so, how can we specify the initial pose for a custom dataset?

I checked the world_mats variable in tum_rgbd.py and tweaked it to store only the first pose value, repeated for every frame (e.g. below):

pose1 = np.array([[ 3.18131473e+02, 4.60102595e+02, -2.37143996e+02, 6.60288595e+00], [ 1.32248250e+02, -1.87081717e+02, -5.28617550e+02, 5.54961371e+02], [-3.51650973e-01, 5.42460640e-01, -7.62940395e-01, 1.31740951e+00]])

pose2 = np.array([[ 3.44746770e+02, 4.56713777e+02, -2.04209444e+02, -5.30226155e+02], [ 9.00883139e+01, -1.36841787e+02, -5.52344190e+02, 8.36896216e+02], [-3.39983783e-01, 6.50759651e-01, -6.78913032e-01, 9.25806734e-01]])

ini_pose = pose2
tentative_poses = [ini_pose for _ in range(572)]
world_mats = np.stack(tentative_poses, axis=0)

I see that if I set ini_pose = pose1 (a random value), the reconstruction breaks, but if I set ini_pose = pose2 (the first pose matrix from cameras.npz), the code works as usual. Is there something special about the initial pose?
opened by Homagn 1
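One way to sanity-check matrices like the two quoted in the comment above: their entries (values in the hundreds in the first two rows, a small triple in the third) are consistent with 3x4 projection matrices of the form P = K [R | t] rather than bare camera poses. This reading is an assumption about the data format and is not confirmed by the repository. If the last row of K is [0, 0, 1], the first three entries of the third row of P equal the third row of R and must have unit norm. The sketch below checks that property in plain Python; the helper name is hypothetical.

```python
import math

# Third rows of the two 3x4 matrices quoted in the comment above.
POSE1_ROW3 = [-3.51650973e-01, 5.42460640e-01, -7.62940395e-01, 1.31740951e+00]
POSE2_ROW3 = [-3.39983783e-01, 6.50759651e-01, -6.78913032e-01, 9.25806734e-01]


def rotation_row_norm(row3):
    """Norm of the first three entries of the third row of a 3x4 matrix.

    For P = K [R | t] with K's last row equal to [0, 0, 1] (an assumption),
    these entries are the third row of the rotation R, so the norm should be 1.
    """
    return math.sqrt(sum(v * v for v in row3[:3]))


for name, row in (("pose1", POSE1_ROW3), ("pose2", POSE2_ROW3)):
    print(name, round(rotation_row_norm(row), 6))
```

A norm close to 1 is consistent with the projection-matrix reading but does not prove it. Both quoted matrices satisfy the check, which would suggest that pose1 is a valid matrix for some other viewpoint rather than an arbitrary one.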

