Canonical Appearance Transformations

STARS Laboratory

Last update: Dec 24, 2022

Overview

CAT-Net: Learning Canonical Appearance Transformations

Code to accompany our paper "How to Train a CAT: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change".

Dependencies

numpy
matplotlib
pytorch + torchvision (1.2)
Pillow
progress (for progress bars in train/val/test loops)
tensorboard + tensorboardX (for visualization)
pyslam + liegroups (optional, for running odometry/localization experiments)
OpenCV (optional, for running odometry/localization experiments)
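Before training, it can help to confirm which of the packages above are importable in the current environment. The following is a minimal sketch, not part of the repository: the helper name missing_modules is hypothetical, and the import names (for example torch for pytorch, PIL for Pillow, cv2 for OpenCV) are assumptions mapped from the package names in the list.

```python
from importlib.util import find_spec

# Import names are assumptions mapped from the package names listed above.
REQUIRED = ["numpy", "matplotlib", "torch", "torchvision", "PIL",
            "progress", "tensorboard", "tensorboardX"]
OPTIONAL = ["pyslam", "liegroups", "cv2"]  # only for odometry/localization


def missing_modules(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    for label, names in (("required", REQUIRED), ("optional", OPTIONAL)):
        missing = missing_modules(names)
        status = "all found" if not missing else "missing " + ", ".join(missing)
        print(f"{label}: {status}")
```

This only checks that each module can be found; it does not verify versions such as the torchvision release noted above.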

Training the CAT

1. Download the ETHL dataset from here or the Virtual KITTI dataset from here.
   1. ETHL only: rename ethl1/2 to ethl1/2_static.
   2. ETHL only: Update the local paths in tools/make_ethl_real_sync.py and run python3 tools/make_ethl_real_sync.py to generate a synchronized copy of the real sequences.
2. Update the local paths in run_cat_ethl/vkitti.py and run python3 run_cat_ethl/vkitti.py to start training.
3. In another terminal run tensorboard --port [port] --logdir [path] to start the visualization server, where [port] should be replaced by a numeric value (e.g., 60006) and [path] should be replaced by your local results directory.
4. Tune in to localhost:[port] and watch the action.
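The ETHL renaming step can also be scripted. The sketch below is illustrative only and is not part of the repository: the function name rename_static_sequences is hypothetical, and it assumes that "ethl1/2" refers to two directories named ethl1 and ethl2 sitting directly under a local dataset root.

```python
from pathlib import Path


def rename_static_sequences(dataset_root, names=("ethl1", "ethl2")):
    """Rename <root>/<name> to <root>/<name>_static.

    The directory layout is an assumption. Directories that are missing,
    or whose target name already exists, are skipped. Returns the list of
    newly created paths.
    """
    root = Path(dataset_root)
    renamed = []
    for name in names:
        src = root / name
        dst = root / f"{name}_static"
        if src.is_dir() and not dst.exists():
            src.rename(dst)
            renamed.append(dst)
    return renamed
```

Calling it a second time on the same root returns an empty list, so it is safe to rerun.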

Running the localization experiments

1. Ensure the pyslam and liegroups packages are installed.
2. Update the local paths in make_localization_data.py and run python3 make_localization_data.py [dataset] to compile the model outputs into a localization_data directory.
3. Update the local paths in run_localization_[dataset].py and run python3 run_localization_[dataset].py [rgb,cat] to compute VO and localization results using either the original RGB or CAT-transformed images.
4. You can compute localization errors against ground truth using the compute_localization_errors.py script, which generates CSV files and several plots. Update the local paths and run python3 compute_localization_errors.py [dataset].
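The three commands above can be assembled programmatically once the local paths in each script have been updated. This is a minimal sketch, not part of the repository: localization_commands is a hypothetical helper, and the dataset value "ethl" in the usage example is an assumption, since the accepted [dataset] values are not listed here.

```python
import shlex


def localization_commands(dataset, image_type):
    """Build the command lines for the localization steps, in order.

    `dataset` is passed through unchanged; `image_type` must be "rgb"
    or "cat", matching the [rgb,cat] argument described above.
    """
    if image_type not in ("rgb", "cat"):
        raise ValueError("image_type must be 'rgb' or 'cat'")
    return [
        ["python3", "make_localization_data.py", dataset],
        ["python3", f"run_localization_{dataset}.py", image_type],
        ["python3", "compute_localization_errors.py", dataset],
    ]


if __name__ == "__main__":
    # "ethl" is an assumed dataset name used only for illustration.
    for cmd in localization_commands("ethl", "cat"):
        print(shlex.join(cmd))
```

Each inner list can be passed to subprocess.run(cmd, check=True) to execute the steps in sequence and stop at the first failure.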

Citation

If you use this code in your research, please cite:

@article{2018_Clement_Learning,
  author = {Lee Clement and Jonathan Kelly},
  journal = {{IEEE} Robotics and Automation Letters},
  link = {https://arxiv.org/abs/1709.03009},
  title = {How to Train a {CAT}: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change},
  year = {2018}
}

