CTRL-C: Camera calibration TRansformer with Line-Classification

Last update: Nov 14, 2022

Related tags

Deep Learning CTRL-C

Overview

CTRL-C: Camera calibration TRansformer with Line-Classification

This repository contains the official code and pretrained models for CTRL-C (Camera calibration TRansformer with Line-Classification). Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung and Junho Kim. ICCV 2021.

Single image camera calibration is the task of estimating the camera parameters from a single input image, such as the vanishing points, focal length, and horizon line. In this work, we propose Camera calibration TRansformer with Line-Classification (CTRL-C), an end-to-end neural network-based approach to single image camera calibration, which directly estimates the camera parameters from an image and a set of line segments. Our network adopts the transformer architecture to capture the global structure of an image with multi-modal inputs in an end-to-end manner. We also propose an auxiliary task of line classification to train the network to extract the global geometric information from lines effectively. Our experiments demonstrate that CTRL-C outperforms the previous state-of-the-art methods on the Google Street View and SUN360 benchmark datasets.

Results & Checkpoints

Dataset	Up Dir (◦)	Pitch (◦)	Roll (◦)	FoV (◦)	AUC (%)	URL
Google Street View	1.80	1.58	0.66	3.59	87.29	gdrive
SUN360	1.91	1.50	0.96	3.80	85.45	gdrive

Preparation

Clone this repository

Setup environments

conda create -n ctrlc python
conda activate ctrlc
conda install -c pytorch torchvision

pip install -r requrements.txt

Training Datasets

Google Street View dataset
SUN360 dataset
- You need to preprocess the dataset

Training

Single GPU

python main.py --config-file 'config-files/ctrl-c.yaml' --opts OUTPUT_DIR 'logs'

Multi GPU

python -m torch.distributed.launch --nproc_per_node=4 --use_env main.py --config-file 'config-files/ctrl-c.yaml' --opts OUTPUT_DIR 'logs'

Evaluation

python test.py --dataset 'GoogleStreetView' --opts OUTPUT_DIR 'outputs'

Citation

If you use this code for your research, please cite our paper:

@InProceedings{Lee:2021:ICCV,
    Title     = {{CTRL-C: Camera calibration TRansformer with Line-Classification}},
    Author    = {Jinwoo Lee and Hyunsung Go and Hyunjoon Lee and Sunghyun Cho and Minhyuk Sung and Junho Kim},    
    Booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    Year      = {2021},
}

License

CTRL-C is released under the Apache 2.0 license. Please see the LICENSE file for more information.

Acknowledgments

This code is based on the implementations of DETR: End-to-End Object Detection with Transformers.

You might also like...

Improving Calibration for Long-Tailed Recognition (CVPR2021)

MiSLAS Improving Calibration for Long-Tailed Recognition Authors: Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia [arXiv] [slide] [BibTeX] Introductio

116 Dec 20, 2022

Improving Calibration for Long-Tailed Recognition (CVPR2021)

19 Apr 28, 2021

Pytorch Implementation of Spiking Neural Networks Calibration, ICML 2021

SNN_Calibration Pytorch Implementation of Spiking Neural Networks Calibration, ICML 2021 Feature Comparison of SNN calibration: Features SNN Direct Tr

60 Dec 27, 2022

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

S2-BNN (Self-supervised Binary Neural Networks Using Distillation Loss) This is the official pytorch implementation of our paper: "S2-BNN: Bridging th

52 Dec 24, 2022

Comments

focal scale
Dear author:

I have a question about the focal result. I see you resize the input image by:

image = cv2.resize(image, dsize=(self.input_width, self.input_height))

Then the resized image will be inputted into the model. Following, the focal result will be obtained by fov and original size.

Therefore, do you mean the focal length has no relationship with the change of the size of the image? Becasue the input image size of the model is 512x512, but the original image size is 640x640.

Thank you! Best regards!
opened by songxujay 7

Dataset zip file is broken

Hi, thank you for the nice work!

The GSV zip file from the gdrive link seems to be broken. Could you help fix that? Thank you!

unzip google_street_view_191210.zip

Archive:  google_street_view_191210.zip
warning [google_street_view_191210.zip]:  611663008 extra bytes at beginning or within zipfile
  (attempting to process anyway)
error [google_street_view_191210.zip]:  start of central directory not found;
  zipfile corrupt.
  (please check that you have transferred or created the zipfile in the
  appropriate BINARY mode and that you have compiled UnZip properly)

opened by jinlinyi 0

CTRL-C: Camera calibration TRansformer with Line-Classification

Related tags

Overview

CTRL-C: Camera calibration TRansformer with Line-Classification

Results & Checkpoints

Preparation

Training Datasets

Training

Evaluation

Citation

License

Acknowledgments

You might also like...

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Pytorch Implementation of Spiking Neural Networks Calibration, ICML 2021

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

the code for paper "Energy-Based Open-World Uncertainty Modeling for Confidence Calibration"

A python toolbox for predictive uncertainty quantification, calibration, metrics, and visualization

We utilize deep reinforcement learning to obtain favorable trajectories for visual-inertial system calibration.

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Source code of NeurIPS 2021 Paper ''Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration''

Comments

focal scale

Dataset zip file is broken

Owner

Blender add-on: Add to Cameras menu: View → Camera, View → Add Camera, Camera → View, Previous Camera, Next Camera

Camera calibration & 3D pose estimation tools for AcinoSet

Automatic Calibration for Non-repetitive Scanning Solid-State LiDAR and Camera Systems

Omnidirectional camera calibration in python

Camera-caps - Examine the camera capabilities for V4l2 cameras

Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

The comma.ai Calibration Challenge!

The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration