A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery

Aatif Jiwani

Last update: Dec 8, 2022

Related tags

Deep Learning rgb-footprint-extract

Overview

A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery

This repository is the official implementation of A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery by Aatif Jiwani, Shubhrakanti Ganguly, Chao Ding, Nan Zhou, and David Chan.

Requirements

To install GDAL/georaster, please follow this doc for instructions.
Install other dependencies from requirements.txt

pip install -r requirements.txt

Datasets

Downloading the Datasets

To download the AICrowd dataset, please go here. You will have to either create an account or sign in to access the training and validation set. Please store the training/validation set inside <root>/AICrowd/<train | val> for ease of conversion.
To download the Urban3D dataset, please run:

aws s3 cp --recursive s3://spacenet-dataset/Hosted-Datasets/Urban_3D_Challenge/01-Provisional_Train/ <root>/Urban3D/train
aws s3 cp --recursive s3://spacenet-dataset/Hosted-Datasets/Urban_3D_Challenge/02-Provisional_Test/ <root>/Urban3D/test

To download the SpaceNet Vegas dataset, please run:

aws s3 cp s3://spacenet-dataset/spacenet/SN2_buildings/tarballs/SN2_buildings_train_AOI_2_Vegas.tar.gz <root>/SpaceNet/Vegas/
aws s3 cp s3://spacenet-dataset/spacenet/SN2_buildings/tarballs/AOI_2_Vegas_Test_public.tar.gz <root>/SpaceNet/Vegas/

tar xvf <root>/SpaceNet/Vegas/SN2_buildings_train_AOI_2_Vegas.tar.gz
tar xvf <root>/SpaceNet/Vegas/AOI_2_Vegas_Test_public.tar.gz

Converting the Datasets

Please use our provided dataset converters to process the datasets. For all converters, please look at the individual files for an example of how to use them.

For AICrowd, use datasets/converters/cocoAnnotationToMask.py.
For Urban3D, use datasets/converters/urban3dDataConverter.py.
For SpaceNet, use datasets/converters/spaceNetDataConverter.py

Creating the Boundary Weight Maps

In order to train with the exponentially weighted boundary loss, you will need to create the weight maps as a pre-processing step. Please use datasets/converters/weighted_boundary_processor.py and follow the example usage. The inc parameter is specified for computational reasons. Please decrease this value if you notice very high memory usage.

Note: these maps are not required for evaluation / testing.

Training and Evaluation

To train / evaluate the DeepLabV3+ models described in the paper, please use train_deeplab.sh or test_deeplab.sh for your convenience. We employ the following primary command-line arguments:

Parameter	Default	Description (final argument)
--backbone	`resnet`	The DeeplabV3+ backbone (final method used `drn_c42`)
--out-stride	16	The backbone compression facter (8)
--dataset	`urban3d`	The dataset to train / evaluate on (other choices: `spaceNet`, `crowdAI`, `combined`)
--data-root	`/data/`	Please replace this with the root folder of the dataset samples
--workers	2	Number of workers for dataset retrieval
--loss-type	`ce_dice`	Type of objective function. Use `wce_dice` for exponentially weighted boundary loss
--fbeta	1	The beta value to use with the F-Beta Measure (0.5)
--dropout	`0.1 0.5`	Dropout values to use in the DeepLabV3+ (0.3 0.5)
--epochs	None	Number of epochs to train (60 for train, 1 for test)
--batch-size	None	Batch size (3/4)
--test-batch-size	None	Testing Batch Size (1/4)
--lr	`1e-4`	Learning Rate (`1e-3`)
--weight-decay	`5e-4`	L2 Regularization Constant (`1e-4`)
--gpu-ids	`0`	GPU Ids (Use `--no-cuda` for only CPU)
--checkname	None	Experiment name
--use-wandb	False	Track experiment using WandB
--resume	None	Experiment name to load weights from (i.e. `urban` for `weights/urban/checkpoint.pth.tar`)
--evalulate	False	Enable this flag for testing
--best-miou	False	Enable this flag to get best results when testing
--incl-bounds	False	Enable this flag when training with `wce_dice` as a loss

To train with the cross-task training strategy, you need to:

Train a model using --dataset=combined until the best loss has been achieved
Train a model using --resume=<checkname> on one of the three primary datasets until the best mIoU is achieved

Pre-Trained Weights

We provide pre-trained model weights in the weights/ directory. Please use Git LFS to download these weights. These weights correspond to our best model on all three datasets.

Results

Our final model is a DeepLavV3+ module with a Dilated ResNet C42 backbone trained using the F-Beta Measure + Exponentially Weighted Cross Entropy Loss (Beta = 0.5). We employ the cross-task training strategy only for Urban3D and SpaceNet.

Our model achieves the following:

Dataset	Avg. Precision	Avg. Recall	F1 Score	mIoU
Urban3D	83.8%	82.2%	82.4%	83.3%
SpaceNet	91.4%	91.8%	91.6%	90.2%
AICrowd	96.2%	96.3%	96.3%	95.4%

Acknowledgements

We would like to thank jfzhang95 for his DeepLabV3+ model and training template. You can access this repository here

Comments

inference on single image?

Hi,

Thanks for the great work! Would you please suggest how to run inference on a single image? I would like to test the model on my own urban satellite data. Thanks!

opened by ydzhang12345 7
Unable to Download Weights

Hello, I saw the research work and am looking forward to try it's results on my custom dataset but when I clone into github it's not allowing me to download the weights file. ERROR : This repository is over its data quota. Account responsible for LFS bandwidth should purchase more data packs to restore access. @aatifjiwani Please upload the weights file and share a link of drive or any other source
model weights

opened by Ankit-Vohra 4
pretrained model weights assume using GPU

FYI -- the pretrained model weights don't work if you aren't using a GPU as the model is only wrapped in DataParallel when you have a GPU here -- https://github.com/aatifjiwani/rgb-footprint-extract/blob/main/models/deeplab/evaluate.py#L58

opened by calebrob6 1
model_arg is not defined

The evaluate script does not run due to model_arg not being defined -- https://github.com/aatifjiwani/rgb-footprint-extract/blob/main/models/deeplab/evaluate.py#L59.

opened by calebrob6 1
testing model on a cpu-only machine

Hi :) i run the infer_deeplab.sh setting --no-cuda but i get this error:

RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.

Can you help me? Thank you

I added map_location=torch.device('cpu') in torch.load in loader.py

opened by teresalisanti 0
URLError about class Tensor in evaluate file

Hi, I met a error called URLError when I use the class of evaluate file and the information shows below: File "inference.py", line 84, in main tester = Tester(args) File "/root/autodl-tmp/models/deeplab/evaluate.py", line 47, in init self.model = DeepLab( File "/root/autodl-tmp/models/deeplab/modeling/deeplab.py", line 22, in init self.backbone = build_backbone(backbone, output_stride, BatchNorm) File "/root/autodl-tmp/models/deeplab/modeling/backbone/init.py", line 11, in build_backbone return drn.drn_c_42(BatchNorm) File "/root/autodl-tmp/models/deeplab/modeling/backbone/drn.py", line 324, in drn_c_42 pretrained = model_zoo.load_url(model_urls['drn-c-42']) File "/root/miniconda3/lib/python3.8/site-packages/torch/hub.py", line 727, in load_state_dict_from_url download_url_to_file(url, cached_file, hash_prefix, progress=progress) File "/root/miniconda3/lib/python3.8/site-packages/torch/hub.py", line 593, in download_url_to_file u = urlopen(req) File "/root/miniconda3/lib/python3.8/urllib/request.py", line 222, in urlopen return opener.open(url, data, timeout) File "/root/miniconda3/lib/python3.8/urllib/request.py", line 525, in open response = self._open(req, data) File "/root/miniconda3/lib/python3.8/urllib/request.py", line 542, in _open result = self._call_chain(self.handle_open, protocol, protocol + File "/root/miniconda3/lib/python3.8/urllib/request.py", line 502, in _call_chain result = func(*args) File "/root/miniconda3/lib/python3.8/urllib/request.py", line 1383, in http_open return self.do_open(http.client.HTTPConnection, req) File "/root/miniconda3/lib/python3.8/urllib/request.py", line 1357, in do_open raise URLError(err) urllib.error.URLError: <urlopen error [Errno 113] No route to host> It turns out to be that the pretrain of DRN couldn't be download from http://dl.yf.io/drn/, and I can't open this url, do you have any solutions?

opened by walking-shadow 0
I want to ask a question about test,please~

About AICrowd dataset,train and val datasets have their corresponding annotations ,however,test have not.

After you train a model , what you use to test?

opened by zhenghan408 0
Best model weights

Could you release the weights for the best model, i.e. the one described as, "Our final model is a DeepLavV3+ module with a Dilated ResNet C42 backbone trained using the F-Beta Measure + Exponentially Weighted Cross Entropy Loss (Beta = 0.5). We employ the cross-task training strategy only for Urban3D and SpaceNet"?

opened by calebrob6 2

Owner

Aatif Jiwani

Hey! I am Aatif Jiwani, and I am currently a Machine Learning Engineer at C3.ai. Previously, I studied EECS at UC Berkeley and did research at BAIR and LBNL.

GitHub https://arxiv.org/abs/2104.01263

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery by Ailong Ma, Junjue Wang*, Yanfei Zhon

43 Jan 5, 2023

Code and model benchmarks for "SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology"

NeurIPS 2020 SEVIR Code for paper: SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology Requirement

USAF - MIT Artificial Intelligence Accelerator

46 Dec 15, 2022

Train a deep learning net with OpenStreetMap features and satellite imagery.

DeepOSM Classify roads and features in satellite imagery, by training neural networks with OpenStreetMap (OSM) data. DeepOSM can: Download a chunk of

1.3k Nov 24, 2022

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

2 Jan 5, 2022

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

InfoPro-Pytorch The Information Propagation algorithm for training deep networks with local supervision. (ICLR 2021) Revisiting Locally Supervised Lea

78 Dec 27, 2022

DSAC* for Visual Camera Re-Localization (RGB or RGB-D)

DSAC* for Visual Camera Re-Localization (RGB or RGB-D) Introduction Installation Data Structure Supported Datasets 7Scenes 12Scenes Cambridge Landmark

143 Dec 22, 2022

CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images

Code and result about CCAFNet(IEEE TMM) 'CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images' IEE

14 Dec 29, 2021

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Shape-aware Convolutional Layer (ShapeConv) PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentatio

82 Dec 29, 2022

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Panoptic NeRF Project Page | Paper | Dataset Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation Xiao Fu*, Shangzhan zhang*,

111 Dec 16, 2022

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation Xiao Fu1* Shangzhan Zhang1* Tianrun Chen1 Yichong Lu1 Lanyun Zhu2 Xi

37 May 17, 2022

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

ActNN : Activation Compressed Training This is the official project repository for ActNN: Reducing Training Memory Footprint via 2-Bit Activation Comp

178 Jan 5, 2023

Calculates carbon footprint based on fuel mix and discharge profile at the utility selected. Can create graphs and tabular output for fuel mix based on input file of series of power drawn over a period of time.

carbon-footprint-calculator Conda distribution ~/anaconda3/bin/conda install anaconda-client conda-build ~/anaconda3/bin/conda config --set anaconda_u

Seattle university Renewable energy research

7 Sep 26, 2022

A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery

Related tags

Overview

A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery

Requirements

Datasets

Downloading the Datasets

Converting the Datasets

Creating the Boundary Weight Maps

Training and Evaluation

Pre-Trained Weights

Results

Acknowledgements

Comments

Owner

Aatif Jiwani

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

Code and model benchmarks for "SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology"

Train a deep learning net with OpenStreetMap features and satellite imagery.

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

DSAC* for Visual Camera Re-Localization (RGB or RGB-D)

CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Calculates carbon footprint based on fuel mix and discharge profile at the utility selected. Can create graphs and tabular output for fuel mix based on input file of series of power drawn over a period of time.

Attention for PyTorch with Linear Memory Footprint

PyTorchMemTracer - Depict GPU memory footprint during DNN training of PyTorch

Recall Loss for Semantic Segmentation (This repo implements the paper: Recall Loss for Semantic Segmentation)

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

Experiments on Flood Segmentation on Sentinel-1 SAR Imagery with Cyclical Pseudo Labeling and Noisy Student Training

Roach: End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

Urban mobility simulations with Python3, RLlib (Deep Reinforcement Learning) and Mesa (Agent-based modeling)