Specificity-preserving RGB-D Saliency Detection
Authors: Tao Zhou, Huazhu Fu, Geng Chen, Yi Zhou, Deng-Ping Fan, and Ling Shao.
1. Preface
2. Overview
2.1. Introduction
RGB-D saliency detection has attracted increasing attention, due to its effectiveness and the fact that depth cues can now be conveniently captured. Existing works often focus on learning a shared representation through various fusion strategies, and few methods explicitly consider how to preserve modality-specific characteristics. In this paper, taking a new perspective, we propose a specificity-preserving network (SP-Net) for RGB-D saliency detection, which improves saliency detection performance by exploring both the shared information and the modality-specific properties (i.e., specificity). Specifically, two modality-specific networks and a shared learning network are adopted to generate individual and shared saliency maps. A cross-enhanced integration module (CIM) is proposed to fuse cross-modal features in the shared learning network, which are then propagated to the next layer to integrate cross-level information. In addition, we propose a multi-modal feature aggregation (MFA) module to integrate the modality-specific features from each individual decoder into the shared decoder, which provides rich complementary multi-modal information to boost the saliency detection performance. Further, a skip connection is used to combine hierarchical features between the encoder and decoder layers. Experiments on six benchmark datasets demonstrate that our SP-Net outperforms other state-of-the-art methods.
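To make the cross-enhanced integration idea concrete, below is a minimal PyTorch sketch of one way cross-modal features could be mutually enhanced and then fused; the class name, channel sizes, and exact operations are illustrative assumptions rather than the official CIM implementation (see the released code for that).

```python
import torch
import torch.nn as nn

class CrossEnhancedFusion(nn.Module):
    """Illustrative sketch of cross-modal feature fusion (NOT the official CIM).

    Each modality's features are enhanced by a spatial attention map derived
    from the other modality, then the two enhanced streams are merged into a
    shared representation that can be passed to the next decoder layer.
    """
    def __init__(self, channels):
        super().__init__()
        # 1x1 convolutions produce single-channel spatial attention maps.
        self.att_rgb = nn.Sequential(nn.Conv2d(channels, 1, 1), nn.Sigmoid())
        self.att_depth = nn.Sequential(nn.Conv2d(channels, 1, 1), nn.Sigmoid())
        self.merge = nn.Sequential(
            nn.Conv2d(2 * channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, f_rgb, f_depth):
        # Cross-enhance: each stream is modulated by the other stream's attention.
        f_rgb_e = f_rgb * self.att_depth(f_depth) + f_rgb
        f_depth_e = f_depth * self.att_rgb(f_rgb) + f_depth
        # Fuse the enhanced features into a shared cross-modal representation.
        return self.merge(torch.cat([f_rgb_e, f_depth_e], dim=1))
```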
2.2. Framework Overview
Figure 1: The overall architecture of the proposed SP-Net.
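As a rough companion to Figure 1, the hypothetical skeleton below shows only the overall data flow: two modality-specific branches each predict their own saliency map, while a shared branch fuses both modalities and predicts the shared map. The backbones, layer counts, and module internals here are placeholders, not the released architecture.

```python
import torch
import torch.nn as nn

class SPNetSkeleton(nn.Module):
    """Hypothetical skeleton of the data flow in Figure 1 (not the released model)."""
    def __init__(self, ch=64):
        super().__init__()
        # Toy single-layer encoders; the real model uses deep pretrained backbones.
        self.enc_rgb = nn.Sequential(nn.Conv2d(3, ch, 3, padding=1), nn.ReLU(inplace=True))
        self.enc_depth = nn.Sequential(nn.Conv2d(1, ch, 3, padding=1), nn.ReLU(inplace=True))
        # Modality-specific heads produce the individual saliency maps.
        self.head_rgb = nn.Conv2d(ch, 1, 3, padding=1)
        self.head_depth = nn.Conv2d(ch, 1, 3, padding=1)
        # The shared head consumes both modality features plus a simple
        # cross-modal interaction term (a stand-in for the CIM/MFA modules).
        self.head_shared = nn.Sequential(
            nn.Conv2d(3 * ch, ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch, 1, 3, padding=1),
        )

    def forward(self, rgb, depth):
        f_r, f_d = self.enc_rgb(rgb), self.enc_depth(depth)
        sal_rgb = self.head_rgb(f_r)          # RGB-specific saliency logits
        sal_depth = self.head_depth(f_d)      # depth-specific saliency logits
        fused = torch.cat([f_r, f_d, f_r * f_d], dim=1)
        sal_shared = self.head_shared(fused)  # shared saliency logits
        return sal_rgb, sal_depth, sal_shared
```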
2.3. Quantitative Results
2.4. Qualitative Results
Figure 2: Visual comparisons of our method and eight state-of-the-art methods.
3. Proposed Baseline
3.1. Training/Testing
The training and testing experiments are conducted using PyTorch on a single NVIDIA Tesla V100 GPU with 32 GB of memory.
- Configuring your environment (Prerequisites):
  - Install the necessary packages: `pip install -r requirements.txt` (a quick GPU check is sketched below).
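After installing the requirements, the short snippet below can confirm that PyTorch is installed and that a CUDA device is visible (the experiments in the paper use a single 32 GB Tesla V100, but any CUDA-capable GPU can be checked the same way):

```python
import torch

# Verify that PyTorch is installed and a CUDA device is visible.
print("PyTorch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```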
- Downloading necessary data (a small layout sanity check is sketched below):
  - Download the training dataset (download link (Google Drive)) and move it into `./Data/`.
  - Download the testing dataset (download link (Google Drive)) and move it into `./Data/`.
  - Download the pretrained weights (download link (Google Drive)) and move them into `./Checkpoint/SPNet/`.
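The optional snippet below only verifies that the downloaded files ended up in the directories this README references; any deeper subfolder names depend on the released archives and are not assumed here.

```python
import os

# Optional sanity check: confirm the directories referenced in this README exist.
for path in ["./Data", "./Checkpoint/SPNet"]:
    status = "found" if os.path.isdir(path) else "MISSING"
    print(f"{path}: {status}")
```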
- Train Configuration:
  - After you have downloaded the training dataset, just run `train.py` to train our model (an illustrative loss sketch follows below).
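Since the network produces three saliency maps (RGB-specific, depth-specific, and shared), training supervises all of them. The sketch below is only an assumed illustration of such joint supervision; the actual loss terms, weights, and optimizer settings are defined in `train.py` and may differ.

```python
import torch.nn.functional as F

def joint_saliency_loss(sal_rgb, sal_depth, sal_shared, gt):
    """Illustrative joint loss: supervise each predicted map with the ground truth.

    The real train.py may use different losses (e.g., weighted BCE + IoU) and a
    different weighting between the shared and modality-specific branches.
    """
    loss_rgb = F.binary_cross_entropy_with_logits(sal_rgb, gt)
    loss_depth = F.binary_cross_entropy_with_logits(sal_depth, gt)
    loss_shared = F.binary_cross_entropy_with_logits(sal_shared, gt)
    return loss_shared + loss_rgb + loss_depth
```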
- Test Configuration:
  - After you have downloaded the pre-trained model and the testing dataset, just run `test_produce_maps.py` to generate the final prediction maps, then run `test_evaluation_maps.py` to obtain the final quantitative results (a minimal map-saving sketch follows below).
  - You can also download the predicted saliency maps (download link (Google Drive)), move them into `./Predict_maps/`, and then run `test_evaluation_maps.py`.
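For reference, saving a prediction map for later evaluation typically means applying a sigmoid, resizing to the ground-truth resolution, and writing an 8-bit PNG. The hypothetical helper below sketches that step; it is not the code in `test_produce_maps.py`.

```python
import torch
import torch.nn.functional as F
import numpy as np
import cv2

def save_prediction(logits, gt_h, gt_w, out_path):
    """Convert a raw saliency logit map (shape 1x1xHxW) to an 8-bit PNG at the GT resolution."""
    prob = torch.sigmoid(logits.detach())            # map logits to [0, 1]
    prob = F.interpolate(prob, size=(gt_h, gt_w),
                         mode="bilinear", align_corners=False)
    img = (prob.squeeze().cpu().numpy() * 255).astype(np.uint8)
    cv2.imwrite(out_path, img)
```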
3.2. Evaluating your trained model
Our evaluation is implemented in Python; please refer to test_evaluation_maps.py.
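Mean absolute error (MAE) is one of the standard saliency metrics; a minimal sketch is given below, assuming predictions and ground truths are stored as grayscale images. The released `test_evaluation_maps.py` may also report additional metrics (e.g., S-measure, E-measure, F-measure).

```python
import numpy as np
import cv2

def mae(pred_path, gt_path):
    """Mean absolute error between a predicted saliency map and its ground truth."""
    pred = cv2.imread(pred_path, cv2.IMREAD_GRAYSCALE).astype(np.float64) / 255.0
    gt = cv2.imread(gt_path, cv2.IMREAD_GRAYSCALE).astype(np.float64) / 255.0
    if pred.shape != gt.shape:  # resize the prediction to match the ground truth
        pred = cv2.resize(pred, (gt.shape[1], gt.shape[0]))
    return np.abs(pred - gt).mean()
```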
4. Citation
Please cite our papers if you find this work useful. Thanks!
@inproceedings{zhouiccv2021,
title={Specificity-preserving RGB-D Saliency Detection},
author={Zhou, Tao and Fu, Huazhu and Chen, Geng and Zhou, Yi and Fan, Deng-Ping and Shao, Ling},
booktitle={International Conference on Computer Vision (ICCV)},
year={2021},
}
@article{zhoucvmj2022,
title={Specificity-preserving RGB-D Saliency Detection},
author={Zhou, Tao and Fan, Deng-Ping and Chen, Geng and Zhou, Yi and Fu, Huazhu},
journal={Computational Visual Media},
year={2022},
}