DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Kuang-Jui Hsu

Last update: Dec 22, 2022

Related tags

Deep Learning deep-learning matlab convolutional-neural-networks unsupervised-learning matconvnet weakly-supervised-learning saliency instance-segmentation co-saliency co-segmentation cvpr-2019 cvpr2019 cvpr19

Overview

[CVPR19] DeepCO³: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Authors: Kuang-Jui Hsu, Yen-Yu Lin, Yung-Yu Chuang

PDF: High-Resolution, Low-Resolution
Supplementary material: High-Resolution, Low-Resolution

Abstract

In this paper, we address a new task called instance cosegmentation. Given a set of images jointly covering object instances of a specific category, instance co-segmentation aims to identify all of these instances and segment each of them, i.e. generating one mask for each instance. This task is important since instance-level segmentation is preferable for humans and many vision applications. It is also challenging because no pixel-wise annotated training data are available and the number of instances in each image is unknown. We solve this task by dividing it into two sub-tasks, co-peak search and instance mask segmentation. In the former sub-task, we develop a CNN-based network to detect the co-peaks as well as co-saliency maps for a pair of images. A co-peak has two endpoints, one in each image, that are local maxima in the response maps and similar to each other. Thereby, the two endpoints are potentially covered by a pair of instances of the same category. In the latter subtask, we design a ranking function that takes the detected co-peaks and co-saliency maps as inputs and can select the object proposals to produce the final results. Our method for instance co-segmentation and its variant for object colocalization are evaluated on four datasets, and achieve favorable performance against the state-of-the-art methods.

Examples

Two examples of instance co-segmentation on categories bird and sheep, respectively. An instance here refers to an object appearing in an image. In each example, the top row gives the input images while the bottom row shows the instances segmented by our method. The instance-specific coloring indicates that our method produces a segmentation mask for each instance.

Overview of our method

The proposed method contains two stages, co-peak search within the blue-shaded background and instance mask segmentation within the red-shaded background. For searching co-peaks in a pair of images, our model extracts image features, estimates their co-saliency maps, and performs feature correlation for co-peak localization. The model is optimized by three losses, including the co-peak loss, the affinity loss, and the saliency loss. For instance mask segmentation, we design a ranking function taking the detected co-peaks, the co-saliency maps, and the object proposals as inputs, and select the top-ranked proposal for each detected instance.

Results

Instance co-segmentation

The performance of instance co-segmentation on the four collected datasets is shown. The numbers in red and green show the best and the second best results, respectively. The column “trained” indicates whether additional training data are used.

Object co-localization

The performance of object co-localization on the four datasets is shown. The numbers in red and green indicate the best and the second best results, respectively. The column “trained” indicates whether additional training data are used.

Please cite our paper if this code is useful for your research.


@inproceedings{HsuCVPR19,
  author = {Kuang-Jui Hsu and Yen-Yu Lin and Yung-Yu Chuang},
  booktitle = {IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)},
  title = {DeepCO$^3$: Deep Instance Co-segmentation by Co-peak Search and Co-saliency Detection},
  year = {2019}
}

Codes for DeepCO³

Contact: Kuang-Jui Hsu
Last update: 2019/04/09
Platform: Ubuntu 14.04, MatConvnet 1.0-beta24 (Don't support any installation problem of MatConvnet.)

Demo for all stages: "RunDeepInstCoseg.m"

Including all files in "Lib" (Downloading MatConvnet is not necessary)
May be slightly different from the ones in paper because of the randdom seeds

Datasets (about 34 GB):

Including four collected datasets
Containing the images, ground-truth masks, salinecy maps and object proposals
GoogleDrive

Results reported in the papers (about 4 GB):

Only including the results of DeepCO³
GoogleDrive

Download Codes from GoogleDrive :

GoogleDrive

Errata:

Thank Howard Yu-Chun Lo for pointing the typo in Eq. (4). The corrected one is listed in the following:

Comments

Experiment Problem

I've run the demo for 22 categories. However, the result seems that there is a difference compared to your result. And I don't know my experiment's problem.

Thanks a lot!

opened by huang1997214 7
about mcg

Hi, @KuangJuiHsu ,

You compare your method with C2SNet, a saliency detection method, I, therefore, tried to reimplement the instance segmentation result based on the contour map(from C2SNet) and the MCG method. However, I met the problem that the contour map generated by C2SNet is different from the contour map of UCMs, which finally leads the MCG cannot generate segment proposals.

Could you give me any suggestions about your implementation detail?

Thanks in advance. Xin

opened by GrassBro 5
Number of images required to obtain results?
Hi,

Great work and very impressive results. I have three problems related to the paper and hope you could help me solve this:

Based on your experience, how many images of a certain class are required to train DeepCO3 to converge well? In your provided dataset, the number of images of cows, sheep and horses are more than that of trains, air-plane, bus. Correspondingly, results on categories with more images seem better than others. (I suppose we train different DeepCO3 for different classes instead of mixing them.)

If i understand correctly, the whole pipeline of DeepCO3 is fully un-supervised, isn't it?

You mentioned in Sec. 3.3 you used MCG for object proposals which is unsupervised. However, MCG seems to use BSD dataset to get a good setup for parameters. Do you think whether this minimum amount of supervised learning affects the definition of "unsupervised" for MCG, or DeepCO3?

Thanks.
opened by Harry-Zhi 4
About the co-peak size

Hi,

That's a very interesting work and I have a question about this paper and hope you can help me.

You resize the images to 448x448 and stimulate the co-peak with fixed 3x3x3x3 size. However the object sizes are various in different images, how does the co-peak loss to handle this varience?

Thanks.

opened by Spark001 3
Training procedures and datasets

Hi, Great work! It is very promising.

I have some problems about the training procedures and datasets: (1) The training of co-peak module, and the testing of final instance co-segmentation share the same datasets, i.e. COCO-VOC, COCO-NONVOC, VOC12, and SOC? I am confused about this. (2) The training of co-peak module is unsupervised or weakly supervised (images sharing the semantically related objects), thus the validation is conducted upon the three losses, is that right? Because you indicate that the optimization procedure stops after 40 epochs. (3) The training and validation data of co-peak module are the same, but shuffling in each batch?

Thanks.

opened by JingweiQu 2
Results of other methods!

Hi, could you share the results of all other instance-aware results compared in Table2 of your paper, to promote the rapid development of instance co-segmentation, thanks a lot!

opened by ltp1995 1

Owner

Kuang-Jui Hsu

GitHub

Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion (CVPR'2021, Oral)

DSA^2 F: Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion (CVPR'2021, Oral) This repo is the official imp

46 Dec 21, 2022

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

This project is a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

147 Dec 3, 2022

Code of Classification Saliency-Based Rule for Visible and Infrared Image Fusion

CSF Code of Classification Saliency-Based Rule for Visible and Infrared Image Fusion Tips: For testing: CUDA_VISIBLE_DEVICES=0 python main.py For trai

14 Oct 31, 2022

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

DHF1K =========================================================================== Wenguan Wang, J. Shen, M.-M Cheng and A. Borji, Revisiting Video Sal

126 Dec 3, 2022

[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

InsGen - Data-Efficient Instance Generation from Instance Discrimination Data-Efficient Instance Generation from Instance Discrimination Ceyuan Yang,

GenForce: May Generative Force Be with You

93 Dec 25, 2022

[ECCV 2020] Gradient-Induced Co-Saliency Detection

Gradient-Induced Co-Saliency Detection Zhao Zhang*, Wenda Jin*, Jun Xu, Ming-Ming Cheng ⭐ Project Home » The official repo of the ECCV 2020 paper Grad

35 Nov 25, 2022

source code of “Visual Saliency Transformer” (ICCV2021)

Visual Saliency Transformer (VST) source code for our ICCV 2021 paper “Visual Saliency Transformer” by Nian Liu, Ni Zhang, Kaiyuan Wan, Junwei Han, an

89 Dec 21, 2022

How to Become More Salient? Surfacing Representation Biases of the Saliency Prediction Model

49 Nov 5, 2022

Task-related Saliency Network For Few-shot learning

Task-related Saliency Network For Few-shot learning This is an official implementation in Tensorflow of TRSN. Abstract An essential cue of human wisdo

1 Nov 18, 2021

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

Saiency Map-aided GAN for RAW2RGB Mapping The PyTorch implementations and guideline for Saiency Map-aided GAN for RAW2RGB Mapping. 1 Implementations B

20 Oct 24, 2022

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

PGNet Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022, CVPR 2022 (arXiv 2204.05041) Abstract Recent salient objec

109 Dec 5, 2022

Numbering permanent and deciduous teeth via deep instance segmentation in panoramic X-rays

Numbering permanent and deciduous teeth via deep instance segmentation in panoramic X-rays In this repo, you will find the instructions on how to requ

4 Jul 21, 2022

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [BCNet, CVPR 2021] This is the official pytorch implementation of BCNet built on

434 Dec 1, 2022

Deep Text Search is an AI-powered multilingual text search and recommendation engine with state-of-the-art transformer-based multilingual text embedding (50+ languages).

Deep Text Search - AI Based Text Search & Recommendation System Deep Text Search is an AI-powered multilingual text search and recommendation engine w

19 Sep 29, 2022

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search, accepted by IJCAI 2021.

Instance-Aware Latent-Space Search This is a PyTorch implementation of the following paper: Disentangled Face Attribute Editing via Instance-Aware Lat

67 Dec 21, 2022

Complete-IoU (CIoU) Loss and Cluster-NMS for Object Detection and Instance Segmentation (YOLACT)

Complete-IoU Loss and Cluster-NMS for Improving Object Detection and Instance Segmentation. Our paper is accepted by IEEE Transactions on Cybernetics

290 Dec 25, 2022

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Mask R-CNN for Object Detection and Segmentation This is an implementation of Mask R-CNN on Python 3, Keras, and TensorFlow. The model generates bound

22.5k Jan 4, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Swin Transformer for Object Detection This repo contains the supported code and configuration files to reproduce object detection results of Swin Tran

1.4k Dec 30, 2022

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Faster R-CNN and Mask R-CNN in PyTorch 1.0 maskrcnn-benchmark has been deprecated. Please see detectron2, which includes implementations for all model

9k Jan 4, 2023

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Related tags

Overview

[CVPR19] DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Authors: Kuang-Jui Hsu, Yen-Yu Lin, Yung-Yu Chuang

Abstract

Examples

Overview of our method

Results

Codes for DeepCO3

Demo for all stages: "RunDeepInstCoseg.m"

Datasets (about 34 GB):

Results reported in the papers (about 4 GB):

Download Codes from GoogleDrive :

Errata:

Comments

Experiment Problem

about mcg

Number of images required to obtain results?

About the co-peak size

Training procedures and datasets

Results of other methods!

Owner

Kuang-Jui Hsu

Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion (CVPR'2021, Oral)

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

Code of Classification Saliency-Based Rule for Visible and Infrared Image Fusion

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

[ECCV 2020] Gradient-Induced Co-Saliency Detection

source code of “Visual Saliency Transformer” (ICCV2021)

How to Become More Salient? Surfacing Representation Biases of the Saliency Prediction Model

Task-related Saliency Network For Few-shot learning

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

Numbering permanent and deciduous teeth via deep instance segmentation in panoramic X-rays

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

Deep Text Search is an AI-powered multilingual text search and recommendation engine with state-of-the-art transformer-based multilingual text embedding (50+ languages).

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search, accepted by IJCAI 2021.

Complete-IoU (CIoU) Loss and Cluster-NMS for Object Detection and Instance Segmentation (YOLACT)

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

[CVPR19] DeepCO³: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Codes for DeepCO³