[arXiv] [Project Page]
FaPN: Feature-aligned Pyramid Network for Dense Image Prediction@inproceedings{
huang2021fapn,
title={{FaPN}: Feature-aligned Pyramid Network for Dense Image Prediction},
author={Shihua Huang and Zhichao Lu and Ran Cheng and Cheng He},
booktitle={International Conference on Computer Vision (ICCV)},
year={2021}
}
Overview
FaPN vs. FPN | Before vs. After Alignment |
---|---|
This project provides the whole official implementation for our ICCV2021 paper "FaPN: Feature-aligned Pyramid Network for Dense Image Prediction" based on Detectron2, PanoticFCN, and MaskFormer. FaPN is a simple yet effective top-down pyramidal architecture to generate multi-scale features for dense image prediction. Comprised of a feature alignment module (FAM) and a feature selection module (FSM), FaPN addresses the issue of feature alignment in the original FPN, leading to substaintial improvements on various dense prediction tasks, such as object detection, semantic, instance, panoptic segmentation, etc.
Installation
This project is based on Detectron2, which can be constructed as follows.
- Install Detectron2 following the instructions.
- Setup the dataset following the structure.
- Install DCNv2 following Install DCNv2.md.
Results
COCO Object Detection
Faster R-CNN + FaPN:
Name | lr sched |
box AP |
box APs |
box APm |
box APl |
download |
---|---|---|---|---|---|---|
R50 | 1x | 39.2 | 24.5 | 43.3 | 49.1 | model | log |
R101 | 3x | 42.8 | 27.0 | 46.2 | 54.9 | model | log |
Cityscapes Semantic Segmentation
PointRend + FaPN:
Name | lr sched |
mask mIoU |
mask i_IoU |
mask IoU_sup |
mask iIoU_sup |
download |
---|---|---|---|---|---|---|
R50 | 1x | 80.0 | 61.3 | 90.6 | 78.5 | model | log |
R101 | 1x | 80.1 | 62.2 | 90.8 | 78.6 | model | log |
ADE20K-150 Semantic Segmentation
MaskFormer + FaPN:
Name | mIoU Single-Scale |
mIoU Multi-Scale |
download |
---|---|---|---|
Swin+Large+IN21K | 55.2 | 56.7 | model | log |
COCOStuff-10K Semantic Segmentation
MaskFormer + FaPN:
Name | mIoU Single-Scale |
mIoU Multi-Scale |
download |
---|---|---|---|
R101 | 39.6 | 40.6 | model | log |
COCO Instance Segmentation
Mask R-CNN + FaPN:
Name | lr sched |
mask AP |
mask APs |
box AP |
box APs |
download |
---|---|---|---|---|---|---|
R50 | 1x | 36.4 | 18.1 | 39.8 | 24.3 | model | log |
R101 | 3x | 39.4 | 20.9 | 43.8 | 27.4 | model | log |
PointRend + FaPN:
Name | lr sched |
mask AP |
mask APs |
box AP |
box APs |
download |
---|---|---|---|---|---|---|
R50 | 1x | 37.6 | 18.6 | 39.4 | 24.2 | model | log |
COCO Panoptic Segmentation
PanopticFPN + FaPN:
Name | lr sched |
PQ | mask mIoU |
St PQ |
box AP |
Th PQ |
download |
---|---|---|---|---|---|---|---|
R50 | 1x | 41.1 | 43.4 | 32.5 | 38.7 | 46.9 | model | log |
R101 | 3x | 44.2 | 45.7 | 35.0 | 43.0 | 53.3 | model | log |
PanopticFCN + FaPN:
Name | lr sched |
PQ | mask mIoU |
St PQ |
box AP |
Th PQ |
download |
---|---|---|---|---|---|---|---|
R50 | 1x | 41.8 | 42.0 | 33.1 | 32.3 | 47.6 | model | log |
R50-600 | 3x | 43.5 | 43.5 | 35.1 | 34.5 | 49.0 | model | log |