Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification
Suncheng Xiang
Shanghai Jiao Tong University
Overview
In this paper, we construct and label a large-scale synthetic person dataset named FineGPR with fine-grained attribute distribution. Moreover, aiming to fully exploit the potential of FineGPR and promote the efficient training from millions of synthetic data, we propose an attribute analysis pipeline AOST to learn attribute distribution in target domain, then apply style transfer network to eliminate the gap between synthetic and real-world data and thus is freely deployed to new scenarios. Experiments conducted on benchmarks demonstrate that FineGPR with AOST outperforms (or is on par with) existing real and synthetic datasets, which suggests its feasibility for re-ID and proves the proverbial less-is-more principle. We hope this fine-grained dataset could advance research towards re-ID in real scenarios.
[Paper] [Video Sample] [Related Project]
π₯
NEWS
π₯
-
[10/2021]
π£ The first FineGPR-C caption dataset involving human describing event is coming ! -
[09/2021]
π£ The large-scale synthetic person dataset FineGPR with fine-grained attribute distribution is released !
π
Table of Contents- FineGPR Introduction
- Comparison with existing datasets
- Link of the Dataset
- Method
- Results
- Extendibility
- FineGPR-Caption dataset
- Citation
- Ethical Considerations
- LICENSE
- Acknowledgements
FineGPR Introduction
The FineGPR dataset is generated by a popular GTA5 game engine that can synthesise images under controllable viewpoints, weathers,illuminations and backgrounds, as well as 13 fine-grained attributes at the identity level
Our FineGPR dataset provides fine-grained and accurately configurable annotations, including 36 different viewpoints, 7 different kinds of weathers, 7 different kinds of illuminations, and 9 different kinds of backgrounds.
π·
ViewpointDefinition of different viewpoints. Viewpoints of one identity are sampled at an interval of 10Β°, e.g. 0Β°-80Β° denotes that a person has 9 different angles in total.
π¨
and Illumination
π
WeatherThe exemplars of different weather distribution (left) and illumination distribution (right) from the proposed FineGPR dataset.
βΉοΈββοΈ
Attributes at the Identity LevelThe distributions of attributes at the identity level on FineGPR. The left figure shows the numbers of IDs for each attribute. The middle and right pies illustrate the distribution of the colors of upper-body and low-body clothes respectively.
Some visual exemplars with ID-level pedestrian attributes in the proposed FineGPR dataset, such as Wear short sleeve , Wear dress, Wear hat, Carry bag, etc.
Comparison with existing datasets
Some Mainstream Datasets for Person Re-Identification
For related FineGPR dataset (details of the previous related work, please refer to the our homepage GPR
dataset | IDs (ID-Attributes) | boxs | cams | weathers | illumination | scene | resolution |
---|---|---|---|---|---|---|---|
Market-1501 | 1,501 ( |
32,668 | 6 | - | - | - | low |
CUHK03 | 1,467 ( |
14,096 | 2 | - | - | - | low |
DukeMTMC-reID | 1,404 ( |
36,411 | 8 | - | - | - | low |
MSMT17 | 4,101 ( |
126,441 | 15 | - | - | - | vary |
SOMAset | 50 ( |
100,000 | 250 | - | - | - | - |
SyRI | 100 ( |
1,680,000 | 100 | - | 140 | - | - |
PersonX | 1,266 ( |
273,456 | 6 | - | - | 1 | vary |
Unreal | 3,000 ( |
120,000 | 34 | - | - | 1 | low |
RandPerson | 8,000 ( |
1,801,816 | 19 | - | - | 4 | low |
FineGPR | 1150 ( |
2,028,600 | 36 | 7 | 7 | 9 | high |
Link of the Dataset
Data of FineGPR for Viewpoint Analysis
-
SJTU Yun Drive:
- Download Link password: qbdg
-
Baidu Yun Drive:
- Download Link password: h4k5
-
Microsoft OneDrive:
Directories & Files of images
FineGPR_Dataset
βββ FineGPR/ # This file is our original dataset, we provide the samples of ID=0001 and ID=0003 in this file folder.
β βββ 0001
β β βββ 0001_c01_w01_l01_p01.jpg
β β βββ 0001_c01_w01_l02_p01.jpg
β β βββ 0001_c01_w01_l03_p01.jpg
β β βββ ...
β βββ 0003/
β β βββ 0003_c01_w01_l01_p06.jpg
β β βββ 0003_c01_w01_l02_p06.jpg
β β βββ 0003_c01_w01_l03_p06.jpg
β β βββ ...
β βββ ...
βββ FineGPR_subset # This file is the subset of FineGPR dataset, each Identity contains 4 images.
β βββ 0001_c01_w03_l05_p03.jpg
β βββ 0001_c10_w03_l05_p03.jpg
β βββ 0001_c19_w03_l05_p03.jpg
β βββ 0001_c28_w03_l05_p03.jpg
β βββ 0003_c01_w03_l05_p08.jpg
β βββ 0003_c10_w03_l05_p08.jpg
β βββ 0003_c19_w03_l05_p08.jpg
β βββ 0003_c28_w03_l05_p08.jpg
β βββ ...
βββ README.md # Readme file
Name of the image
Taking "0001_c01_w01_l01_p01.jpg" as an example:
- 0001 is the id of the person
- c01 is the id of the camera
- w01 is the id of the weather
- l01 is the id of the illumination
- p01 is the id of the background
Viewpoint annotations
FineGPR
βββ c01οΌ90Β° βββ c10οΌ180Β° βββ c19οΌ270Β° βββ c28οΌ0Β°
βββ c02οΌ100Β° βββ c11οΌ190Β° βββ c20οΌ280Β° βββ c29οΌ10Β°
βββ c03οΌ110Β° βββ c12οΌ200Β° βββ c21οΌ290Β° βββ c30οΌ20Β°
βββ c04οΌ120Β° βββ c13οΌ210Β° βββ c22οΌ300Β° βββ c31οΌ30Β°
βββ c05οΌ130Β° βββ c14οΌ220Β° βββ c23οΌ310Β° βββ c32οΌ40Β°
βββ c06οΌ140Β° βββ c15οΌ230Β° βββ c24οΌ320Β° βββ c33οΌ50Β°
βββ c07οΌ150Β° βββ c16οΌ240Β° βββ c25οΌ330Β° βββ c34οΌ60Β°
βββ c08οΌ160Β° βββ c17οΌ250Β° βββ c26οΌ340Β° βββ c35οΌ70Β°
βββ c09οΌ170Β° βββ c18οΌ260Β° βββ c27οΌ350Β° βββ c36οΌ80Β°
Weather annotations
FineGPR
βββ w01οΌSunny
βββ w02οΌClouds
βββ w03οΌOvercast
βββ w04οΌFoggy
βββ w05οΌNeutral
βββ w06οΌBlizzard
βββ w07οΌSnowlight
Illumination annotations
FineGPR
βββ l01οΌMidnight
βββ l02οΌDawn
βββ l03οΌForenoon
βββ l04οΌNoon
βββ l05οΌAfternoon
βββ l06οΌDusk
βββ l07οΌNight
Scene annotations
FineGPR
βββ p01οΌUrban
βββ p02οΌUrban
βββ p03οΌWild
βββ p04οΌUrban
βββ p05οΌWild
βββ p06οΌUrban
βββ p07οΌUrban
βββ p08οΌWild
βββ p09οΌUrban
Method
Results
Performance comparison with existing Real and Synthetic datasets on Market-1501, DukeMTMC-reID and CUHK03, respectively.
References
- [1] Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identification. CVPR 2018.
- [2] Bag of tricks and a strong baseline for deep person re-identification. CVPRW 2019.
Extendibility
Accompanied with our FineGPR, we also provide some human body masks (Middle) and keypoint locations (Bottom) of all characters during the annotation. We hope that our synthetic dataset FineGPR can not only contribute a lot to the development of generalizable person re-ID, but also advance the research of other computer vision tasks, such as human part segmentation and pose estimation.
FineGPR-C caption dataset
On the basis of FineGPR dafaset, we introduce a dynamic strategy to generate high-quality captions with fine-grained attribute annotations for semantic-based pretraining. To be more specific, we rearrange the different attributes as word embeddings into caption formula in the different position, and then generate semantically dense caption with high-quality description, which gives rise to our newly constructed FineGPR-C caption dataset.
A small subset of FineGPR-C caption dataset can be downloaded from the following links:
- Microsoft OneDrive:
Citation
If you use our FineGPR dataset for your research, please cite our Paper.
@article{xiang2021less,
title={Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification},
author={Xiang, Suncheng and You, Guanjie and Guan, Mengyuan and Chen, Hao and Wang, Feng and Liu, Ting and Fu, Yuzhuo},
journal={arXiv preprint arXiv:2109.10498},
year={2021}
}
If you do think this FineGPR-C caption dataset is useful and have used it in your research, please cite our Paper.
@article{xiang2021vtbr,
title={VTBR: Semantic-based Pretraining for Person Re-Identification},
author={Xiang, Suncheng and Zhang, Zirui and Guan, Mengyuan and Chen, Hao and Yan, Binjie and Liu, Ting and Fu, Yuzhuo},
journal={arXiv preprint arXiv:2110.05074},
year={2021}
}
Ethical Considerations
Our task and dataset were created with careful attention to ethical questions, which we encountered throughout our work. Access to our dataset will be provided for research purposes only and with restrictions on redistribution. Additionally, as we filtered out the sensitive attribute name in our fine-grained attribute annotation, our dataset cannot be easily repurposed for unintended tasks. Importantly, we are very cautious of human-annotation procedure of large scale datasets towards the social and ethical implications. Furthermore, we do not consider the datasets for developing non-research systems without further processing or augmentation. We hope this fine-grained dataset will shed light into potential tasks for the research community to move forward.
LICENSE
- The FineGPR Dataset and FineGPR-C caption is made available for non-commercial purposes only.
- You will not, directly or indirectly, reproduce, use, or convey the FineGPR dataset and FineGPR-C caption dataset or any Content, or any work product or data derived therefrom, for commercial purposes.
Permissions of this strong copyleft license (GNU General Public License v3.0) are conditioned on making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. Contributors provide an express grant of patent rights.
Acknowledgements
This research was supported by the National Natural Science Foundation of China under Project (Grant No. 61977045). We would like to thank authors of FineGPR, and FineGPR-Caption dataset for their work. They provide tremendous efforts in these dataset to advance the research in this field. We also appreciate Zefang Yu, Mingye Xie and Guanjie You for insightful feedback and discussion.
For further questions and suggestions about our datasets and methods, please feel free to contact Suncheng Xiang: [email protected]