[CVPR 2021] Forecasting the panoptic segmentation of future video frames

Niantic Labs

Last update: Nov 29, 2022

Related tags

Deep Learning pytorch semantic-segmentation cityscapes panoptic-segmentation future-prediction video-semantic-segmentation

Overview

Panoptic Segmentation Forecasting

Colin Graber, Grace Tsai, Michael Firman, Gabriel Brostow, Alexander Schwing - CVPR 2021

[Link to paper]

We propose to study the novel task of ‘panoptic segmentation forecasting’: given a set of observed frames, the goal is to forecast the panoptic segmentation for a set of unobserved frames. We also propose a first approach to forecasting future panoptic segmentations. In contrast to typical semantic forecasting, we model the motion of individual object instances and the background separately. This makes instance information persistent during forecasting, and allows us to understand the motion of each moving object.

⚙️ Setup

Dependencies

Python 3.7
PyTorch 1.5.1
pyyaml
pandas
h5py
opencv
tensorboard
tqdm
pytorch_scatter 2.0.5
cityscapesscripts (for evaluation)
Google Cloud SDK (for downloading data/models)

Install the code using the following command: pip install -e ./

Data

To run this code, the gtFine_trainvaltest dataset will need to be downloaded from the Cityscapes website into the data/ directory.
The remainder of the required data can be downloaded using the script download_data.sh. By default, everything is downloaded into the data/ directory.
Training the background model requires generating a version of the semantic segmentation annotations where foreground regions have been removed. This can be done by running the script scripts/preprocessing/remove_fg_from_gt.sh.
Training the foreground model requires additionally downloading a pretrained MaskRCNN model. This can be found at this link. This should be saved as pretrained_models/fg/mask_rcnn_pretrain.pkl.
Training the background model requires additionally downloading a pretrained HarDNet model. This can be found at this link. This should be saved as pretrained_models/bg/hardnet70_cityscapes_model.pkl.

Running our code

The scripts directory contains scripts which can be used to train and evaluate the foreground, background, and egomotion models. Specifically:

scripts/odom/run_odom_train.sh trains the egomotion prediction model.
scripts/odom/export_odom.sh exports the odometry predictions, which can then be used during evaluation by other models
scripts/bg/run_bg_train.sh trains the background prediction model.
scripts/bg/run_export_bg_val.sh exports predictions make by the background using input reprojected point clouds which come from using predicted egomotion.
scripts/fg/run_fg_train.sh trains the foreground prediction model.
scripts/fg/run_fg_eval_panoptic.sh produces final panoptic semgnetation predictions based on the trained foreground model and exported background predictions. This also uses predicted egomotion as input.

We provide our pretrained foreground, background, and egomotion prediction models. The data downloading script additionally downloads these models into the directory pretrained_models/

✏️ 📄 Citation

If you found our work relevant to yours, please consider citing our paper:

@inproceedings{graber-2021-panopticforecasting,
 title   = {Panoptic Segmentation Forecasting},
 author  = {Colin Graber and
            Grace Tsai and
            Michael Firman and
            Gabriel Brostow and
            Alexander Schwing},
 booktitle = {Computer Vision and Pattern Recognition ({CVPR})},
 year = {2021}
}

👩‍⚖️ License

An unofficial personal implementation of UM-Adapt, specifically to tackle joint estimation of panoptic segmentation and depth prediction for autonomous driving datasets.

Semisupervised Multitask Learning This repository is an unofficial and slightly modified implementation of UM-Adapt[1] using PyTorch. This code primar

11 Nov 25, 2022

PanopticBEV - Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images This r

63 Dec 16, 2022

A PyTorch implementation of the baseline method in Panoptic Narrative Grounding (ICCV 2021 Oral)

52 Dec 19, 2022

Code for "Learning to Segment Rigid Motions from Two Frames".

rigidmask Code for "Learning to Segment Rigid Motions from Two Frames". ** This is a partial release with inference and evaluation code.

157 Nov 21, 2022

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

FIERY This is the PyTorch implementation for inference and training of the future prediction bird's-eye view network as described in: FIERY: Future In

406 Dec 24, 2022

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

364 Jan 3, 2023

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

MiVOS (CVPR 2021) - Mask Propagation Ho Kei Cheng, Yu-Wing Tai, Chi-Keung Tang [arXiv] [Paper PDF] [Project Page] [Papers with Code] This repo impleme

106 Jan 3, 2023

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

RMNet This repository contains the source code for the paper Efficient Regional Memory Network for Video Object Segmentation. Cite this work @inprocee

76 Dec 14, 2022

[ICRA2021] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignment

Interactive Scene Reconstruction Project Page | Paper This repository contains the implementation of our ICRA2021 paper Reconstructing Interactive 3D

97 Dec 28, 2022

Comments

Can't run prediction and train scripts

Hi,

I successfully download the dataset using the download_data.sh and set up all the dependencies in my conda env. Then I tried to run the prediction and training scripts scripts/fg/run_fg_eval_panoptic.sh and scripts/fg/run_fg_train.sh, and faced the following errors.

python: can't open file 'panoptic_forecasting/experiments/export_cityscapes_segmentation_results.py': [Errno 2] No such file or directory python: can't open file 'panoptic_forecasting/experiments/train_model.py': [Errno 2] No such file or directory

I check the Github code and didn't find the folder experiments. Can you tell me how I can get the file panoptic_forecasting/experiments/export_cityscapes_segmentation_results.py? Thanks!

Best, Shaoyi

opened by szl0144 28
How to set a custom input timestamp in evaluation

Hi,

I see the current short-term and long-term evaluation input timestamp is fixed (Short term: 11, 14, 17, Long term: 5 8 11, label frame is 20th). Can you please tell me which code I need to focus on to set a custom input timestamp besides short term and long term?

I see the model forecasts future 3 frames in short term evaluation and 9 frames in long term evaluation, does the loss function only calculate the SQ, RQ and PQ between 3rd prediction frame and the 20th ground truth frame in short term and the loss between 9th prediction frame and the 20th ground truth in long term? The loss function doesn't calculate the loss of other prediction frames (such as 4th, 5th, 6th in this scenario) because of a lack of ground truth. Not sure about it and want to double-check it with you. Thanks!

Best, Shaoyi

opened by szl0144 15
Cannot download data

Hi,

Thanks for your work. We are trying to follow your work but we cannot download the data, although we successfully download the models.

The following two commands cannot be executed: """ gsutil -m cp gs://niantic-lon-static/research/panoptic-forecasting/preprocessed-data/fg/* data/fg/ gsutil -m cp -r gs://niantic-lon-static/research/panoptic-forecasting/preprocessed-data/bg/* data/bg/ """ Error msg : zsh: no matches found: gs://niantic-lon-static/research/panoptic-forecasting/preprocessed-data/bg/*

opened by HarborYuan 6
Fail to download dataset using the download_data.sh script

Hi,

I install the Google Cloud SDK in my virtual machine and run the dataset download script. When I cp the cloud data into my local data folder I faced an error

AccessDeniedException: 403 XXX does not have storage.objects.list access to the Google Cloud Storage bucket. CommandException: 1 file/object could not be transferred.

I am not familiar with the Google Cloud service, can you please tell me how to solve it? Thanks!

Best, Shaoyi

opened by szl0144 2

Owner

Niantic Labs

Building technologies and ideas that move us

GitHub

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation Xiao Fu1* Shangzhan Zhang1* Tianrun Chen1 Yichong Lu1 Lanyun Zhu2 Xi

37 May 17, 2022

Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).

What is judgyprophet? judgyprophet is a Bayesian forecasting algorithm based on Prophet, that enables forecasting while using information known by the

56 Oct 26, 2022

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video

1 Jan 23, 2022

[CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)

EOPSN: Exemplar-Based Open-Set Panoptic Segmentation Network (CVPR 2021) PyTorch implementation for EOPSN. We propose open-set panoptic segmentation t

49 Dec 30, 2022

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

Implementation for Pixel Consensus Voting (CVPR 2020). This codebase contains the essential ingredients of PCV, including various spatial discretizati

23 Oct 25, 2022

Implementation for Panoptic-PolarNet (CVPR 2021)

Panoptic-PolarNet This is the official implementation of Panoptic-PolarNet. [ArXiv paper] Introduction Panoptic-PolarNet is a fast and robust LiDAR po

126 Jan 1, 2023

Apply AnimeGAN-v2 across frames of a video clip

title emoji colorFrom colorTo sdk app_file pinned AnimeGAN-v2 For Videos ?? blue red gradio app.py false AnimeGAN-v2 For Videos Apply AnimeGAN-v2 acro

36 Oct 18, 2022

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction This is the code for the paper Combining E

69 Dec 26, 2022

Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."

Spacetimeformer Multivariate Forecasting This repository contains the code for the paper, "Long-Range Transformers for Dynamic Spatiotemporal Forecast

440 Jan 2, 2023

Event-forecasting - Event Forecasting Algorithms With Python

event-forecasting Event Forecasting Algorithms Theory Correlating events in comp

4 Feb 15, 2022

[CVPR 2021] Forecasting the panoptic segmentation of future video frames

Related tags

Overview

Panoptic Segmentation Forecasting

⚙️ Setup

Dependencies

Data

Running our code

✏️ 📄 Citation

👩‍⚖️ License

You might also like...

An unofficial personal implementation of UM-Adapt, specifically to tackle joint estimation of panoptic segmentation and depth prediction for autonomous driving datasets.

PanopticBEV - Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

A PyTorch implementation of the baseline method in Panoptic Narrative Grounding (ICCV 2021 Oral)

Code for "Learning to Segment Rigid Motions from Two Frames".

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

[ICRA2021] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignment

Comments

Can't run prediction and train scripts

How to set a custom input timestamp in evaluation

Cannot download data

Fail to download dataset using the download_data.sh script

Owner

Niantic Labs

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video

[CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

Implementation for Panoptic-PolarNet (CVPR 2021)

Apply AnimeGAN-v2 across frames of a video clip

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."

Event-forecasting - Event Forecasting Algorithms With Python