Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

Overview

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Introduction

Point cloud sequences are irregular and unordered in the spatial dimension while exhibiting regularities and order in the temporal dimension. Therefore, existing grid-based convolutions for conventional video processing cannot be directly applied to spatio-temporal modeling of raw point cloud sequences. In the paper, we propose a point spatio-temporal (PST) convolution to achieve informative representations of point cloud sequences. The proposed PST convolution first disentangles space and time in point cloud sequences. Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension. Furthermore, we incorporate the proposed PST convolution into a deep network, namely PSTNet, to extract features of 3D point cloud sequences in a spatio-temporally hierarchical manner.
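
To make the decomposition concrete, below is a rough, self-contained PyTorch sketch, not the authors' implementation: the ToyPSTConv name, the k-nearest-neighbour grouping (the real code uses FPS and ball query), and all tensor shapes are assumptions made for illustration. Each frame's points are first aggregated over a local spatial neighbourhood, then a 1D convolution runs over the frame dimension for each anchor point.

# Conceptual sketch of the spatial-then-temporal decomposition (illustrative only).
import torch
import torch.nn as nn

class ToyPSTConv(nn.Module):
    def __init__(self, in_channels, out_channels, k=16, temporal_kernel=3):
        super().__init__()
        # Spatial step: pointwise MLP on [relative xyz, neighbour features], max-pooled.
        self.spatial_mlp = nn.Sequential(
            nn.Conv2d(3 + in_channels, out_channels, kernel_size=1),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True),
        )
        # Temporal step: 1D convolution over the frame dimension, per anchor point.
        self.temporal_conv = nn.Conv1d(out_channels, out_channels,
                                       kernel_size=temporal_kernel,
                                       padding=temporal_kernel // 2)
        self.k = k

    def forward(self, xyz, feats):
        # xyz:   [T, N, 3]  point coordinates per frame
        # feats: [T, N, C]  point features per frame
        T, N, _ = xyz.shape
        C = feats.shape[-1]
        # k-nearest-neighbour grouping within each frame (stand-in for FPS + ball query).
        dists = torch.cdist(xyz, xyz)                                  # [T, N, N]
        idx = dists.topk(self.k, largest=False).indices                # [T, N, k]
        grouped_xyz = torch.gather(
            xyz.unsqueeze(1).expand(T, N, N, 3), 2,
            idx.unsqueeze(-1).expand(T, N, self.k, 3))                 # [T, N, k, 3]
        grouped_feats = torch.gather(
            feats.unsqueeze(1).expand(T, N, N, C), 2,
            idx.unsqueeze(-1).expand(T, N, self.k, C))                 # [T, N, k, C]
        rel_xyz = grouped_xyz - xyz.unsqueeze(2)                       # local displacements
        spatial_in = torch.cat([rel_xyz, grouped_feats], dim=-1)       # [T, N, k, 3+C]
        spatial_in = spatial_in.permute(0, 3, 1, 2)                    # [T, 3+C, N, k]
        spatial_out = self.spatial_mlp(spatial_in).max(dim=-1).values  # [T, C', N]
        # Temporal convolution: treat frames as the sequence dimension for each point.
        temporal_in = spatial_out.permute(2, 1, 0)                     # [N, C', T]
        return self.temporal_conv(temporal_in).permute(2, 0, 1)        # [T, N, C']

if __name__ == "__main__":
    conv = ToyPSTConv(in_channels=4, out_channels=32)
    out = conv(torch.rand(8, 256, 3), torch.rand(8, 256, 4))
    print(out.shape)  # torch.Size([8, 256, 32])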

Installation

The code is tested with Red Hat Enterprise Linux Workstation release 7.7 (Maipo), g++ (GCC) 8.3.1, PyTorch v1.2, CUDA 10.2 and cuDNN v7.6.

Install PyTorch v1.2:

pip install torch==1.2.0 torchvision==0.4.0
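
To quickly confirm that the installed PyTorch build sees your GPU (a generic check, not part of the original instructions):

python -c "import torch; print(torch.__version__, torch.cuda.is_available())"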

Compile the CUDA layers from PointNet++, which we use for furthest point sampling (FPS) and radius-based neighbour search:

cd modules
python setup.py install
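
For reference, furthest point sampling greedily keeps the point that is farthest from the already-selected set. A minimal NumPy sketch of the idea (illustrative only; the compiled CUDA kernel above is what the code actually uses):

# Pure-Python furthest point sampling sketch, much slower than the CUDA kernel.
import numpy as np

def furthest_point_sample(xyz, m):
    # xyz: [N, 3] points, m: number of samples to keep
    n = xyz.shape[0]
    selected = np.zeros(m, dtype=np.int64)
    dist = np.full(n, np.inf)
    selected[0] = 0                           # start from an arbitrary point
    for i in range(1, m):
        # distance from every point to the most recently selected point
        d = np.linalg.norm(xyz - xyz[selected[i - 1]], axis=1)
        dist = np.minimum(dist, d)            # distance to the selected set
        selected[i] = int(np.argmax(dist))    # pick the farthest remaining point
    return selected

points = np.random.rand(1024, 3)
idx = furthest_point_sample(points, 128)      # indices of 128 well-spread points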

To verify that the compilation succeeded, run python modules/pst_convolutions.py and check that a forward pass completes.

Install Mayavi for point cloud visualization (optional); a desktop environment is required.
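
Once Mayavi is available (for example via pip install mayavi), a single frame can be inspected with a few lines. A minimal sketch, assuming the frame is an N x 3 NumPy array:

# Visualize one point cloud frame with Mayavi (optional).
import numpy as np
from mayavi import mlab

frame = np.random.rand(2048, 3)  # stand-in for one frame of a sequence
mlab.points3d(frame[:, 0], frame[:, 1], frame[:, 2],
              mode="point", color=(0.2, 0.6, 1.0))
mlab.show()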

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{fan2021pstnet,
    title={PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences},
    author={Hehe Fan and Xin Yu and Yuhang Ding and Yi Yang and Mohan Kankanhalli},
    booktitle={International Conference on Learning Representations},
    year={2021}
}

Related Repos

  1. PointNet++ PyTorch implementation: https://github.com/facebookresearch/votenet/tree/master/pointnet2
  2. MeteorNet: https://github.com/xingyul/meteornet
  3. 3DV: https://github.com/3huo/3DV-Action

Comments
  • Default result only got 84 on MSRA

    Great work! I've read it thoroughly and can easily run your code.

    One issue: when I directly use the train-msr script and finish training, the final accuracy is 84 and 83 over two repeats (I believe with 16 frames). My repeats score lower than the paper. [screenshots of training logs]

    I think it might be because some parameters were changed in the script. Could you please check?

    opened by Jarrome 12
  • PSTNET on SemanticKitti/nuScenes dataset

    Great work on the point tubes. I am particularly interested in 4D semantic segmentation applications. I was wondering if you have tried PSTNet on benchmark datasets like SemanticKITTI or nuScenes. These point cloud sequences are much sparser than the SYNTHIA dataset.

    Thank you

    opened by sandeepnmenon 6
  • An Issue related to running the furthest_point_sampling function

    Dear Author,

    Hope you are well.

    I ran into an issue related to the pointnet2_utils.py and setup.py files. When I installed with setup.py, I got a warning related to the GCC version. [screenshot of the warning]

    Then, when running the highlighted part of setup.py, I got the error "Process finished with exit code 139 (interrupted by signal 11: SIGSEGV)". [screenshot]

    Just wondering if you happen to know what the issue is? Is it related to the g++ version?

    Our PyTorch version is 1.2.0.

    Many thanks! @hehefan

    opened by NeilCui6 4
  • Training on NTU 60 XSUB

    Thanks for your awesome work and generous code sharing. Recently, I have tried to reproduce the results on the NTU 60 XSUB benchmark with one 3090 GPU. However, the training seems really slow, and it would take more than a week to train 20 epochs with the original settings. I have tried enlarging the batch size, but it is still quite slow. What is a reasonable training time for the NTU 60 dataset? Is there anything I have missed?

    Looking forward to your reply.

    Thanks

    opened by erinchen824 3
  • Request for the MSR data set in npy format

    Hi Hehe,

    Can you please share the npy data files for the MSR dataset? It is a bit of a struggle for me to parse the bin data into npy. Hope you can help me.

    Best, Xin

    opened by Delusionx1 1
  • about usage

    Hi, @hehefan ,

    Thanks for releasing the package. Would you also release a basic step-by-step guide for the usage of PSTConv, e.g., data preparation, training steps, and prediction steps?

    Thanks~

    opened by amiltonwong 1
Owner

Hehe Fan, Research fellow at the National University of Singapore.