I am using this codebase as a tool for my own research and have made some modifications to the original repository for my own purposes.

Andrew Jong

Last update: Nov 10, 2021

Related tags

Deep Learning LIP_JPPNet-for-SwapNet

Overview

For SwapNet

Create a list.txt file containing all the images to process. This can be done with the GNU find command:

find path/to/input/folder -name '*.jpg' -o -name '*.png' > list.txt

Then run the following to produce the clothing segmentations:

python evaluate_parsing_JPPNet-s2.py -d path/to/texture -l path/to/list.txt -o path/to/clothing
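Where GNU find is not available, the list file can also be built from Python. The following is a minimal sketch, not part of the repository: the function name `write_image_list` is hypothetical, and it assumes the evaluation script accepts one path per line in the same form that the find command prints (each path prefixed with the input folder). Unlike find, it sorts the paths and skips directories.

```python
from pathlib import Path

# Same two extensions that the find command above matches (case-sensitive).
IMAGE_SUFFIXES = {".jpg", ".png"}


def write_image_list(input_dir, list_path):
    """Write one image path per line to list_path and return the paths.

    Hypothetical helper: mirrors the output of
    find input_dir -name '*.jpg' -o -name '*.png' > list_path
    except that paths are sorted and only regular files are listed.
    """
    root = Path(input_dir)
    paths = sorted(
        str(p)
        for p in root.rglob("*")
        if p.is_file() and p.suffix in IMAGE_SUFFIXES
    )
    # One path per line, with a trailing newline after the last entry.
    Path(list_path).write_text("".join(f"{p}\n" for p in paths))
    return paths
```

For example, `write_image_list("path/to/input/folder", "list.txt")` produces a list.txt that can be passed to the `-l` option shown above.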

Joint Body Parsing & Pose Estimation Network (JPPNet)

Xiaodan Liang, Ke Gong, Xiaohui Shen, and Liang Lin, "Look into Person: Joint Body Parsing & Pose Estimation Network and A New Benchmark", T-PAMI 2018.

Introduction

JPPNet is a state-of-the-art deep learning method for human parsing and pose estimation built on top of TensorFlow.

This novel joint human parsing and pose estimation network incorporates the multiscale feature connections and iterative location refinement in an end-to-end framework to investigate efficient context modeling and then enable parsing and pose tasks that are mutually beneficial to each other. This unified framework achieves state-of-the-art performance for both human parsing and pose estimation tasks.

This distribution provides a publicly available implementation of the key model ingredients reported in our latest paper, which has been accepted by T-PAMI 2018.

We simplify the network to solve human parsing by exploring a novel self-supervised structure-sensitive learning approach, which imposes human pose structures into the parsing results without resorting to extra supervision. There is also a public implementation of this self-supervised structure-sensitive JPPNet (SS-JPPNet).

Look into People (LIP) Dataset

The SSL is trained and evaluated on our LIP dataset for human parsing. Please check it for more model details. The dataset is also available on Google Drive and Baidu Drive.

Pre-trained models

We have released our trained JPPNet models for the LIP dataset on Google Drive and Baidu Drive.

Inference

Download the pre-trained model and store it in $HOME/checkpoint.
Prepare the images and store them in $HOME/datasets.
Run evaluate_pose_JPPNet-s2.py for pose estimation and evaluate_parsing_JPPNet-s2.py for human parsing.
The results are saved in $HOME/output.
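Before running the evaluation scripts, it can help to confirm that the folders named in the steps above are in place. The sketch below is an illustration only: `check_inference_layout` is a hypothetical helper, and the root folder is passed in explicitly because this README does not say whether `$HOME` means the user's home directory or the project folder.

```python
from pathlib import Path


def check_inference_layout(root):
    """Return the names of required folders missing under root.

    Hypothetical helper. The folder names come from the steps above;
    what root should be (the meaning of $HOME) is left to the caller.
    """
    root = Path(root)
    missing = [
        name for name in ("checkpoint", "datasets")
        if not (root / name).is_dir()
    ]
    if not missing:
        # Create the results folder only when the inputs are in place.
        (root / "output").mkdir(exist_ok=True)
    return missing
```

An empty return value means both input folders exist and the output folder has been created if it was absent.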

Training

Download the pre-trained model and store it in $HOME/checkpoint.
Download the LIP dataset or prepare your own data and store it in $HOME/datasets.
For LIP dataset, we have provided images, parsing labels, lists and the left-right flipping labels (labels_rev) for data augmentation. You need to generate the heatmaps of pose labels. We have provided a script for reference.
Run train_JPPNet-s2.py to train the JPPNet with two refinement stages.
Use evaluate_pose_JPPNet-s2.py and evaluate_parsing_JPPNet-s2.py to generate the results or evaluate the trained models.
Note that the LIPReader class only suits LIP labels when applying the left-right flipping augmentation. To train on other datasets with different labels, you may have to rewrite the image reader class.
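Two of the data-preparation details in the steps above, pose heatmaps and left-right flipped labels, can be sketched in plain Python. This is an illustration under stated assumptions, not the reference script or the LIPReader implementation: the function names, the Gaussian form and `sigma` value of the heatmap, and the left/right class pairs (14/15 arms, 16/17 legs, 18/19 shoes, assumed from the 20-class LIP label set) should all be verified against the provided script and your own labels.

```python
import math

# Assumed left/right class-id pairs for the 20-class LIP parsing labels:
# arms (14, 15), legs (16, 17), shoes (18, 19). Verify before use.
LEFT_RIGHT_PAIRS = [(14, 15), (16, 17), (18, 19)]


def gaussian_heatmap(height, width, joint_x, joint_y, sigma=2.0):
    """Return a height x width grid (list of rows) peaking at the joint.

    Hypothetical helper: a 2D Gaussian centered on (joint_x, joint_y).
    The sigma default is an arbitrary choice for illustration.
    """
    return [
        [
            math.exp(
                -((x - joint_x) ** 2 + (y - joint_y) ** 2)
                / (2.0 * sigma ** 2)
            )
            for x in range(width)
        ]
        for y in range(height)
    ]


def flip_labels(label_map, pairs=LEFT_RIGHT_PAIRS):
    """Mirror a label map horizontally and swap left/right class ids.

    Hypothetical helper: label_map is a list of rows of integer class ids.
    """
    swap = {}
    for left, right in pairs:
        swap[left] = right
        swap[right] = left
    # Reverse each row (horizontal mirror), then exchange paired ids.
    return [[swap.get(v, v) for v in reversed(row)] for row in label_map]
```

When training on a dataset with different labels, the `pairs` argument is where the left/right mapping would change, which is the reason a dataset-specific reader is needed for this augmentation.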

Citation

If you use this code for your research, please cite our papers.

@article{liang2018look,
  title={Look into Person: Joint Body Parsing \& Pose Estimation Network and a New Benchmark},
  author={Liang, Xiaodan and Gong, Ke and Shen, Xiaohui and Lin, Liang},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2018},
  publisher={IEEE}
}

@InProceedings{Gong_2017_CVPR,
  author = {Gong, Ke and Liang, Xiaodan and Zhang, Dongyu and Shen, Xiaohui and Lin, Liang},
  title = {Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {July},
  year = {2017}
}
