Minimal Hand Pytorch
Unofficial PyTorch reimplementation of minimal-hand (CVPR 2020).
A demo video can also be found on YouTube and Bilibili.
This project reimplements the following components:
- Training (DetNet) and Evaluation Code
- Shape Estimation
- Pose Estimation: instead of the IKNet in the original paper, an analytical inverse kinematics method is used.
Official project link: [minimal-hand]
Update
- 2021/03/09: updated utils/LM.py; time cost dropped from 12s/item to 1.57s/item.
- 2021/03/12: updated utils/LM.py; time cost dropped from 1.57s/item to 0.27s/item.
- 2021/03/17: real-time performance achieved when using PSO to estimate shape; coming soon.
- 2021/03/20: added PSO to estimate shape. AUC decreases by about 0.01 on the STB and RHD datasets and increases slightly on the EO and DO datasets. Modified utils/vis.py to improve real-time performance.
- 2021/03/24: fixed some errors in calculating AUC. Updated the 3D PCK AUC Difference table.
Usage
- Retrieve the code
git clone https://github.com/MengHao666/Minimal-Hand-pytorch
cd Minimal-Hand-pytorch
- Create and activate the virtual environment with Python dependencies
conda env create --file=environment.yml
conda activate minimal-hand-torch
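After activating the environment, you can optionally confirm that PyTorch imports and whether CUDA is visible. This is just a quick sanity check, not part of the original instructions:

```python
# Optional sanity check for the conda environment: verify the PyTorch install
# and report whether a CUDA device is visible for training/inference.
import torch

print("torch version :", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
```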
Prepare MANO hand model
- Download the MANO model from here and unzip it.
- Create an account by clicking Sign Up and provide your information.
- Download Models and Code (the downloaded file should have the format mano_v*_*.zip). Note that all code and data from this download fall under the MANO license.
- Unzip and copy the contents of the models folder into the mano folder.
- Your structure should look like this (a quick loading check follows the tree):
Minimal-Hand-pytorch/
mano/
models/
webuser/
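To confirm the MANO files are in place, you can optionally instantiate a MANO layer with manopth (the library this project's MANO layer was adapted from; see Acknowledgement). This is a minimal sketch using manopth-style default arguments, which may differ from the settings this repo uses internally:

```python
# Minimal MANO sanity check via manopth, assuming the model .pkl files were
# copied into mano/models/ as described above. Arguments mirror manopth
# defaults and are not necessarily the ones this project uses internally.
import torch
from manopth.manolayer import ManoLayer

mano_layer = ManoLayer(mano_root='mano/models', use_pca=False, flat_hand_mean=True)

pose = torch.zeros(1, 48)   # 3 global + 45 per-joint axis-angle parameters
shape = torch.zeros(1, 10)  # 10 MANO shape (beta) parameters
verts, joints = mano_layer(pose, shape)
print(verts.shape, joints.shape)  # 778 mesh vertices; joint count depends on the manopth version
```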
Download and Prepare datasets
Training dataset
- CMU HandDB part1; part2
- Rendered Handpose Dataset
- GANerated Hands Dataset
Evaluation dataset
- STB_supp: for license reasons, the download link can be found in bihand
- DO_supp: Google Drive or Baidu Pan (s892)
- EO_supp: Google Drive or Baidu Pan (axkm)
Processing
- Create a data directory and extract all of the above datasets and additional materials into it.
Now your data folder structure should look like this (a quick layout check is sketched after the tree):
data/
CMU/
hand143_panopticdb/
datasets/
...
hand_labels/
datasets/
...
RHD/
RHD_published_v2/
evaluation/
training/
view_sample.py
...
GANeratedHands_Release/
data/
...
STB/
images/
B1Counting/
SK_color_0.png
SK_depth_0.png
SK_depth_seg_0.png <-- merged from STB_supp
...
...
labels/
B1Counting_BB.mat
...
dexter+object/
calibration/
bbox_dexter+object.csv
DO_pred_2d.npy
data/
Grasp1/
annotations/
Grasp13D.txt
my_Grasp13D.txt
...
...
Grasp2/
annotations/
Grasp23D.txt
my_Grasp23D.txt
...
...
Occlusion/
annotations/
Occlusion3D.txt
my_Occlusion3D.txt
...
...
Pinch/
annotations/
Pinch3D.txt
my_Pinch3D.txt
...
...
Rigid/
annotations/
Rigid3D.txt
my_Rigid3D.txt
...
...
Rotate/
annotations/
Rotate3D.txt
my_Rotate3D.txt
...
...
EgoDexter/
preview/
data/
Desk/
annotation.txt_3D.txt
my_annotation.txt_3D.txt
...
Fruits/
annotation.txt_3D.txt
my_annotation.txt_3D.txt
...
Kitchen/
annotation.txt_3D.txt
my_annotation.txt_3D.txt
...
Rotunda/
annotation.txt_3D.txt
my_annotation.txt_3D.txt
...
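Before training, you can optionally verify that the top-level directories from the tree above exist. The snippet below is only an illustrative layout check, not part of the project's code:

```python
# Illustrative check that the expected dataset directories described above exist.
from pathlib import Path

data_root = Path("data")
expected = [
    "CMU/hand143_panopticdb",
    "CMU/hand_labels",
    "RHD/RHD_published_v2",
    "GANeratedHands_Release/data",
    "STB/images",
    "STB/labels",
    "dexter+object/data",
    "EgoDexter/data",
]
for rel in expected:
    status = "ok" if (data_root / rel).is_dir() else "MISSING"
    print(f"{status:8s} {data_root / rel}")
```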
Note
- All code and data from these downloads fall under their own licenses.
- DO refers to the "dexter+object" dataset; EO refers to the "EgoDexter" dataset.
- DO_supp and EO_supp are modified from the original datasets.
- DO_pred_2d.npy contains the 2D predictions from the 2D part of DetNet (a minimal loading example follows this list).
- Some labels of DO and EO are obviously wrong when projected into the image plane (you can find examples with the original labels in dexter_object.py or egodexter.py), and thus should be omitted. This is why my_{}3D.txt and my_annotation.txt_3D.txt are provided.
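As a hedged example, the bundled 2D predictions can be inspected like this (the exact array layout is not documented here, so check it before relying on it):

```python
# Inspect the bundled 2D DetNet predictions for the dexter+object dataset.
# The array layout (e.g. frames x joints x 2) is an assumption; verify it first.
import numpy as np

do_pred_2d = np.load("data/dexter+object/DO_pred_2d.npy")
print(do_pred_2d.shape, do_pred_2d.dtype)
```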
Download my Results
- my_results: Google Drive or Baidu Pan (2rv7); extract it in the project folder
- The parameters used in the real-time demo can be found at google_drive or baidu (un06). They are provided by Hand-BMC-pytorch.
Realtime demo
python demo.py
DetNet Training and Evaluation
Run the training code
python train_detnet.py --data_root data/
Run the evaluation code
python train_detnet.py --data_root data/ --datasets_test testset_name_to_test --evaluate --evaluate_id checkpoints_id_to_load
or use my results
python train_detnet.py --checkpoint my_results/checkpoints --datasets_test "rhd" --evaluate --evaluate_id 106
python train_detnet.py --checkpoint my_results/checkpoints --datasets_test "stb" --evaluate --evaluate_id 71
python train_detnet.py --checkpoint my_results/checkpoints --datasets_test "do" --evaluate --evaluate_id 68
python train_detnet.py --checkpoint my_results/checkpoints --datasets_test "eo" --evaluate --evaluate_id 101
Shape Estimation
Run the shape optimization code. This can be very time consuming when the weight parameter is quite small.
python optimize_shape.py --weight 1e-5
or use my results
python optimize_shape.py --path my_results/out_testset/
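For intuition, the shape optimization fits the MANO shape parameters so that the hand model's bone lengths match those implied by the DetNet joint predictions. The sketch below shows that kind of objective, assuming the --weight argument plays the role of the regularization coefficient λ on the shape vector β (a smaller λ gives a weaker prior and typically a longer optimization); it is not the exact formulation in optimize_shape.py:

```latex
% Sketch of a bone-length fitting objective with an L2 prior on the MANO shape \beta.
% \ell_k(\beta): k-th bone length of the MANO model; \hat{\ell}_k: length implied by DetNet.
E(\beta) = \sum_{k} \left( \ell_k(\beta) - \hat{\ell}_k \right)^2 + \lambda \lVert \beta \rVert_2^2
```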
Pose Estimation
Run the following code, which uses an analytical inverse kinematics method.
python aik_pose.py
or use my results
python aik_pose.py --path my_results/out_testset/
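Conceptually, the analytical IK step recovers joint rotations by aligning each bone direction of the template (rest) pose with the corresponding predicted bone direction. The sketch below shows that single-bone alignment with NumPy/SciPy; it is a simplified illustration, not the actual utils/AIK.py, which walks the whole kinematic chain and handles parent/twist rotations:

```python
# Illustrative single-bone alignment: the axis-angle rotation mapping a template
# bone direction onto a predicted bone direction (the core idea behind analytical IK).
import numpy as np
from scipy.spatial.transform import Rotation as R

def align_bone(template_dir, predicted_dir, eps=1e-8):
    t = template_dir / (np.linalg.norm(template_dir) + eps)
    p = predicted_dir / (np.linalg.norm(predicted_dir) + eps)
    axis = np.cross(t, p)                      # rotation axis (unnormalized)
    sin_a = np.linalg.norm(axis)
    cos_a = np.clip(np.dot(t, p), -1.0, 1.0)
    if sin_a < eps:                            # parallel (or anti-parallel) bones
        return np.zeros(3)
    angle = np.arctan2(sin_a, cos_a)
    return axis / sin_a * angle                # axis-angle (rotation vector)

rotvec = align_bone(np.array([0.0, 1.0, 0.0]), np.array([1.0, 1.0, 0.0]))
print(R.from_rotvec(rotvec).apply([0.0, 1.0, 0.0]))  # ~ [0.707, 0.707, 0.0]
```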
DetNet training and evaluation curve
Run the following code to see my results
python plot.py --path my_results/out_loss_auc
(AUC means 3D PCK, and ACC_HM means 2D PCK)
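For reference, 3D PCK at a threshold t is the fraction of predicted joints whose Euclidean error is below t, and the reported AUC averages PCK over a range of thresholds. The generic sketch below illustrates the computation; it is not the project's utils/eval/zimeval.py, and the 20-50 mm range is only a common choice that may differ from the one used here:

```python
# Generic 3D PCK / AUC sketch: PCK(t) is the fraction of per-joint errors below
# threshold t; the AUC here is the mean PCK over uniformly spaced thresholds.
import numpy as np

def pck_auc(pred, gt, thresholds_mm=np.linspace(20, 50, 31)):
    # pred, gt: (N, J, 3) joint positions in millimetres
    errors = np.linalg.norm(pred - gt, axis=-1).reshape(-1)
    pck = np.array([(errors <= t).mean() for t in thresholds_mm])
    auc = pck.mean()  # approximates the normalized area under the PCK curve
    return pck, auc

pred = np.random.randn(4, 21, 3) * 10.0
gt = pred + np.random.randn(4, 21, 3) * 5.0
_, auc = pck_auc(pred, gt)
print(f"AUC(20-50 mm) = {auc:.4f}")
```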
3D PCK AUC Difference
(* denotes this project)
Dataset | DetNet(paper) | DetNet(*) | DetNet+IKNet(paper) | DetNet+LM+AIK(*) | DetNet+PSO+AIK(*) |
---|---|---|---|---|---|
RHD | - | 0.9339 | 0.856 | 0.9301 | 0.9310 |
STB | 0.891 | 0.8744 | 0.898 | 0.8647 | 0.8671 |
DO | 0.923 | 0.9378 | 0.948 | 0.9392 | 0.9342 |
EO | 0.804 | 0.9270 | 0.811 | 0.9288 | 0.9277 |
Note
- Carefully tuning the training parameters, training for longer, using a more complicated network, or adding Biomechanical Constraint Losses could further boost accuracy.
- As there is no complete official open-source release of the original paper, the above comparison is somewhat rough.
Citation
This is the unofficial pytorch reimplementation of the paper "Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data" (CVPR 2020).
If you find the project helpful, please star this project and cite them:
@inproceedings{zhou2020monocular,
title={Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data},
author={Zhou, Yuxiao and Habermann, Marc and Xu, Weipeng and Habibie, Ikhsanul and Theobalt, Christian and Xu, Feng},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={0--0},
year={2020}
}
Acknowledgement
- Code of the MANO PyTorch layer was adapted from manopth.
- Code for evaluating hand PCK and AUC in utils/eval/zimeval.py was adapted from hand3d.
- Part of the code for data augmentation, dataset parsing, and utils was adapted from bihand and 3D-Hand-Pose-Estimation.
- Code of the network model was adapted from Minimal-Hand.
- @Mrsirovo for the starter code of utils/LM.py; @maitetsu updated it later.
- @maitetsu for the starter code of utils/AIK.py.