10 Python Mscoco Libraries

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

bottom-up-attention: This code implements a bottom-up attention model, based on multi-GPU training of Faster R-CNN with ResNet-101, using object and at...

1.3k Jan 9, 2023

Show-attend-and-tell - TensorFlow Implementation of "Show, Attend and Tell"

Show, Attend and Tell. Update (December 2, 2016): TensorFlow implementation of Show, Attend and Tell: Neural Image Caption Generation with Visual Attent...

902 Nov 29, 2022

TensorFlow Implementation of "Show, Attend and Tell"

Show, Attend and Tell. Update (December 2, 2016): TensorFlow implementation of Show, Attend and Tell: Neural Image Caption Generation with Visual Attent...

902 Nov 29, 2022

This repository contains the source code of our work on designing efficient CNNs for computer vision

Efficient networks for Computer Vision: This repo contains the source code of our work on designing efficient networks for different computer vision tasks:

386 Nov 26, 2022

Boundary-preserving Mask R-CNN (ECCV 2020)

BMaskR-CNN: This code is developed on Detectron2. Boundary-preserving Mask R-CNN, ECCV 2020. Tianheng Cheng, Xinggang Wang, Lichao Huang, Wenyu Liu. Video...

178 Nov 28, 2022

Simple Baselines for Human Pose Estimation and Tracking

Simple Baselines for Human Pose Estimation and Tracking. News: Our new work, High-Resolution Representations for Labeling Pixels and Regions, is available...

2.7k Jan 5, 2023

A PyTorch toolkit for 2D Human Pose Estimation.

PyTorch-Pose: PyTorch-Pose is a PyTorch implementation of the general pipeline for 2D single human pose estimation. The aim is to provide the interface...

1.1k Dec 30, 2022

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Swin Transformer for Object Detection: This repo contains the supported code and configuration files to reproduce object detection results of Swin Tran...

1.4k Dec 30, 2022

SWA Object Detection

SWA Object Detection: This project hosts the scripts for training SWA object detectors, as presented in our paper: @article{zhang2020swa, title={SWA...

237 Nov 28, 2022

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Deep High-Resolution Representation Learning for Human Pose Estimation (CVPR 2019). News [2020/07/05]: A very nice blog from Towards Data Science introd...

3.9k Jan 5, 2023
