26 Python Caption Libraries

Gif-caption - A straightforward GIF Captioner written in Python

Broksy's GIF Captioner Have you ever wanted to easily caption a GIF without havi

3 Apr 9, 2022

To create a deep learning model which can explain the content of an image in the form of speech through caption generation with attention mechanism on Flickr8K dataset.

0 Feb 8, 2022

Automagically synchronize subtitles with video.

FFsubsync Language-agnostic automatic synchronization of subtitles with video, so that subtitles are aligned to the correct starting point within the

5.7k Jan 6, 2023

Automatic caption evaluation metric based on typicality analysis.

SeMantic and linguistic UndeRstanding Fusion (SMURF) Automatic caption evaluation metric described in the paper "SMURF: SeMantic and linguistic UndeRs

6 Jan 9, 2022

A unified framework to jointly model images, text, and human attention traces.

connect-caption-and-trace This repository contains the reference code for our paper Connecting What to Say With Where to Look by Modeling Human Attent

73 Oct 24, 2022

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Scan2Cap: Context-aware Dense Captioning in RGB-D Scans Introduction We introduce the task of dense captioning in 3D scans from commodity RGB-D sensor

79 Nov 7, 2022

Code accompanying the paper Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs (Chen et al., CVPR 2020, Oral).

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs This repository contains PyTorch implementation of our pa

178 Dec 29, 2022

Meshed-Memory Transformer for Image Captioning. CVPR 2020

M²: Meshed-Memory Transformer This repository contains the reference code for the paper Meshed-Memory Transformer for Image Captioning (CVPR 2020). Pl

422 Dec 28, 2022

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

30 Oct 28, 2022

Tensorflow implementation of soft-attention mechanism for video caption generation.

SA-tensorflow Tensorflow implementation of soft-attention mechanism for video caption generation. An example of soft-attention mechanism. The attentio

153 Nov 14, 2022

Image captioning - Tensorflow implementation of Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Introduction This neural system for image captioning is roughly based on the paper "Show, Attend and Tell: Neural Image Caption Generation with Visual

749 Dec 28, 2022

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

Conceptual 12M We introduce the Conceptual 12M (CC12M), a dataset with ~12 million image-text pairs meant to be used for vision-and-language pre-train

226 Dec 7, 2022

Neural Caption Generator with Attention

Neural Caption Generator with Attention Tensorflow implementation of "Show

510 Nov 30, 2022

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

FENSE The metric, Fluency ENhanced Sentence-bert Evaluation (FENSE), for audio caption evaluation, proposed in the paper "Can Audio Captions Be Evalua

13 Dec 23, 2022

Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Faster R-CNN pretrained on VisualGenome This repository modifies maskrcnn-benchmark for object detection and attribute prediction on VisualGenome data

7 Apr 20, 2021

Yet another video caption

5 May 26, 2022

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Official code for our Interspeech 2021 - Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset [1]*. Visually-grounded spoken language datasets c

3 Jan 26, 2022

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Official code for our Interspeech 2021 - Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset [1]*. Visually-grounded spoken language datasets c

3 Jan 26, 2022

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning

VisualGPT Our Paper VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning Main Architecture of Our VisualGPT Downloa

140 Dec 28, 2022

A Telegram Bot for adding Footer caption beside main caption of Telegram Channel Messages.

Footer-Bot A Telegram Bot for adding Footer caption beside main caption of Telegram Channel Messages. Best for Telegram Movie Channels. Made by @AbirH

35 Jan 2, 2023

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

TAP: Text-Aware Pre-training TAP: Text-Aware Pre-training for Text-VQA and Text-Caption by Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Flo

61 Nov 14, 2022

Advance Anonymous Sender bot with Caption Editor

AnonyMous Sender 👨‍💻 Advanced Anonymous Sender with Caption Editor Join @DaisySupport_Official 🎵 for help Features Get forwarded messages without f

13 Oct 9, 2022

A simple Telegram bot that can add caption to any media on your channel

Channel Auto Caption This bot can add a caption for any media/document sent to a channel. Just deploy bot and add bot as admin to a channel. Deploy to

22 Nov 14, 2022

A Telegram bot that add a dynamic caption to musics

Music Channel Manager A Telegram bot that add a dynamic caption to musics Deploy to Heroku What is it ? It manage your music channel. With just adding

13 Oct 18, 2022

Generate vector graphics from a textual caption

VectorAscent: Generate vector graphics from a textual description Example "a painting of an evergreen tree" python text_to_painting.py --prompt "a pai

97 Dec 15, 2022

Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

CLIP-GLaSS Repository for the paper Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search An in-browser demo is

172 Dec 22, 2022

Python Caption Resources

Python caption Libraries

Gif-caption - A straightforward GIF Captioner written in Python

To create a deep learning model which can explain the content of an image in the form of speech through caption generation with attention mechanism on Flickr8K dataset.

Automagically synchronize subtitles with video.

Automatic caption evaluation metric based on typicality analysis.

A unified framework to jointly model images, text, and human attention traces.

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Code accompanying the paper Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs (Chen et al., CVPR 2020, Oral).

Meshed-Memory Transformer for Image Captioning. CVPR 2020

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

Tensorflow implementation of soft-attention mechanism for video caption generation.

Image captioning - Tensorflow implementation of Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

Neural Caption Generator with Attention

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Yet another video caption

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning

A Telegram Bot for adding Footer caption beside main caption of Telegram Channel Messages.

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

Advance Anonymous Sender bot with Caption Editor

A simple Telegram bot that can add caption to any media on your channel

A Telegram bot that add a dynamic caption to musics

Generate vector graphics from a textual caption

Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

Python Caption Resources

Related tags

Python caption Libraries

Gif-caption - A straightforward GIF Captioner written in Python

To create a deep learning model which can explain the content of an image in the form of speech through caption generation with attention mechanism on Flickr8K dataset.

Automagically synchronize subtitles with video.

Automatic caption evaluation metric based on typicality analysis.

A unified framework to jointly model images, text, and human attention traces.

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Code accompanying the paper Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs (Chen et al., CVPR 2020, Oral).

Meshed-Memory Transformer for Image Captioning. CVPR 2020

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

Tensorflow implementation of soft-attention mechanism for video caption generation.

Image captioning - Tensorflow implementation of Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

Neural Caption Generator with Attention

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Yet another video caption

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning

A Telegram Bot for adding Footer caption beside main caption of Telegram Channel Messages.

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

Advance Anonymous Sender bot with Caption Editor

A simple Telegram bot that can add caption to any media on your channel

A Telegram bot that add a dynamic caption to musics

Generate vector graphics from a textual caption

Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search