Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Last update: Dec 9, 2022

Overview

Character in Story Identification Network (CiSIN)

This project hosts the code for our paper.

Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung, and Gunhee Kim. Character Grounding and Re-Identification in Story of Videos and Text Descriptions. In ECCV (spotlight), 2020.

This project is a winning solution to the LSMDC 2019 "Fill-in the Characters" task. For more information about the challenge, visit the Large Scale Movie Description Challenge (LSMDC) 2019 website.

Reference

If you use this code as part of any published research, please cite the following paper:

@inproceedings{yu:2020:ECCV,
    title="{Character Grounding and Re-Identification in Story of Videos and Text Descriptions}",
    author={Yu, Youngjae and Kim, Jongseok and Yun, Heeseung and Chung, Jiwan and Kim, Gunhee},
    booktitle={ECCV},
    year=2020
}

System Requirements

The following dependencies should be installed:

Python 3.6
PyTorch 1.4.0
torchvision 0.5.0
A GPU that supports CUDA 10.0 and has at least 12 GB of memory
See requirements.txt for more details.
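
The requirements above can be checked before training with a short script. The following is only a sketch and is not part of this repository: the helper names parse_version and check_environment are hypothetical, and it assumes the listed versions are meant as exact pins, which this README does not state. It reports mismatches rather than enforcing anything.

```python
import re
import sys

# Values copied from the dependency list above.
REQUIRED_PYTHON = (3, 6)
REQUIRED_TORCH = "1.4.0"
REQUIRED_TORCHVISION = "0.5.0"
REQUIRED_CUDA = "10.0"
MIN_GPU_MEMORY_GB = 12  # compared in decimal gigabytes (1e9 bytes)


def parse_version(text):
    """Turn a string such as '1.4.0+cu100' into (1, 4, 0)."""
    core = text.split("+")[0]  # drop a local build suffix, if any
    parts = []
    for piece in core.split(".")[:3]:
        match = re.match(r"\d+", piece)  # keep only the leading digits
        parts.append(int(match.group()) if match else 0)
    return tuple(parts)


def check_environment():
    """Return a list of mismatch messages; an empty list means none were found."""
    problems = []
    if sys.version_info[:2] != REQUIRED_PYTHON:
        found = "%d.%d" % sys.version_info[:2]
        problems.append("Python %s found, 3.6 listed" % found)

    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
        return problems
    if parse_version(torch.__version__) != parse_version(REQUIRED_TORCH):
        problems.append("PyTorch %s found, %s listed" % (torch.__version__, REQUIRED_TORCH))

    try:
        import torchvision
        if parse_version(torchvision.__version__) != parse_version(REQUIRED_TORCHVISION):
            problems.append(
                "torchvision %s found, %s listed" % (torchvision.__version__, REQUIRED_TORCHVISION)
            )
    except ImportError:
        problems.append("torchvision is not installed")

    if not torch.cuda.is_available():
        problems.append("No CUDA-capable GPU is visible to PyTorch")
        return problems
    # torch.version.cuda is the CUDA version PyTorch was built against.
    if torch.version.cuda != REQUIRED_CUDA:
        problems.append("PyTorch built for CUDA %s, %s listed" % (torch.version.cuda, REQUIRED_CUDA))
    total_gb = torch.cuda.get_device_properties(0).total_memory / 1e9
    if total_gb < MIN_GPU_MEMORY_GB:
        problems.append("GPU 0 has %.1f GB, at least %d GB listed" % (total_gb, MIN_GPU_MEMORY_GB))
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["No mismatches found"]:
        print(line)
```

A reported mismatch does not necessarily mean training will fail. It only flags a difference from the environment listed above.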

Data Setup

Coming soon.

CiSIN

To train our model, run:

python train.py

Acknowledgement

We thank SNUVL lab members for helpful comments. This research was supported by Seoul National University, the Brain Research Program of the National Research Foundation of Korea (NRF) (2017M3C7A1047860), and the AIR Lab (AI Research Lab) at Hyundai Motor Company through the HMC-SNU AI Consortium Fund.

License

See LICENSE.md.
