856 Repositories
Python video-retrieval Libraries
A PyTorch implementation of VIOLET
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling A PyTorch implementation of VIOLET Overview VIOLET is an implementati
CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss
CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss This is the official implementation of "
Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). Uses metric learning and FAISS for fast similarity search on GPU
Cross-modal Retrieval using Transformer Encoder Reasoning Networks This project reimplements the idea from "Transformer Reasoning Network for Image-Te
YouTube-Downloader - YouTube Video Downloader made using Python
YouTube-Downloader YouTube Video Downloader made using Python.
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP The implementation of paper CLIP2Video: Mastering Video-Text Retrieval via Image CLIP. CLIP2
RETRO-pytorch - Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
RETRO - Pytorch (wip) Implementation of RETRO, Deepmind's Retrieval based Attent
An official PyTorch Implementation of Boundary-aware Self-supervised Learning for Video Scene Segmentation (BaSSL)
An official PyTorch Implementation of Boundary-aware Self-supervised Learning for Video Scene Segmentation (BaSSL)
Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control.
Pose Detection Project Description: Human pose estimation from video plays a critical role in various applications such as quantifying physical exerci
Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting
Decoupled Spatial-Temporal Transformer for Video Inpainting By Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, J
Use CLIP to represent video for Retrieval Task
A Straightforward Framework For Video Retrieval Using CLIP This repository contains the basic code for feature extraction and replication of results.
Collects various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and related datasets
The repository collects various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and related datasets. It also collects many useful tutorials and tools in these related domains.
An anime themed telegram bot that can convert telegram media.
ShoukoKomiRobot • Written in Python3 • Library used: Pyrogram • Software used: Ebook-convert Deploy
Play Video & Music on Telegram Group Video Chat
Video Stream is an advanced Telegram bot that allows you to play video and music in Telegram group video chats. Get SESSION_NAME from below: Pyrogram
ONNX inference sample for YOLOP in Python
YOLOP-ONNX-Video-Inference-Sample An ONNX inference sample for YOLOP in Python. The ONNX models are taken from hustvl/YOLOP/weights. Requirement OpenCV 3.4.2 or later onnxruntime 1.
A Blender addon for VSE that auto-adjusts a video strip's length when a speed effect is applied.
Blender VSE Speed Adjust Addon When using Video Sequence Editor in Blender, the speed effect strip doesn't auto-adjust clip length when changing its
Neighbourhood Retrieval (Nearest Neighbours) with Distance Correlation.
Neighbourhood Retrieval with Distance Correlation Assign Pseudo class labels to datapoints in the latent space. NNDC is a slim wrapper around FAISS. N
Multi-query Video Retrieval
Multi-query Video Retrieval
Continual Learning of Long Topic Sequences in Neural Information Retrieval
ContinualPassageRanking Repository for the paper "Continual Learning of Long Topic Sequences in Neural Information Retrieval". In this repository you
This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).
Motion-Focused Contrastive Learning of Video Representations Introduction This is the code for the paper "Motion-Focused Contrastive Learning of Video
A demo app for use in video streaming applications
MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks MoViDNN is an Android application that can be used to ev
Code for ECIR'20 paper Diagnosing BERT with Retrieval Heuristics
Bert Axioms This is the repository with the code for the Paper Diagnosing BERT with Retrieval Heuristics Required Data In order to run this code, you
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation By Qiang Zhou*, Zilong Huang*, Lichao Huang, Han Shen, Yon
MaskTrackRCNN for video instance segmentation based on mmdetection
MaskTrackRCNN for video instance segmentation Introduction This repo serves as the official code release of the MaskTrackRCNN model for video instance
Vigia-youtube - The YouTube Watch bot is able to monitor channels on Google's video platform
Vigia do YouTube The Vigia do YouTube bot is able to monitor channels on the platform
GEGVL: Google Earth Based Geoscience Video Library
Google Earth Based Geoscience Video Library is transitioning to a server-based version. The
Video stream recording dockerized server using python/ffmpeg.
Stream Recording Server Video stream recording dockerized server using python/ffmpeg. Usage Configuration Prepare .env file, check .env.example for th
Eff video representation - Efficient video representation through neural fields
Neural Residual Flow Fields for Efficient Video Representations 1. Download MPI
YouTube Video publisher using youtube-dl & ROS2
YouTube-publisher-ROS2 Publish sensor_msgs/Image from "YouTube"! You don't have to use a webcam or your own video to check demos. Purpose Quick d
Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
Lbl2Vec Lbl2Vec is an algorithm for unsupervised document classification and unsupervised document retrieval. It automatically generates jointly embed
A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval
CLIP4CMR A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval The original data and pre-calculate
Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages.
Video Games Web Scraper Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages. This
Depth-Aware Video Frame Interpolation (CVPR 2019)
DAIN (Depth-Aware Video Frame Interpolation) Project | Paper Wenbo Bao, Wei-Sheng Lai, Chao Ma, Xiaoyun Zhang, Zhiyong Gao, and Ming-Hsuan Yang IEEE C
A platform which gives you info about the newest video on a channel
youtube A platform which gives you info about the newest video on a channel. This uses web scraping; a better implementation would be to use the API. BR
Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline
MUGE Multimodal Retrieval Baseline This repo is implemented based on the open_cl
Official Implementation of ReferFormer
The official implementation of the paper: Language as Queries for Referring Video Object Segmentation Language as Queries for Referring Video Object S
This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' published at ECIR'22.
Paragraph Aggregation Retrieval Model (PARM) for Dense Document-to-Document Retrieval This repository contains the code for the paper PARM: A Paragrap
Video lie detector using xgboost - A video lie detector using OpenFace and xgboost
video_lie_detector_using_xgboost a video lie detector using OpenFace and xgboost
A simple Python script for downloading MP4 videos from YouTube.
A simple Python script for downloading MP4 videos from YouTube. It uses the third-party pytube package. Many apps and websites already do the same thing, so this is just an experiment to study how to do it in Python.
In this project, we will be blurring the background in a live video feed
In this project, we will be blurring the background in a live video feed. This can be further integrated into online meetings, streams, etc.
Jupyter notebook and datasets from the pandas Q&A video series
Python pandas Q&A video series Read about the series, and view all of the videos on one page: Easier data analysis in Python with pandas. Jupyter Note
Course materials and handouts for #100DaysOfCode in Python course
#100DaysOfCode with Python course Course details page: talkpython.fm/100days Course Summary #100DaysOfCode in Python is your perfect companion to take
FMA: A Dataset For Music Analysis
FMA: A Dataset For Music Analysis Michaël Defferrard, Kirell Benzi, Pierre Vandergheynst, Xavier Bresson. International Society for Music Information
Python package for Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit comparable hash-value for any video.
The Python package for near duplicate video detection Introduction Videohash is a Python package for detecting near-duplicate videos (Perceptual Vi
A High-Quality Real Time Upscaler for Anime Video
Anime4K Anime4K is a set of open-source, high-quality real-time anime upscaling/denoising algorithms that can be implemented in any programming langua
VarCNN is a convolutional neural network-based approach to automating the Video Assistant Referee (VAR) in football.
VarCnn: The Deep Learning Powered VAR
deepstream python rtsp video h264 or gstreamer python rtsp h264 | h264
deepstream python rtsp video h264 or gstreamer python rtsp h264 | h264 deepstrea
VCPlayerBot - Telegram bot to stream videos in Telegram voice chats for both groups and channels. Supports live streams, YouTube videos, and Telegram media
VCPlayerBot Telegram bot to stream videos in telegram voicechat for both groups
Measures input lag without dedicated hardware, performing motion detection on recorded or live video
What is InputLagTimer? This tool can measure input lag by analyzing a video where both the game controller and the game screen can be seen on a webcam
Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.
Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers. Cherche is meant to be used with small to medium sized corpora. Cherche's main strength is its ability to build diverse and end-to-end pipelines.
A curated list of the top 10 computer vision papers in 2021 with video demos, articles, code and paper reference.
The Top 10 Computer Vision Papers of 2021 The top 10 computer vision papers in 2021 with video demos, articles, code, and paper reference. While the w
Automatic Video Library Manager for TV Shows
Automatic Video Library Manager for TV Shows. It watches for new episodes of your favorite shows, and when they are posted it does its magic. Dependen
Social Fabric: Tubelet Compositions for Video Relation Detection
Social-Fabric Social Fabric: Tubelet Compositions for Video Relation Detection This repository contains the code and results for the following paper:
Code for Domain Adaptive Video Segmentation via Temporal Consistency Regularization in ICCV 2021
Domain Adaptive Video Segmentation via Temporal Consistency Regularization Updates 08/2021: check out our domain adaptation for semantic segmentation p
Implementation of Cross-category Video Highlight Detection via Set-based Learning (ICCV 2021).
Cross-category Video Highlight Detection via Set-based Learning Introduction This project is an implementation of ``Cross-category Video Highlight Det
A new codebase for Group Activity Recognition. It contains code for the ICCV 2021 paper Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.
Spatio-Temporal Dynamic Inference Network for Group Activity Recognition The source code for the ICCV 2021 paper: Spatio-Temporal Dynamic Inference Networ
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
PDVC Official implementation for End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021) [paper] [VALSE paper digest (Chinese)] This repo supports:
Video Stream: an advanced Telegram bot that allows you to play video and music in Telegram group video chats
Video Stream is an advanced Telegram bot that allows you to play Video & Music
3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)
3DDUNET This is the code for 3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021) Conference Paper Link Dataset We use SMOID dataset
Video-to-GIF-Converter - A small code snippet that can be used to convert any video to a gif
Video to GIF Converter Project Description: This is a small code snippet that ca
Video Bot: an advanced Telegram bot that allows you to play video and music in Telegram group video chats
Video Bot is an advanced Telegram bot that allows you to play Video & Music on
Automatic 2D-to-3D Video Conversion with CNNs
Deep3D: Automatic 2D-to-3D Video Conversion with CNNs How To Run To run this code, please install MXNet following the official documentation. Deep3D requir
Automagically synchronize subtitles with video.
FFsubsync Language-agnostic automatic synchronization of subtitles with video, so that subtitles are aligned to the correct starting point within the
This is a Telegram video compress bot repo. By Binary Tech
This is a Telegram Video Compress Bot. Product by Binary Tech. Features Compresses videos and generates screenshots too. You can set custom video name
Cvplayer - A simple video player written in python using ffpyplayer and OpenCV
Video Player cvplayer is a minimal wrapper around the ffpyplayer.MediaPlayer cla
MoCoPnet - Deformable 3D Convolution for Video Super-Resolution
Deformable 3D Convolution for Video Super-Resolution Pytorch implementation of l
This is a TG Video Compress Bot. Product by BINARY Tech
Video Compressor Bot Product by BINARY Tech Deploy to Heroku The Hard Way virtualenv -p python3 VENV . ./VENV/bin/activate pip install -r requireme
Establishing Strong Baselines for TripClick Health Retrieval; ECIR 2022
TripClick Baselines with Improved Training Data Welcome to the hub-repo of our paper: Establishing Strong Baselines for TripClick Health Retrieval
Referring Video Object Segmentation
Awesome-Referring-Video-Object-Segmentation Stars, comments, and sharing are welcome! - 2021.12.12: Recent papers (from 2021) - welcome to ad
Meteor scan - Scan through video for meteors
meteor_scan Scan through video for meteors Installation Install python packages b
ReferFormer - Official Implementation of ReferFormer
The official implementation of the paper: Language as Queries for Referring Vide
VideocompBot - This is a TG Video Compress Bot. Product by BINARY Tech
VideocompBot - This is a TG Video Compress Bot. Product by BINARY Tech
Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".
I2V-GAN This repository is the official Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation". Traffic
Code and dataset for AAAI 2021 paper FixMyPose: Pose Correctional Describing and Retrieval Hyounghun Kim, Abhay Zala, Graham Burri, Mohit Bansal.
FixMyPose / फिक्समाइपोज़ Code and dataset for AAAI 2021 paper "FixMyPose: Pose Correctional Describing and Retrieval" Hyounghun Kim*, Abhay Zala*, Grah
Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning
Semantic Grouping Network for Video Captioning Hobin Ryu, Sunghun Kang, Haeyong Kang, and Chang D. Yoo. AAAI 2021. [arxiv] Environment Ubuntu 16.04 CU
PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning PyTorch code for our ACL 2020 paper "MART: Memory-Augmented Recur
This repository focuses on Image Captioning, Video Captioning, Seq-to-Seq Learning, and NLP
Awesome-Visual-Captioning Table of Contents ACL-2021 CVPR-2021 AAAI-2021 ACMMM-2020 NeurIPS-2020 ECCV-2020 CVPR-2020 ACL-2020 AAAI-2020 ACL-2019 NeurI
Stinky ID - A stable pluggable Telegram userbot + Voice & Video Call music bot, based on Telethon
Ultroid - UserBot A stable pluggable Telegram userbot + Voice & Video Call music
Tensorflow implementation of soft-attention mechanism for video caption generation.
SA-tensorflow Tensorflow implementation of soft-attention mechanism for video caption generation. An example of soft-attention mechanism. The attentio
Code release for Convolutional Two-Stream Network Fusion for Video Action Recognition
Convolutional Two-Stream Network Fusion for Video Action Recognition
Telegram Music/Video Streaming Bot Using Pytgcalls
Video Player Zaid VC Player is a Telegram project based on Pyrogram for playing music in VC chats... Repo Stats Requirements FFmpeg NodeJ
A video scene detection algorithm is designed to detect a variety of different scenes within a video
Scene-Change-Detection - A video scene detection algorithm is designed to detect a variety of different scenes within a video. There is a very simple definition for a scene: It is a series of logically and chronologically related shots taken in a specific order to depict an over-arching concept or story.
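DIP-football - A football video analysis system based on YOLOv5, AlphaPose, and Qt6
Football Video Analysis System Authors [email protected] [email protected] Introduction This project is the final project for SJTU's CS386 Digital Image Processing course (2021-22 academic year), and this document is the reference documentation for the football video analysis system. We mainly implemented the following features: Based on YOLOv5 and PastaNet, we built a football video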
Doods2 - API for detecting objects in images and video streams using Tensorflow
DOODS2 - Return of DOODS Dedicated Open Object Detection Service - Yes, it's a b
Video-Player - Telegram Music/Video Streaming Bot Using Pytgcalls
Video Player Zaid VC Player is a Telegram project based on Pyrogram for play
Ffmpeg videostream - High speed video frame access in Python, using FFmpeg and FFshow
FFmpeg VideoStream High speed video frame access in Python, using FFmpeg and FFshow This script requires: Karl Kroening's 'ffmpeg-python' library. (ht
Video stream image stacking -- live version
video stream image stacking v2 -- live version A very simple streamed video image stacking code! Version 2.1 left mouse click to select a small region
A Telegram bot that can download video and audio from YouTube and video websites quickly
A Telegram bot that can download video and audio from YouTube and video websites quickly
Video2x - A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR.
Official Discussion Group (Telegram): https://t.me/video2x A Discord server is also available. Please note that most developers are only on Telegram.
Autosub - Command-line utility for auto-generating subtitles for any video file
Auto-generated subtitles for any video Autosub is a utility for automatic speech recognition and subtitle generation. It takes a video or an a
Repository created to host the notebooks with the exercise lists proposed by Professor Gustavo Guanabara of the YouTube channel Curso em Vídeo during the Python 3 course
Curso em Vídeo - Python 3 Exercises About the repository This repository contains the notebooks with the exercise lists proposed by Professor G
Videocaptioning.pytorch - A simple implementation of video captioning
pytorch implementation of video captioning recommend installing pytorch and pyth
Vector space based Information Retrieval System for Text Processing - Information retrieval
Information Retrieval: Text Processing Group 13 Sequence of operations Install Requirements Add given wikipedia files to the corpus directory. Downloa
YouTube-Video-Downloader - Download Youtube Videos for free.
YouTube-Video-Downloader Download Youtube Videos for free. Installing Dependencies:- Windows pip install pytube Mac/Linux pip3 install pytube Clonin
Face_mosaic - Mosaic blur processing is applied to multiple faces appearing in the video
Motivation The face coordinates obtained with face_recognition are rectangular. If blurring is applied using these coordinates as they are, the resulting image looks unattractive. To address this,
A Survey on Deep Learning Technique for Video Segmentation
A Survey on Deep Learning Technique for Video Segmentation A Survey on Deep Learning Technique for Video Segmentation Wenguan Wang, Tianfei Zhou, Fati
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation.
Training Script for Reuse-VOS This is the code implementation of the CVPR 2021 paper: Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Vi
CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing
CapsuleVOS This is the code for the ICCV 2019 paper CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing. Arxiv Link: https://a
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval, CVPR 2021. Ayan Kumar Bhunia, Pinaki nath Chowdh