12 Python Multimodality Libraries

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

package tests docs license stats support This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML

National Center for Cognitive Research of ITMO University

482 Dec 26, 2022

X-VLM: Multi-Grained Vision Language Pre-Training

X-VLM: learning multi-grained vision language alignments Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts. Yan Zeng, Xi

286 Dec 23, 2022

A knowledge base construction engine for richly formatted data

Fonduer is a Python package and framework for building knowledge base construction (KBC) applications from richly formatted data. Note that Fonduer is

386 Dec 5, 2022

Repo for the ACMMM20 submission: "Personalized breath based biometric authentication with wearable multimodality".

personalized-breath Repo for the ACMMM20 submission: "Personalized breath based biometric authentication with wearable multimodality". Guideline To ex

2 Nov 15, 2021

Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”

VectorNet Re-implementation This is the unofficial pytorch implementation of CVPR2020 paper "VectorNet: Encoding HD Maps and Agent Dynamics from Vecto

120 Jan 6, 2023

GluonMM is a library of transformer models for computer vision and multi-modality research

GluonMM is a library of transformer models for computer vision and multi-modality research. It contains reference implementations of widely adopted baseline models and also research work from Amazon Research.

42 Dec 2, 2022

Generate vibrant and detailed images using only text.

CLIP Guided Diffusion From RiversHaveWings. Generate vibrant and detailed images using only text. See captions and more generations in the Gallery See

401 Dec 28, 2022

Automated modeling and machine learning framework FEDOT

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML). It can build custom modeling pipelines for different real-world processes in an automated way using an evolutionary approach. FEDOT supports classification (binary and multiclass), regression, clustering, and time series prediction tasks.

148 Jul 5, 2021

Python Multimodality Resources

Python multimodality Libraries

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

X-VLM: Multi-Grained Vision Language Pre-Training

A knowledge base construction engine for richly formatted data

Repo for the ACMMM20 submission: "Personalized breath based biometric authentication with wearable multimodality".

Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”

GluonMM is a library of transformer models for computer vision and multi-modality research

Generate vibrant and detailed images using only text.

Automated modeling and machine learning framework FEDOT

Sequence-to-Sequence Framework in PyTorch

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN.

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

Python Multimodality Resources

Related tags

Python multimodality Libraries

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

X-VLM: Multi-Grained Vision Language Pre-Training

A knowledge base construction engine for richly formatted data

Repo for the ACMMM20 submission: "Personalized breath based biometric authentication with wearable multimodality".

Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”

GluonMM is a library of transformer models for computer vision and multi-modality research

Generate vibrant and detailed images using only text.

Automated modeling and machine learning framework FEDOT

Sequence-to-Sequence Framework in PyTorch

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN.

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN