Multimodal commodity image retrieval 多模态商品图像检索

hongjie

Last update: Nov 25, 2022

Related tags

Deep Learning multimodel

Overview

Multimodal commodity image retrieval

多模态商品图像检索

Not finished yet...

introduce

explain:The specific description of the project and the product image data set will be supplemented in the future. Welcome to star in advance

使用商品图像数据集的检索结果mAP

CD 商品图像数据集 (https://cs.hrbcu.edu.cn/info/1267/1416.htm)

并提供二进制文件(https://drive.google.com/drive/folders/1Ch3Y9Tek5MQyXLYeJpWQ1oe_YcwNf5c_?usp=sharing)

Fasion-200k

需要初始化path和label_path,运行fasion_dataset.py将会得到训练集和测试集的图片路径，所有过滤后的文本数据以及标签(https://www.kaggle.com/mayukh18/fashion200k-dataset)

in addition

python main.py

所有需要的包都在requirments.txt, 代码中包含了众多注释，你可以在其中发现他们 All required packages are in requirements.txt The code contains many comments, which you can find in them

如果觉得还错欢迎star

If you have any questions, please contact me

cisip-FIRe - Fast Image Retrieval

Fast Image Retrieval (FIRe) is an open source image retrieval project release by Center of Image and Signal Processing Lab (CISiP Lab), Universiti Malaya. This project implements most of the major binary hashing methods to date, together with different popular backbone networks and public datasets.

39 Nov 25, 2022

Source code of our TTH paper: Targeted Trojan-Horse Attacks on Language-based Image Retrieval.

Targeted Trojan-Horse Attacks on Language-based Image Retrieval Source code of our TTH paper: Targeted Trojan-Horse Attacks on Language-based Image Re

7 Aug 23, 2022

A Comparative Framework for Multimodal Recommender Systems

Cornac Cornac is a comparative framework for multimodal recommender systems. It focuses on making it convenient to work with models leveraging auxilia

671 Jan 3, 2023

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

TransBTS: Multimodal Brain Tumor Segmentation Using Transformer This repo is the official implementation for TransBTS: Multimodal Brain Tumor Segmenta

247 Dec 28, 2022

Deep Multimodal Neural Architecture Search

MMNas: Deep Multimodal Neural Architecture Search This repository corresponds to the PyTorch implementation of the MMnas for visual question answering

23 Dec 21, 2022

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

XDVioDet Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020. The proj

64 Dec 12, 2022

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos. By adopting a unified pipeline-based API design, PyKale enforces standardization and minimalism, via reusing existing resources, reducing repetitions and redundancy, and recycling learning models across areas.

370 Dec 27, 2022

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction This is the code for the paper Combining E

69 Dec 26, 2022

MERLOT: Multimodal Neural Script Knowledge Models

merlot MERLOT: Multimodal Neural Script Knowledge Models MERLOT is a model for learning what we are calling "neural script knowledge" -- representatio

190 Dec 22, 2022

Multimodal commodity image retrieval 多模态商品图像检索

Related tags

Overview

Multimodal commodity image retrieval

多模态商品图像检索

introduce

CD 商品图像数据集 (https://cs.hrbcu.edu.cn/info/1267/1416.htm)

Fasion-200k

in addition

如果觉得还错欢迎star

You might also like...

cisip-FIRe - Fast Image Retrieval

Source code of our TTH paper: Targeted Trojan-Horse Attacks on Language-based Image Retrieval.

A Comparative Framework for Multimodal Recommender Systems

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

Deep Multimodal Neural Architecture Search

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

MERLOT: Multimodal Neural Script Knowledge Models

Owner

hongjie

Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Rethinking the U-Net architecture for multimodal biomedical image segmentation

Framework for joint representation learning, evaluation through multimodal registration and comparison with image translation based approaches

Activity image-based video retrieval

Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback

A Joint Video and Image Encoder for End-to-End Retrieval

Code for 'Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning', ICCV 2021

Instance-level Image Retrieval using Reranking Transformers

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking