# Easy-to-use toolkit for retrieval-based chatbots
Our released data can be found at this link. Follow the steps below to use our code.
## How to Use
- Init the repo

Before using the repo, run the following commands to initialize it:

```bash
# create the necessary folders
python init.py

# prepare the environment
# if some package cannot be installed, search for it and install it another way
pip install -r requirements.txt
```
- Train the model

```bash
./scripts/train.sh <dataset_name> <model_name> <cuda_ids>
```
- Test the model [rerank]

```bash
./scripts/test_rerank.sh <dataset_name> <model_name> <cuda_id>
```
- Test the model [recall]

```bash
# different recall_modes are available: q-q, q-r
./scripts/test_recall.sh <dataset_name> <model_name> <cuda_id>
```
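The two recall modes differ only in what is stored in the index: q-q matches the query against stored contexts (and returns their paired responses), while q-r matches the query directly against stored responses. A minimal conceptual sketch, with made-up sentences and 2-d embeddings (not the toolkit's actual code):

```python
import numpy as np

# Toy corpus of (context, response) pairs with made-up 2-d "embeddings".
responses = ["i am fine", "my name is bot"]
ctx_emb = np.array([[1.0, 0.0], [0.0, 1.0]])  # embeddings of the contexts
res_emb = np.array([[0.9, 0.1], [0.1, 0.9]])  # embeddings of the responses

query_emb = np.array([1.0, 0.2])  # embedding of the incoming query

# q-q mode: match the query against the stored *contexts*,
# then return the response paired with the best-matching context.
qq_response = responses[int(np.argmax(ctx_emb @ query_emb))]

# q-r mode: match the query directly against the stored *responses*.
qr_response = responses[int(np.argmax(res_emb @ query_emb))]

print(qq_response)  # "i am fine"
print(qr_response)  # "i am fine"
```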
- Inference the responses and save them into the faiss index

Sometimes inference will miss some data samples; in that case, please use a single GPU (faiss-gpu search is fast even with one GPU).

It should be noted that:

1. For the writer dataset, use the `extract_inference.py` script to generate `inference.txt`.
2. For the other datasets (douban, ecommerce, ubuntu), just `cp train.txt inference.txt`. The dataloader will automatically read `test.txt` to supply the corpus.

```bash
# work_mode=response: inference the responses and save them into faiss (for q-r matching) [dual-bert/dual-bert-fusion]
# work_mode=context: inference the contexts to do q-q matching
# work_mode=gray: inference the contexts, read the faiss index (work_mode=response must have been run first),
#   and search the top-k hard negative samples; remember to set BERTDualInferenceContextDataloader in config/base.yaml
./scripts/inference.sh <dataset_name> <model_name> <cuda_ids>
```
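For the non-writer datasets, the `inference.txt` preparation is literally a file copy. A minimal sketch using a throwaway directory (the paths are illustrative, not the repo's real data layout):

```python
import shutil
import tempfile
from pathlib import Path

# stand-in for a dataset folder such as data/douban (path is illustrative)
root = Path(tempfile.mkdtemp())
(root / "train.txt").write_text("context\tresponse\n")

# for douban/ecommerce/ubuntu: inference.txt is simply a copy of train.txt;
# the dataloader later reads test.txt on its own to supply the corpus
shutil.copy(root / "train.txt", root / "inference.txt")

print((root / "inference.txt").read_text() == (root / "train.txt").read_text())  # True
```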
If you want to generate the gray dataset for a dataset:

```bash
# 1. set the work mode to **response** to generate the response faiss index;
#    corresponding dataset name: BERTDualInferenceDataset
./scripts/inference.sh <dataset_name> response <cuda_ids>

# 2. set the work mode to **gray** to inference the contexts in train.txt and search the
#    top-k candidates as the gray (hard negative) samples;
#    corresponding dataset name: BERTDualInferenceContextDataset
./scripts/inference.sh <dataset_name> gray <cuda_ids>

# 3. set the work mode to **gray-one2many** if you want to generate extra positive samples
#    for each context in the train set; this mode has the same requirements as the **gray** work mode
./scripts/inference.sh <dataset_name> gray-one2many <cuda_ids>
```
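Conceptually, the **gray** work mode mines hard negatives by retrieving the top-k nearest responses for each training context and discarding the ground-truth response. A rough numpy sketch with made-up embeddings (not the repo's implementation):

```python
import numpy as np

# Made-up response "embeddings"; row i is the ground-truth response of context i.
res_emb = np.array([[1.0, 0.0],
                    [0.8, 0.6],
                    [0.0, 1.0],
                    [0.6, 0.8]])
ctx_emb = np.array([1.0, 0.1])  # embedding of training context 0
gt = 0                          # its ground-truth response index
k = 3

# retrieve the top-k responses by inner product, then drop the ground truth;
# the remaining retrieved responses are the gray (hard negative) samples
scores = res_emb @ ctx_emb
topk = np.argsort(-scores)[:k]
hard_negatives = [int(i) for i in topk if i != gt]

print(hard_negatives)  # [1, 3]
```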
If you want to generate the pseudo positive pairs, run the following commands:

```bash
# make sure the dual-bert inference dataset name is BERTDualInferenceDataset
./scripts/inference.sh <dataset_name> unparallel <cuda_ids>
```
- Deploy the rerank and recall models

```bash
# load the model on cuda:0 (this can be changed in the deploy.sh script)
./scripts/deploy.sh <cuda_id>
```

At the same time, you can test the deployed model with:

```bash
# test_mode: recall, rerank, pipeline
./scripts/test_api.sh <test_mode> <dataset>
```
- Test the recall performance of Elasticsearch

Before testing the ES recall, make sure the ES index has been built:

```bash
# recall_mode: q-q/q-r
./scripts/build_es_index.sh <dataset_name> <recall_mode>
```

Then run the recall test:

```bash
# recall_mode: q-q/q-r
./scripts/test_es_recall.sh <dataset_name> <recall_mode> 0
```
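Elasticsearch scores candidates with BM25; as a very rough stand-in, the token-overlap toy below illustrates what the two recall modes index: q-q indexes the contexts, q-r indexes the responses (the data and the scorer are made up, not the ES API):

```python
# Toy (context, response) corpus; overlap() is a crude stand-in for BM25 scoring.
corpus = [
    ("do you like football", "sure i like football very much"),
    ("what is the weather", "it is sunny today"),
]
query = "do you like sports football"

def overlap(a: str, b: str) -> int:
    # number of shared tokens between two whitespace-tokenized strings
    return len(set(a.split()) & set(b.split()))

# q-q recall: the index stores the contexts
qq_best = max(range(len(corpus)), key=lambda i: overlap(query, corpus[i][0]))
# q-r recall: the index stores the responses
qr_best = max(range(len(corpus)), key=lambda i: overlap(query, corpus[i][1]))

print(corpus[qq_best][1])  # "sure i like football very much"
```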
- Use SimCSE to generate the gray responses

```bash
# train the simcse model
./scripts/train.sh <dataset_name> simcse <cuda_ids>
```

```bash
# generate the faiss index; dataset name: BERTSimCSEInferenceDataset
./scripts/inference_response.sh <dataset_name> simcse <cuda_ids>
```

```bash
# generate the context index
./scripts/inference_simcse_response.sh <dataset_name> simcse <cuda_ids>

# generate the test set for the unlikelyhood-gen dataset
./scripts/inference_simcse_unlikelyhood_response.sh <dataset_name> simcse <cuda_ids>
```

```bash
# generate the gray responses
./scripts/inference_gray_simcse.sh <dataset_name> simcse <cuda_ids>

# generate the test set for the unlikelyhood-gen dataset
./scripts/inference_gray_simcse_unlikelyhood.sh <dataset_name> simcse <cuda_ids>
```