A demo for end-to-end English and Chinese text spotting using ABCNet.

Yuliang Liu

Last update: Oct 4, 2022

Related tags

Text Data & NLP ABCNet_Chinese

Overview

ABCNet_Chinese

A demo for end-to-end English and Chinese text spotting using ABCNet. This is an old model that was trained a long ago, which serves as a base setting for others to train their own model on Chinese or other language. Official ABCNet_v2 models will be updated in AdelaiDet.

Installation

Install detectron2 using the provided version (support visualizing Chinese text):

python -m pip install -e d2

Install this repo:

python setup.py build develop

If the above succeed, you can now run the demo using the provided model.

Model

This is our model that can be used for evaluation or pretraining.

wget https://drive.google.com/file/d/1iWX2n_BmyltVwQmfj8_oM9z7cJlq1P0m/view?usp=sharing -O model_chn.pth

Simply put the model in the root directory of the repo.

Demo

bash demo.sh

Example results

If you successfully run the demo, you will get the output below:

Other results (same project but not using the provide model):

Document-like Ancient words, e.g., “彝文”:

Cite

If you find this repo useful, please cite:

@article{liu2021abcnet,
  title={ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting},
  author={Liu, Yuliang and Shen, Chunhua and Jin, Lianwen and He, Tong and Chen, Peng and Liu, Chongyu and Chen, Hao},
  journal={arXiv preprint arXiv:2105.03620},
  year={2021}
}

Data

We provide the converted json files of ArT, LSVT, and ReCTS that we have used for training ABCNet_Chinese.

ReCTs [images&label](1.7G) [Origin_of_dataset]
LSVT [images&label](8.2G) [Origin_of_dataset]
ArT [images&label](1.5G) [Origin_of_dataset]
SynChinese130k [images&label](25G) [Origin_of_dataset]

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact Chunhua Shen.

Comments

chn_cls_list.txt

hello, Thank you for sharing. when I run the demo, the result can't show recognition text. I found the encoding file(chn_cls_list.txt) shows the messy text, I have installed Chinese fonts, but it doesn't display properly in Ubuntu or windows systems. can you help me?

opened by ChadPieere 0

PyTorch implementation of Microsoft's text-to-speech system FastSpeech 2: Fast and High-Quality End-to-End Text to Speech.

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

1k Dec 30, 2022

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

Text to speech (using Python) Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and co

19 Jun 30, 2022

Text editor on python to convert english text to malayalam(Romanization/Transiteration).

Manglish Text Editor This is a simple transiteration (romanization ) program which is used to convert manglish to malayalam (converts njaan to ഞാൻ ).

1 May 11, 2022

Text editor on python tkinter to convert english text to other languages with the help of ployglot.

Transliterator Text Editor This is a simple transliteration program which is used to convert english word to phonetically matching word in another lan

1 Jan 16, 2022

End-to-end text to speech system using gruut and onnx. There are 40 voices available across 8 languages.

End to end text to speech system using gruut and onnx

673 Dec 28, 2022

Input english text, then translate it between languages n times using the Deep Translator Python Library.

mass-translator About Input english text, then translate it between languages n times using the Deep Translator Python Library. How to Use Install dep

2 Mar 4, 2022

STS Benchmark comprises a selection of the English datasets used in the STS tasks organized in the context of SemEval between 2012 and 2017. The selection of datasets include text from image captions, news headlines and user forums.

stsb_multi_mt_en STS Benchmark comprises a selection of the English datasets used in the STS tasks organized in the context of SemEval between 2012 an

2 Nov 5, 2021

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

CRNN paper：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition 1. create your ow

3 Apr 2, 2022

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Text-Summarization-using-NLP Text Summarization using NLP to fetch BBC News Arti

21 Aug 6, 2022

A demo for end-to-end English and Chinese text spotting using ABCNet.

Related tags

Overview

ABCNet_Chinese

Installation

Model

Demo

Example results

Cite

Data

License

You might also like...

PyTorch implementation of Microsoft's text-to-speech system FastSpeech 2: Fast and High-Quality End-to-End Text to Speech.

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

Text editor on python to convert english text to malayalam(Romanization/Transiteration).

Text editor on python tkinter to convert english text to other languages with the help of ployglot.

End-to-end text to speech system using gruut and onnx. There are 40 voices available across 8 languages.

Input english text, then translate it between languages n times using the Deep Translator Python Library.

STS Benchmark comprises a selection of the English datasets used in the STS tasks organized in the context of SemEval between 2012 and 2017. The selection of datasets include text from image captions, news headlines and user forums.

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Comments

chn_cls_list.txt

Owner

Yuliang Liu

This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Chinese real time voice cloning (VC) and Chinese text to speech (TTS).

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

A unified tokenization tool for Images, Chinese and English.

A Chinese to English Neural Model Translation Project

A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.

vits chinese, tts chinese, tts mandarin

A demo of chinese asr

Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks