This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

Dafang He

Last update: Sep 10, 2022

Related tags

Computer Vision deep-learning deployment scene-text end-to-end-ocr scene-text-recognition scene-text-detection

Overview

DeepSceneTextReader

This is a c++ project deploying a deep scene text reading pipeline. It reads text from natural scene images.

Prerequsites

The project is written in c++ using tensorflow computational framework. It is tested using tensorflow 1.4. Newer version should be ok too, but not tested. Please install:

Tensorflow
nsync project: https://github.com/google/nsync.git This is needed for building tensorflow.
opencv3.3
protobuf
eigen

Please check this project on how to build project using tensorflow with cmake: https://github.com/cjweeks/tensorflow-cmake It greatly helped the progress of building this project. When building tensorflow library, please be careful since we need to use opencv. Looks like there is still problem when including tensorflow and opencv together. It will make opencv unable to read image. Check out this issue: https://github.com/tensorflow/tensorflow/issues/14267 The answer by allenlavoie solved my problem, so I paste it here:

"In the meantime, as long as you're not using any custom ops you can build libtensorflow_cc.so with bazel build --config=monolithic, which will condense everything together into one shared object (no libtensorflow_framework dependence) and seal off non-TensorFlow symbols. That shared object will have protocol buffer symbols."

Status

Currently two pretrained model is provided. One for scene text detection, and one for scene text recognition. More model will be provided. Note that the current model is not so robust. U can easily change to ur trained model. The models will be continuously updated.

build process

cd build

cmake ..

make

It will create an excutable named DetectText in bin folder.

Usage:

The excutable could be excuted in three modes: (1) Detect (2) Recognize (3) Detect and Recognize

Detect

Download the pretrained detector model and put it in model/

./DetectText --detector_graph='model/Detector_model.pb'
--image_filename='test_images/test_img1.jpg' --mode='detect' --output_filename='results/output_image.jpg'

Recognize

Download the pretrained recognizer model and put it in model/ Download the dictionary file and put it in model

./DetectText --recognizer_graph='model/Recognizer_model.pb'
--image_filename='test_images/recognize_image1.jpg' --mode='recognize'
--im_height=32 --im_width=128

Detect and Recognize

Download the pretrained detector and recognizer model and put it in model/ as described previously.

./DetectText --recognizer_graph=$recognizer_graph --detector_graph='model/Detector_model.pb'
--image_filename='model/Recognizer_model.pb' --mode='detect_and_read' --output_filename='results/output_image.jpg'

Model Description

Detector

Faster RCNN Detector Model The detector is trained with modified tensorflow [object detector api]: (https://github.com/tensorflow/models/tree/master/research/object_detection) I modify it by changing the proposal scheme to regress to the 4 coordinates of the oriented bounding box rather than regular rectangular bounding box. Check out this repo for the training code. Pretrained model: FasterRCNN_detector_model.pb
R2CNN will be updated. See R2CNN for details. The code is also modified with tnesorflow [object detector api]: (https://github.com/tensorflow/models/tree/master/research/object_detection) The training code will be released soon.

Recognizer

CTC scene text recognizer. The recognizer model follows the famous scene text recognition CRNN model
Spatial Attention OCR will be updated soon. It is based on GoogleOCR

Detect and Recognize

The whole scene text reading pipeline detects the text and rotate it horizontally and read it with recognizer. The pipeline is here:

Pretrained Models

You can play with the code with provided pretrained models.
They are not fully optimized yet, but could be used for being familiar with the code.
Check them out here: models

You will find two detection models called: (1) FasterRCNN_detector_model.pb (2) R2CNN_detector_model.pb
Two recognition models with their charset: (1) Recognizer_model.pb + charset_full.txt and (2)Recognizer_model_case_insen.pb + charset_case_insen.txt.
Full charset means English letters + digit and case insen means case insensitive English letters + digit. Let me know if u have any problens using them.

Reference and Related Projects

Faster RCNN Faster RCNN paper.
Tensorflow Object Detection API.
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition, reference paper for CRNN model.
tensorflow-cmake, Tutorial of Building Project with tensorflow using cmake.
R2CNN Reference paper for R2CNN.

Contact:

Dafang He. The Penn State University. [email protected] http://personal.psu.edu/duh188/

TextBoxes++: A Single-Shot Oriented Scene Text Detector

TextBoxes++: A Single-Shot Oriented Scene Text Detector Introduction This is an application for scene text detection (TextBoxes++) and recognition (CR

930 Jan 4, 2023

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

text-detection-ctpn Scene text detection based on ctpn (connectionist text proposal network). It is implemented in tensorflow. The origin paper can be

3.3k Dec 30, 2022

Detect and fix skew in images containing text

Alyn Skew detection and correction in images containing text Image with skew Image after deskew Install and use via pip! Recommended way(using virtual

230 Dec 21, 2022

A tensorflow implementation of EAST text detector

EAST: An Efficient and Accurate Scene Text Detector Introduction This is a tensorflow re-implementation of EAST: An Efficient and Accurate Scene Text

2.9k Jan 2, 2023

End-to-end pipeline for real-time scene text detection and recognition.

Real-time-Scene-Text-Detection-and-Recognition-System End-to-end pipeline for real-time scene text detection and recognition. The detection model use

89 Aug 4, 2022

TextBoxes: A Fast Text Detector with a Single Deep Neural Network https://github.com/MhLiao/TextBoxes 基于SSD改进的文本检测算法，textBoxes_note记录了之前整理的笔记。

TextBoxes: A Fast Text Detector with a Single Deep Neural Network Introduction This paper presents an end-to-end trainable fast scene text detector, n

24 Apr 28, 2022

Comments

how to use python to training a crnn model

hello, this repository is very good! thanks you! but i want to train my dataset with Chinese char, so can you post your crnn train code?,thanks so much!

opened by jesen8 3
Read and detect mode

Hi,

I build the code and it is perfectly working in detect and recognize mode... I get strange error on read and detect mode ...

This is my command: ./bin/DetectText --detector_graph='model/FasterRCNN_detector_model.pb' --recognizer_graph='model/Recognizer_model_case_insen.pb' --dictionary_filename='model/charset_case_insen.txt' --image_filename='test_images/img_108.jpg' --mode='detect_and_read' --output_filename='test_images/output_image_2.jpg'

which gives me this errors: r/scene_text_reader.cpp:15] model/charset_case_insen.txt not implemented yet r/recognizer.h:49] Error dictionary opening file FasterRCNN

What is possibly wrong? Even I commented these lines in source and recompiled it and got another error somewhere else...

opened by EsiNaderi 2

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

Related tags

Overview

DeepSceneTextReader

Prerequsites

Status

build process

Usage:

Detect

Recognize

Detect and Recognize

Model Description

Detector

Recognizer

Detect and Recognize

Pretrained Models

Reference and Related Projects

Contact:

You might also like...

TextBoxes++: A Single-Shot Oriented Scene Text Detector

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

Detect and fix skew in images containing text

A tensorflow implementation of EAST text detector

End-to-end pipeline for real-time scene text detection and recognition.

TextBoxes: A Fast Text Detector with a Single Deep Neural Network https://github.com/MhLiao/TextBoxes 基于SSD改进的文本检测算法，textBoxes_note记录了之前整理的笔记。

Detect textlines in document images

Detect textlines in document images

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

Comments

how to use python to training a crnn model

Read and detect mode

Owner

Dafang He

This is an API written in python that uses FastAPI. It is a simple API that can detect discord tokens in Images.

Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.

Face Recognizer using Opencv Python

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car

Implementation of EAST scene text detector in Keras

This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector