A curated list of resources dedicated to scene text localization and recognition

CarlosTao

Last update: Dec 22, 2022

Scene Text Localization & Recognition Resources

A curated list of resources dedicated to scene text localization and recognition. Any suggestions and pull requests are welcome.

Papers & Code

Overview

[2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper
[2014-Front.Comput.Sci] Scene Text Detection and Recognition: Recent Advances and Future Trends paper

Visual Geometry Group, University of Oxford

[2016-IJCV, M. Jaderberg] Reading Text in the Wild with Convolutional Neural Networks paper demo homepage
[2016-CVPR, A Gupta] Synthetic Data for Text Localisation in Natural Images paper code data
[2015-ICLR, M. Jaderberg] Deep structured output learning for unconstrained text recognition paper
[2015-D.Phil Thesis, M. Jaderberg] Deep Learning for Text Spotting paper
[2014-ECCV, M. Jaderberg] Deep Features for Text Spotting paper code model GitXiv
[2014-NIPS, M. Jaderberg] Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition paper homepage model

CUHK & SIAT

[2016-arXiv] Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network paper
[2016-AAAI] Reading Scene Text in Deep Convolutional Sequences paper
[2016-TIP] Text-Attentional Convolutional Neural Networks for Scene Text Detection paper
[2014-ECCV] Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees paper

Media and Communication Lab, HUST

[2016-CVPR] Robust scene text recognition with automatic rectification paper
[2016-CVPR] Multi-oriented text detection with fully convolutional networks paper
[2015-CoRR] An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition paper code github

AI Lab, Stanford

[2012-ICPR, Wang] End-to-End Text Recognition with Convolutional Neural Networks paper code SVHN Dataset
[2012-PhD thesis, David Wu] End-to-End Text Recognition with Convolutional Neural Networks paper

Others

[2018-CVPR] FOTS: Fast Oriented Text Spotting With a Unified Network paper
[2018-IJCAI] IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection paper
[2018-AAAI] PixelLink: Detecting Scene Text via Instance Segmentation paper code
[2018-AAAI] SEE: Towards Semi-Supervised End-to-End Scene Text Recognition paper code
[2017-arXiv] Fused Text Segmentation Networks for Multi-oriented Scene Text Detection paper
[2017-arXiv] WeText: Scene Text Detection under Weak Supervision paper
[2017-ICCV] Single Shot Text Detector with Regional Attention paper
[2017-ICCV] WordSup: Exploiting Word Annotations for Character based Text Detection paper
[2017-arXiv] R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection paper
[2017-CVPR] EAST: An Efficient and Accurate Scene Text Detector paper code
[2017-arXiv] Cascaded Segmentation-Detection Networks for Word-Level Text Spotting paper
[2017-arXiv] Deep Direct Regression for Multi-Oriented Scene Text Detection paper
[2017-CVPR] Detecting oriented text in natural images by linking segments paper code
[2017-CVPR] Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection paper
[2017-arXiv] Arbitrary-Oriented Scene Text Detection via Rotation Proposals paper
[2017-AAAI] TextBoxes: A Fast Text Detector with a Single Deep Neural Network paper code
[2017-ICCV] Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework paper code
[2016-CVPR] Recursive Recurrent Nets with Attention Modeling for OCR in the Wild paper
[2016-arXiv] COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images paper
[2016-arXiv] DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images paper
[2015-ICDAR] Object Proposals for Text Extraction in the Wild paper code
[2014-TPAMI] Word Spotting and Recognition with Embedded Attributes paper homepage code

Datasets

MLT 2017 2017
- 7200 training, 1800 validation images
- Bounding box, text transcription, and script annotations
- Task: text detection, script identification
COCO-Text (Computer Vision Group, Cornell) 2016
- 63,686 images, 173,589 text instances, 3 fine-grained text attributes.
- Task: text location and recognition
- COCO-Text API
Synthetic Word Dataset (Oxford, VGG) 2014
- 9 million images covering 90k English words
- Task: text recognition, segmentation
- download
IIIT 5K-Words 2012
- 5000 images from scene text and born-digital sources (2k training and 3k testing images)
- Each image is a cropped word image of scene text with case-insensitive labels
- Task: text recognition
- download
StanfordSynth (Stanford, AI Group) 2012
- Small single-character images of 62 characters (0-9, a-z, A-Z)
- Task: text recognition
- download
MSRA Text Detection 500 Database (MSRA-TD500) 2012
- 500 natural images (resolutions vary from 1296x864 to 1920x1280)
- Chinese, English or mixture of both
- Task: text detection
Street View Text (SVT) 2010
- 350 high-resolution images (average size 1260 × 860): 100 for training and 250 for testing
- Only word level bounding boxes are provided with case-insensitive labels
- Task: text location
KAIST Scene_Text Database 2010
- 3000 images of indoor and outdoor scenes containing text
- Korean, English (Number), and Mixed (Korean + English + Number)
- Task: text location, segmentation and recognition
Chars74k 2009
- Over 74K character images in total, from natural images plus hand-drawn and synthetically generated characters
- Small single-character images of 62 characters (0-9, a-z, A-Z)
- Task: text recognition
ICDAR Benchmark Datasets

Dataset	Description	Competition Paper
ICDAR 2015	1000 training images and 500 testing images	`paper`
ICDAR 2013	229 training images and 233 testing images	`paper`
ICDAR 2011	229 training images and 255 testing images	`paper`
ICDAR 2005	1001 training images and 489 testing images	`paper`
ICDAR 2003	181 training images and 251 testing images (word level and character level)	`paper`
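
Several of the recognition datasets listed above (for example IIIT 5K-Words and SVT) provide case-insensitive labels, so predictions are usually compared to the ground truth after case folding. The snippet below is a minimal sketch of such a word-accuracy check, not the official evaluation protocol of any of these benchmarks. The function name `word_accuracy` and the sample strings are hypothetical and chosen only for illustration.

```python
def word_accuracy(predictions, labels, case_sensitive=False):
    """Return the fraction of predicted words that exactly match their label.

    Assumption: one predicted string per cropped word image, in the same
    order as the labels. Leading and trailing whitespace is ignored.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        return 0.0

    def normalize(text):
        text = text.strip()
        # Case folding makes the comparison case-insensitive.
        return text if case_sensitive else text.casefold()

    correct = sum(
        normalize(pred) == normalize(label)
        for pred, label in zip(predictions, labels)
    )
    return correct / len(labels)


if __name__ == "__main__":
    # Hypothetical recognizer outputs and ground-truth labels.
    preds = ["Coffee", "EXIT", "opn"]
    truth = ["COFFEE", "exit", "OPEN"]
    print(word_accuracy(preds, truth))
    print(word_accuracy(preds, truth, case_sensitive=True))
```

Passing `case_sensitive=True` shows how much a score depends on casing alone, which is worth checking before comparing numbers across datasets with different labeling conventions.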

Blogs
