A curated list of resources dedicated to scene text localization and recognition

Overview

Awesome

Scene Text Localization & Recognition Resources

A curated list of resources dedicated to scene text localization and recognition. Any suggestions and pull requests are welcome.

Papers & Code

Overview

  • [2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper
  • [2014-Front.Comput.Sci] Scene Text Detection and Recognition: Recent Advances and Future Trends paper

Visual Geometry Group, University of Oxford

CUHK & SIAT

  • [2016-arXiv] Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network paper
  • [2016-AAAI] Reading Scene Text in Deep Convolutional Sequences paper
  • [2016-TIP] Text-Attentional Convolutional Neural Networks for Scene Text Detection paper
  • [2014-ECCV] Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees paper

Media and Communication Lab, HUST

  • [2016-CVPR] Robust scene text recognition with automatic rectification paper
  • [2016-CVPR] Multi-oriented text detection with fully convolutional networks paper
  • [2015-CoRR] An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition paper code github

AI Lab, Stanford

  • [2012-ICPR, Wang] End-to-End Text Recognition with Convolutional Neural Networks paper code SVHN Dataset
  • [2012-PhD thesis, David Wu] End-to-End Text Recognition with Convolutional Neural Networks paper

Others

  • [2018-CVPR] FOTS: Fast Oriented Text Spotting With a Unified Network paper
  • [2018-IJCAI] IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection paper
  • [2018-AAAI] PixelLink: Detecting Scene Text via Instance Segmentation paper code
  • [2018-AAAI] SEE: Towards Semi-Supervised End-to-End Scene Text Recognition paper code
  • [2017-arXiv] Fused Text Segmentation Networks for Multi-oriented Scene Text Detection paper
  • [2017-arXiv] WeText: Scene Text Detection under Weak Supervision paper
  • [2017-ICCV] Single Shot Text Detector with Regional Attention paper
  • [2017-ICCV] WordSup: Exploiting Word Annotations for Character based Text Detection paper
  • [2017-arXiv] R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection paper
  • [2017-CVPR] EAST: An Efficient and Accurate Scene Text Detector paper code
  • [2017-arXiv] Cascaded Segmentation-Detection Networks for Word-Level Text Spottingpaper
  • [2017-arXiv] Deep Direct Regression for Multi-Oriented Scene Text Detectionpaper
  • [2017-CVPR] Detecting oriented text in natural images by linking segments paper code
  • [2017-CVPR] Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detectionpaper
  • [2017-arXiv] Arbitrary-Oriented Scene Text Detection via Rotation Proposals paper
  • [2017-AAAI] TextBoxes: A Fast Text Detector with a Single Deep Neural Network paper code
  • [2017-ICCV] Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework paper code
  • [2016-CVPR] Recursive Recurrent Nets with Attention Modeling for OCR in the Wild paper
  • [2016-arXiv] COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images paper
  • [2016-arXiv] DeepText:A Unified Framework for Text Proposal Generation and Text Detection in Natural Images paper
  • [2015 ICDAR] Object Proposals for Text Extraction in the Wild paper code
  • [2014-TPAMI] Word Spotting and Recognition with Embedded Attributes paper homepage code

Datasets

  • MLT 2017 2017

    • 7200 training, 1800 validation images
    • Bounding box, text transcription, and script annotations
    • Task: text detection, script identification
  • COCO-Text (Computer Vision Group, Cornell) 2016

    • 63,686 images, 173,589 text instances, 3 fine-grained text attributes.
    • Task: text location and recognition
    • COCO-Text API
  • Synthetic Word Dataset (Oxford, VGG) 2014

    • 9 million images covering 90k English words
    • Task: text recognition, segmentation
    • download
  • IIIT 5K-Words 2012

    • 5000 images from Scene Texts and born-digital (2k training and 3k testing images)
    • Each image is a cropped word image of scene text with case-insensitive labels
    • Task: text recognition
    • download
  • StanfordSynth(Stanford, AI Group) 2012

    • Small single-character images of 62 characters (0-9, a-z, A-Z)
    • Task: text recognition
    • download
  • MSRA Text Detection 500 Database (MSRA-TD500) 2012

    • 500 natural images(resolutions of the images vary from 1296x864 to 1920x1280)
    • Chinese, English or mixture of both
    • Task: text detection
  • Street View Text (SVT) 2010

    • 350 high resolution images (average size 1260 × 860) (100 images for training and 250 images for testing)
    • Only word level bounding boxes are provided with case-insensitive labels
    • Task: text location
  • KAIST Scene_Text Database 2010

    • 3000 images of indoor and outdoor scenes containing text
    • Korean, English (Number), and Mixed (Korean + English + Number)
    • Task: text location, segmantation and recognition
  • Chars74k 2009

    • Over 74K images from natural images, as well as a set of synthetically generated characters
    • Small single-character images of 62 characters (0-9, a-z, A-Z)
    • Task: text recognition
  • ICDAR Benchmark Datasets

Dataset Discription Competition Paper
ICDAR 2015 1000 training images and 500 testing images paper link
ICDAR 2013 229 training images and 233 testing images paper link
ICDAR 2011 229 training images and 255 testing images paper link
ICDAR 2005 1001 training images and 489 testing images paper link
ICDAR 2003 181 training images and 251 testing images(word level and character level) paper link

Blogs

Issues
Owner
CarlosTao
CarlosTao
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

awesome-deep-text-detection-recognition A curated list of awesome deep learning based papers on text detection and recognition. Text Detection Papers

null 2.2k Oct 15, 2021
OCR, Scene-Text-Understanding, Text Recognition

Scene-Text-Understanding Survey [2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper [2014-Front.Comput.Sci] Scene Text Detection and

Alan Tang 349 Oct 5, 2021
Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized

SceneTextPapers Tracking the latest progress in Scene Text Detection and Recognition: must-read papers well organized Information about this repositor

Shangbang Long 713 Oct 13, 2021
A curated list of resources dedicated to scene text localization and recognition

Scene Text Localization & Recognition Resources A curated list of resources dedicated to scene text localization and recognition. Any suggestions and

CarlosTao 1.6k Oct 15, 2021
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

OCR Resources This repository contains a collection of resources (including the papers and datasets) of OCR (Optical Character Recognition). Contents

Zuming Huang 343 Sep 25, 2021
A curated list of promising OCR resources

Call for contributor(paper summary,dataset generation,algorithm implementation and any other useful resources) awesome-ocr A curated list of promising

wanghaisheng 1.5k Oct 17, 2021
Text recognition (optical character recognition) with deep learning methods.

What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis | paper | training and evaluation data | failure cases and cle

Clova AI Research 2.6k Oct 20, 2021
Scene text detection and recognition based on Extremal Region(ER)

Scene text recognition A real-time scene text recognition algorithm. Our system is able to recognize text in unconstrain background. This algorithm is

HSIEH, YI CHIA 150 Aug 31, 2021
Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text

Christian Bartz 555 Sep 30, 2021
Scene text recognition

AttentionOCR for Arbitrary-Shaped Scene Text Recognition Introduction This is the ranked No.1 tensorflow based scene text spotting algorithm on ICDAR2

null 732 Sep 30, 2021
Handwritten Text Recognition (HTR) using TensorFlow 2.x

Handwritten Text Recognition (HTR) system implemented using TensorFlow 2.x and trained on the Bentham/IAM/Rimes/Saint Gall/Washington offline HTR data

Arthur Flôr 101 Oct 21, 2021
The world's simplest facial recognition api for Python and the command line

Face Recognition You can also read a translated version of this file in Chinese 简体中文版 or in Korean 한국어 or in Japanese 日本語. Recognize and manipulate fa

Adam Geitgey 41.8k Oct 22, 2021
Links to awesome OCR projects

Awesome OCR This list contains links to great software tools and libraries and literature related to Optical Character Recognition (OCR). Contribution

Konstantin Baierer 1.9k Oct 20, 2021
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

Total-Text-Dataset (Official site) Updated on April 29, 2020 (Detection leaderboard is updated - highlighted E2E methods. Thank you shine-lcy.) Update

Chee Seng Chan 620 Oct 13, 2021
TextBoxes++: A Single-Shot Oriented Scene Text Detector

TextBoxes++: A Single-Shot Oriented Scene Text Detector Introduction This is an application for scene text detection (TextBoxes++) and recognition (CR

Minghui Liao 907 Oct 21, 2021
Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

AFSD: Learning Salient Boundary Feature for Anchor-free Temporal Action Localization This is an official implementation in PyTorch of AFSD. Our paper

Tencent YouTu Research 79 Oct 20, 2021
End-to-end pipeline for real-time scene text detection and recognition.

Real-time-Scene-Text-Detection-and-Recognition-System End-to-end pipeline for real-time scene text detection and recognition. The detection model use

Fangneng Zhan 85 Oct 4, 2021
MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition Python 2.7 Python 3.6 MORAN is a network with rectification mechanism for

Canjie Luo 568 Oct 15, 2021