RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Overview

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

For more details, please refer to our paper.

Citing

Please cite the related works in your publications if it helps your research:

@inproceedings{liao2018rotation,
  title={Rotation-Sensitive Regression for Oriented Scene Text Detection},
  author={Liao, Minghui and Zhu, Zhen and Shi, Baoguang and Xia, Gui-song and Bai, Xiang},
  booktitle={Proc. CVPR},
  pages={5909--5918},
  year={2018}
}

Models

  1. model trained on ICDAR 2015 Incidental Text
    BaiduYun
    Google Drive

Training of other models are in progress.

Demo

Download the ICDAR 2015 model and place it in "./models/ic15/"

python examples/text/demo.py

The detection results and recognition results are in "./visu_demo/"

Training

Coming soon

You might also like...
End-to-end pipeline for real-time scene text detection and recognition.
End-to-end pipeline for real-time scene text detection and recognition.

Real-time-Scene-Text-Detection-and-Recognition-System End-to-end pipeline for real-time scene text detection and recognition. The detection model use

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection Abstract This is a caffe re-implementation of R2CNN: Rotational Region CNN fo

Scene text detection and recognition based on Extremal Region(ER)
Scene text detection and recognition based on Extremal Region(ER)

Scene text recognition A real-time scene text recognition algorithm. Our system is able to recognize text in unconstrain background. This algorithm is

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection Introduction The code and trained models of: TextField: Learning A Deep

A curated list of papers and resources for scene text detection and recognition

Awesome Scene Text A curated list of papers and resources for scene text detection and recognition The year when a paper was first published, includin

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized

SceneTextPapers Tracking the latest progress in Scene Text Detection and Recognition: must-read papers well organized Information about this repositor

A toolbox of scene text detection and recognition

FudanOCR This toolbox contains the implementations of the following papers: Scene Text Telescope: Text-Focused Scene Image Super-Resolution [Chen et a

 AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car
OCR, Scene-Text-Understanding, Text Recognition

Scene-Text-Understanding Survey [2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper [2014-Front.Comput.Sci] Scene Text Detection and

Comments
  • How's the result in ICDAR-2015 without pretraining?

    How's the result in ICDAR-2015 without pretraining?

    Hi,Liao.Counld you please tell me how's the F1 score in ICDAR-2015 incident images if you only use 1k training images without using synthtext dataset to pretrain?

    opened by xbcReal 0
  • Gpu is forced? make with cpu_only = 1 failed.

    Gpu is forced? make with cpu_only = 1 failed.

    I think running demo with pretrained model don't need a GPU. So I change the Makefile.config with cpu_only := 1 but the following erroe occurs when make:

    ../lib/libcaffe.a(layer_factory.cpp.o):(.data.rel.ro._ZTVN5caffe16ConvolutionLayerIfEE[_ZTVN5caffe16ConvolutionLayerIfEE]+0xd0):对‘caffe::ConvolutionLayer::RotateARF_gpu()’未定义的引用 ../lib/libcaffe.a(layer_factory.cpp.o):(.data.rel.ro._ZTVN5caffe16ConvolutionLayerIfEE[_ZTVN5caffe16ConvolutionLayerIfEE]+0xe0):对‘caffe::ConvolutionLayer::AlignARF_gpu()’未定义的引用 ../lib/libcaffe.a(layer_factory.cpp.o):(.data.rel.ro._ZTVN5caffe16ConvolutionLayerIdEE[_ZTVN5caffe16ConvolutionLayerIdEE]+0xd0):对‘caffe::ConvolutionLayer::RotateARF_gpu()’未定义的引用 ../lib/libcaffe.a(layer_factory.cpp.o):(.data.rel.ro._ZTVN5caffe16ConvolutionLayerIdEE[_ZTVN5caffe16ConvolutionLayerIdEE]+0xe0):对‘caffe::ConvolutionLayer::AlignARF_gpu()’未定义的引用 collect2: error: ld returned 1 exit status tools/CMakeFiles/convert_imageset.dir/build.make:130: recipe for target 'tools/convert_imageset' failed make[2]: *** [tools/convert_imageset] Error 1 CMakeFiles/Makefile2:441: recipe for target 'tools/CMakeFiles/convert_imageset.dir/all' failed make[1]: *** [tools/CMakeFiles/convert_imageset.dir/all] Error 2 Makefile:129: recipe for target 'all' failed make: *** [all] Error 2

    opened by jingmouren 0
  • cmake is failing

    cmake is failing

    cmake is failing, I already have caffe, numpy & Protobuf installed

    home@home-lnx:~/programs/RRD$ source activate py3
    (py3) home@home-lnx:~/programs/RRD$ mkdir build
    (py3) home@home-lnx:~/programs/RRD$ cd build
    (py3) home@home-lnx:~/programs/RRD/build$ cmake ..
    -- The C compiler identification is GNU 7.3.0
    -- The CXX compiler identification is GNU 7.3.0
    -- Check for working C compiler: /usr/bin/cc
    -- Check for working C compiler: /usr/bin/cc -- works
    -- Detecting C compiler ABI info
    -- Detecting C compiler ABI info - done
    -- Detecting C compile features
    -- Detecting C compile features - done
    -- Check for working CXX compiler: /usr/bin/c++
    -- Check for working CXX compiler: /usr/bin/c++ -- works
    -- Detecting CXX compiler ABI info
    -- Detecting CXX compiler ABI info - done
    -- Detecting CXX compile features
    -- Detecting CXX compile features - done
    -- Looking for pthread.h
    -- Looking for pthread.h - found
    -- Looking for pthread_create
    -- Looking for pthread_create - not found
    -- Looking for pthread_create in pthreads
    -- Looking for pthread_create in pthreads - not found
    -- Looking for pthread_create in pthread
    -- Looking for pthread_create in pthread - found
    -- Found Threads: TRUE  
    -- Boost version: 1.65.1
    -- Found the following Boost libraries:
    --   system
    --   thread
    --   filesystem
    --   regex
    --   chrono
    --   date_time
    --   atomic
    -- Could NOT find GFlags (missing: GFLAGS_INCLUDE_DIR GFLAGS_LIBRARY) 
    -- Could NOT find Glog (missing: GLOG_INCLUDE_DIR GLOG_LIBRARY) 
    CMake Error at /usr/share/cmake-3.10/Modules/FindPackageHandleStandardArgs.cmake:137 (message):
      Could NOT find Protobuf (missing: Protobuf_INCLUDE_DIR)
    Call Stack (most recent call first):
      /usr/share/cmake-3.10/Modules/FindPackageHandleStandardArgs.cmake:378 (_FPHSA_FAILURE_MESSAGE)
      /usr/share/cmake-3.10/Modules/FindProtobuf.cmake:543 (FIND_PACKAGE_HANDLE_STANDARD_ARGS)
      cmake/ProtoBuf.cmake:4 (find_package)
      cmake/Dependencies.cmake:24 (include)
      CMakeLists.txt:43 (include)
    
    
    -- Configuring incomplete, errors occurred!
    See also "/home/home/programs/RRD/build/CMakeFiles/CMakeOutput.log".
    See also "/home/home/programs/RRD/build/CMakeFiles/CMakeError.log".
    
    opened by ghost 0
  • boost compile error

    boost compile error

    found error like: boost/geometry/policies/relate/direction.hpp:133:24: error: expected primary-expression before ‘template’ boost/geometry/policies/relate/direction.hpp:133:24: error: expected ‘:’ before ‘template’ /boost/geometry/policies/is_valid/failing_reason_policy.hpp:137:57: error: ‘Data1’ was not declared in this scope

    which version of boost is used?

    opened by ustcxiayu 0
Owner
Minghui Liao
Minghui Liao, a Ph.D. student of Huazhong University of Science and Technology.
Minghui Liao
YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)(Rotation Detection)(Rotated BBox)

YOLOv5_DOTA_OBB YOLOv5 in DOTA_OBB dataset with CSL_label.(Oriented Object Detection) Datasets and pretrained checkpoint Datasets : DOTA Pretrained Ch

null 1.1k Dec 30, 2022
This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

This is an oriented object detector based on tensorflow object detection API. Most of the code is not changed except for those related to the need of

Dafang He 30 Oct 22, 2022
An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

InceptText-Tensorflow An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Orien

GeorgeJoe 115 Dec 12, 2022
Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

Security camera running OpenCV for object and motion detection. The camera will send email with image of any objects it detects. It also runs a server that provides web interface with live stream video.

Peace 10 Jun 30, 2021
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation". For more details, please

Pengyuan Lyu 309 Dec 6, 2022
TextBoxes++: A Single-Shot Oriented Scene Text Detector

TextBoxes++: A Single-Shot Oriented Scene Text Detector Introduction This is an application for scene text detection (TextBoxes++) and recognition (CR

Minghui Liao 930 Jan 4, 2023
A novel region proposal network for more general object detection ( including scene text detection ).

DeRPN: Taking a further step toward more general object detection DeRPN is a novel region proposal network which concentrates on improving the adaptiv

Deep Learning and Vision Computing Lab, SCUT 151 Dec 12, 2022
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

Total-Text-Dataset (Official site) Updated on April 29, 2020 (Detection leaderboard is updated - highlighted E2E methods. Thank you shine-lcy.) Update

Chee Seng Chan 671 Dec 27, 2022
Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.

Scene-Text-Detection-with-SPCNET Unofficial repository for [Scene Text Detection with Supervised Pyramid Context Network][https://arxiv.org/abs/1811.0

null 121 Oct 15, 2021