Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

azhar shaikh

Last update: Oct 10, 2022

Related tags

Overview

Scene Text-Spotting based on PSEnet+CRNN

Pytorch implementation of an end to end Text-Spotter with a PSEnet text detector and CRNN text recognizer. We plan to grow this repository into an open research platform for multi-lingual text detection and recognition from natural scene images, targeted towards low-resource languages.

Requirements

Python 3.6.5
Pytorch 1.2
pyclipper
Polygon 3.0.8
OpenCV 3.4.1

Demo

Download the trained CRNN and PSEnet models from the links provided below.
Copy paths of the models and paste them in params.py
run end-end.py

python end-end.py --img [path to image] --e2e_config_name [end to end config name]

Pre-trained Models

Both PSEnet and CRNN pre-trained models can be found here: gdrive

the PSEnet model is a multi-lingual text detector, trained on MLT 2019. Works quite well!
the CRNN recognizes Hindi, Bangla, Malayalam, Kanada, Tamil, Telugu, Odia, Sanskrit, Marathi!

Download the models in models/ directory and modify params.py if required.

Training instructions

To train your own detection model refer to this file.
To train your own recognition model refer to this file.

Samples

Contributors

Azhar Shaikh, PES University LinkedIn
Nishant Sinha, OffNote Labs

Work done as part of Internship with OffNote Labs.

References

If this repository helps you, please star it. Thank you!

You might also like...

Single Shot Text Detector with Regional Attention

Single Shot Text Detector with Regional Attention Introduction SSTD is initially described in our ICCV 2017 spotlight paper. A third-party implementat

215 Dec 7, 2022

🖺 OCR using tensorflow with attention

tensorflow-ocr 🖺 OCR using tensorflow with attention, batteries included Installation git clone --recursive http://github.com/pannous/tensorflow-ocr

646 Nov 11, 2022

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Attention-based OCR Visual attention-based OCR model for image recognition with additional tools for creating TFRecords datasets and exporting the tra

933 Dec 29, 2022

Comments

No module named 'util.tflogger'

@ekshaks @rahzaazhar When running

python ./Detection/PSEnet/train_ic19MLT.py

I am getting error:

(final) home@home-desktop:~/p8/PAN-PSEnet$ python ./Detection/PSEnet/train_ic19MLT.py
Traceback (most recent call last):
  File "./Detection/PSEnet/train_ic19MLT.py", line 22, in <module>
    from util.tflogger import tfLogger 
ModuleNotFoundError: No module named 'util.tflogger'

opened by ghost 3

recognize pth file

Hi, thank you for your excellent job. I found a few pth files in your recognition folder on gdrive and downloaded them all. But I don't know which one is the model to recognize which language. I choosed the 'hin_best.pth' to recognize the image in your demo folder, but it seems work not quite well and I can not read Hindi. So, please tell me the meaning of abbreviation in your file name, thanks again!!

opened by daben233-bit 2
Documentation on how to train, along with only detecting text boxes
@rahzaazhar @ekshaks Thank you for your hard work,

If I want to only detect the text boxes of an image, what command shall I run

Documentation on how to train
opened by ghost 0

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Related tags

Overview

Scene Text-Spotting based on PSEnet+CRNN

Requirements

Demo

Pre-trained Models

Training instructions

Samples

Contributors

References

You might also like...

Single Shot Text Detector with Regional Attention

🖺 OCR using tensorflow with attention

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Visual Attention based OCR

CNN+Attention+Seq2Seq

An Implementation of the FOTS: Fast Oriented Text Spotting with a Unified Network

TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.

This is the implementation of the paper "Gated Recurrent Convolution Neural Network for OCR"

EQFace: An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition

Comments

No module named 'util.tflogger'

recognize pth file

Documentation on how to train, along with only detecting text boxes

Owner

azhar shaikh

PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

📷 This repository is focused on having various feature implementation of OpenCV in Python.

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

A python scripts that uses 3 different feature extraction methods such as SIFT, SURF and ORB to find a book in a video clip and project trailer of a movie based on that book, on to it.

Using Opencv ,based on Augmental Reality(AR) and will show the feature matching of image and then by finding its matching

textspotter - An End-to-End TextSpotter with Explicit Alignment and Attention

Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight'