Repository collecting all the submodules for the new PyTorch-based OCR System.

NVIDIA Research Projects

Last update: Dec 9, 2022

Related tags

Computer Vision ocropus3

Overview

OCRopus3 is being replaced by OCRopus4, which is a rewrite using PyTorch 1.7; release should be soonish.

Please check github.com/tmbdev/ocropus for updates.

This is a metaproject that collects all the modules/components of the PyTorch-based OCRopus3 OCR system.

To install:

git clone https://github.com/NVlabs/ocropus3 --recursive
cd ocropus3
./install

CNN+LSTM+CTC based OCR implemented using tensorflow.

CNN_LSTM_CTC_Tensorflow CNN+LSTM+CTC based OCR(Optical Character Recognition) implemented using tensorflow. Note: there is No restriction on the numbe

356 Dec 8, 2022

Python-based tools for document analysis and OCR

ocropy OCRopus is a collection of document analysis programs, not a turn-key OCR system. In order to apply it to your documents, you may need to do so

3.2k Dec 31, 2022

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

简介基于Tensorflow和Keras实现端到端的不定长中文字符检测和识别文本检测：CTPN 文本识别：DenseNet + CTC 环境部署 sh setup.sh 注：CPU环境执行前需注释掉for gpu部分，并解开for cpu部分的注释 Demo 将测试图片放入test_images

2.6k Dec 29, 2022

Visual Attention based OCR

Attention-OCR Authours: Qi Guo and Yuntian Deng Visual Attention based OCR. The model first runs a sliding CNN on the image (images are resized to hei

1.1k Jan 2, 2023

Python-based tools for document analysis and OCR

ocropy OCRopus is a collection of document analysis programs, not a turn-key OCR system. In order to apply it to your documents, you may need to do so

3.2k Dec 31, 2022

A pure pytorch implemented ocr project including text detection and recognition

ocr.pytorch A pure pytorch implemented ocr project. Text detection is based CTPN and text recognition is based CRNN. More detection and recognition me

444 Dec 30, 2022

A Python wrapper for the tesseract-ocr API

tesserocr A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR). tesserocr integrates directly with

1.7k Dec 31, 2022

FastOCR is a desktop application for OCR API.

FastOCR FastOCR is a desktop application for OCR API. Installation Arch Linux fastocr-git @ AUR Build from AUR or install with your favorite AUR helpe

58 Jan 7, 2023

OCR-D-compliant page segmentation

ocrd_segment This repository aims to provide a number of OCR-D-compliant processors for layout analysis and evaluation. Installation In your virtual e

59 Sep 10, 2022

Comments

Upgrade to Pytorch 1.0

@tmbdev In-order to use ocropus3 with the latest NVidia RTX cards we need Pytorch 1.0. Because Pytorch 1.0+ have support for Cuda 10. Waiting for your reply

opened by mrocr 1
Assertion Error

The stack trace for the error https://pastebin.com/zdSF7j36 . This happens when I run the command "!ocrobin-pred pages.tgz bin.tgz" in das2018-tutorial/30-full-pipeline.ipynb

opened by warisamin25 1

Owner

NVIDIA Research Projects

GitHub

It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

OCR-Tool It is a image ocr tool made in Python using the Tesseract-OCR engine with the pytesseract package and has a GUI. This is my second ever pytho

4 Jul 11, 2022

Indonesian ID Card OCR using tesseract OCR

KTP OCR Indonesian ID Card OCR using tesseract OCR KTP OCR is python-flask with tesseract web application to convert Indonesian ID Card to text / JSON

5 Dec 6, 2021

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

EasyOCR Ready-to-use OCR with 80+ languages supported including Chinese, Japanese, Korean and Thai. What's new 1 February 2021 - Version 1.2.3 Add set

16.7k Jan 3, 2023

OCR engine for all the languages

Description kraken is a turn-key OCR system optimized for historical and non-Latin script material. kraken's main features are: Fully trainable layout

431 Jan 4, 2023

list all open dataset about ocr.

ocr-open-dataset list all open dataset about ocr. printed dataset year Born-Digital Images (Web and Email) 2011-2015 COCO-Text 2017 Text Extraction fr

95 Nov 24, 2022

A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are made using Tkinter. All code written in python.

About An OCR translator tool. Made by me by utilizing Tesseract, compiled to .exe using pyinstaller. I made this program to learn more about python. I

41 Dec 30, 2022

Repository collecting all the submodules for the new PyTorch-based OCR System.

Related tags

Overview

You might also like...

CNN+LSTM+CTC based OCR implemented using tensorflow.

Python-based tools for document analysis and OCR

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Visual Attention based OCR

Python-based tools for document analysis and OCR

A pure pytorch implemented ocr project including text detection and recognition

A Python wrapper for the tesseract-ocr API

FastOCR is a desktop application for OCR API.

OCR-D-compliant page segmentation

Comments

Upgrade to Pytorch 1.0

Assertion Error

Owner

NVIDIA Research Projects

It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

Indonesian ID Card OCR using tesseract OCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

OCR engine for all the languages

list all open dataset about ocr.

A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are made using Tkinter. All code written in python.

Ocular is a state-of-the-art historical OCR system.

OCR system for Arabic language that converts images of typed text to machine-encoded text.

Tesseract Open Source OCR Engine (main repository)

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR