227 Python Math-ocr Libraries

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

DeepSceneTextReader This is a c++ project deploying a deep scene text reading pipeline. It reads text from natural scene images. Prerequsites The proj

49 Sep 10, 2022

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection Abstract This is a caffe re-implementation of R2CNN: Rotational Region CNN fo

80 Dec 28, 2021

Python library to extract tabular data from images and scanned PDFs

Overview ExtractTable - API to extract tabular data from images and scanned PDFs The motivation is to make it easy for developers to extract tabular d

165 Dec 31, 2022

Extract tables from scanned image PDFs using Optical Character Recognition.

ocr-table This project aims to extract tables from scanned image PDFs using Optical Character Recognition. Install Requirements Tesseract OCR sudo apt

209 Dec 6, 2022

Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.

Table of Contents Overview Requirements Demo Modules Overview This python package contains modules to help with finding and extracting tabular data fr

311 Dec 24, 2022

Detect textlines in document images

Textline Detection Detect textlines in document images Introduction This tool performs border, region and textline detection from document image data

70 Jun 30, 2022

OCR software for recognition of handwritten text

Handwriting OCR The project tries to create software for recognition of a handwritten text from photos (also for Czech language). It uses computer vis

562 Jan 3, 2023

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Handwritten Text Recognition with TensorFlow Update 2021: more robust model, faster dataloader, word beam search decoder also available for Windows Up

1.5k Jan 7, 2023

Let's explore how we can extract text from forms

Form Segmentation Let's explore how we can extract text from any forms / scanned pages. Objectives The goal is to find an algorithm that can extract t

42 Jun 5, 2022

Document Layout Analysis

Eynollah Document Layout Analysis Introduction This tool performs document layout analysis (segmentation) from image data and returns the results as P

198 Dec 29, 2022

OCR-D-compliant page segmentation

ocrd_segment This repository aims to provide a number of OCR-D-compliant processors for layout analysis and evaluation. Installation In your virtual e

59 Sep 10, 2022

Detect handwritten words in a text-line (classic image processing method).

Word segmentation Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is fro

190 Jan 3, 2023

Detect textlines in document images

Textline Detection Detect textlines in document images Introduction This tool performs border, region and textline detection from document image data

70 Jun 30, 2022

An application of high resolution GANs to dewarp images of perturbed documents

Docuwarp This project is focused on dewarping document images through the usage of pix2pixHD, a GAN that is useful for general image to image translat

97 Dec 25, 2022

The MATH Dataset

Measuring Mathematical Problem Solving With the MATH Dataset This is the repository for Measuring Mathematical Problem Solving With the MATH Dataset b

267 Dec 26, 2022

Animation engine for explanatory math videos

Manim is an engine for precise programatic animations, designed for creating explanatory math videos. Note, there are two versions of manim. This repo

48.9k Jan 3, 2023

Scan, index, and archive all of your paper documents

[ en | de | el ] Important news about the future of this project It's been more than 5 years since I started this project on a whim as an effort to tr

7.8k Jan 6, 2023

:mag: Ambar: Document Search Engine

🔍 Ambar: Document Search Engine Ambar is an open-source document search engine with automated crawling, OCR, tagging and instant full-text search. Am

1.9k Jan 9, 2023

A Python library created to assist programmers with complex mathematical functions

libmaths was created not only as a learning experience for me, but as a way to make mathematical models in seconds for Python users using mat

73 Oct 2, 2022

Python-based tools for document analysis and OCR

ocropy OCRopus is a collection of document analysis programs, not a turn-key OCR system. In order to apply it to your documents, you may need to do so

3.2k Jan 4, 2023

FastOCR is a desktop application for OCR API.

FastOCR FastOCR is a desktop application for OCR API. Installation Arch Linux fastocr-git @ AUR Build from AUR or install with your favorite AUR helpe

58 Jan 7, 2023

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

text recognition toolbox 1. 项目介绍该项目是基于pytorch深度学习框架，以统一的改写方式实现了以下6篇经典的文字识别论文，论文的详情如下。该项目会持续进行更新，欢迎大家提出问题以及对代码进行贡献。模型论文标题发表年份模型方法划分 CRNN 《An End-t

168 Dec 24, 2022

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

EasyOCR Ready-to-use OCR with 80+ languages supported including Chinese, Japanese, Korean and Thai. What's new 1 February 2021 - Version 1.2.3 Add set

16.7k Jan 3, 2023

Python Math-ocr Resources

Python math-ocr Libraries

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

Python library to extract tabular data from images and scanned PDFs

Extract tables from scanned image PDFs using Optical Character Recognition.

Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.

Detect textlines in document images

OCR software for recognition of handwritten text

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Let's explore how we can extract text from forms

Document Layout Analysis

OCR-D-compliant page segmentation

Detect handwritten words in a text-line (classic image processing method).

Detect textlines in document images

An application of high resolution GANs to dewarp images of perturbed documents

The MATH Dataset

Animation engine for explanatory math videos

Scan, index, and archive all of your paper documents

:mag: Ambar: Document Search Engine

A Python library created to assist programmers with complex mathematical functions

Python-based tools for document analysis and OCR

FastOCR is a desktop application for OCR API.

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

A Python wrapper for the tesseract-ocr API

Tesseract Open Source OCR Engine (main repository)

A computer algebra system written in pure Python

A Python wrapper for the tesseract-ocr API

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python Math-ocr Resources

Python math-ocr Libraries

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

Python library to extract tabular data from images and scanned PDFs

Extract tables from scanned image PDFs using Optical Character Recognition.

Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.

Detect textlines in document images

OCR software for recognition of handwritten text

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Let's explore how we can extract text from forms

Document Layout Analysis

OCR-D-compliant page segmentation

Detect handwritten words in a text-line (classic image processing method).

Detect textlines in document images

An application of high resolution GANs to dewarp images of perturbed documents

The MATH Dataset

Animation engine for explanatory math videos

Scan, index, and archive all of your paper documents

:mag: Ambar: Document Search Engine

A Python library created to assist programmers with complex mathematical functions

Python-based tools for document analysis and OCR

FastOCR is a desktop application for OCR API.

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

A Python wrapper for the tesseract-ocr API

Tesseract Open Source OCR Engine (main repository)

A computer algebra system written in pure Python

A Python wrapper for the tesseract-ocr API

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Related tags