Character Segmentation using TensorFlow

Last update: Aug 25, 2022

Related tags

Computer Vision Character-Segmentation

Overview

Character Segmentation

Segment characters and spaces in a single text line, following the paper "Chinese English mixed Character Segmentation as Semantic Segmentation".

dependencies

TensorFlow 1.3 or 1.4

Python 3

differences from the paper

The paper sets the label of spaces to 1 and everything else to 0. That is not ideal: the gap between two characters spans many pixels, so it is hard for the network to tell which pixels should be 1 and which should be 0, even though the approach can work. Here the labels are flipped: characters are set to 1 and spaces to 0.
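The labeling convention described here (characters 1, spaces 0) can be illustrated with a minimal sketch. It assumes one label per pixel column of the text line and that character extents are known as (x_start, x_end) pairs; the function names and the box format are hypothetical and are not taken from make_train_images.py.

```python
def make_column_labels(line_width, char_boxes):
    """Build one label per pixel column: 1 inside a character, 0 in a space.

    char_boxes is an assumed format: (x_start, x_end) pairs, end exclusive.
    """
    labels = [0] * line_width
    for x_start, x_end in char_boxes:
        # Clamp each box to the image so out-of-range boxes cannot fail.
        for x in range(max(0, x_start), min(line_width, x_end)):
            labels[x] = 1
    return labels


def invert_labels(labels):
    """Convert to the paper's convention (spaces 1, everything else 0)."""
    return [1 - value for value in labels]


# A 12-pixel-wide line holding two characters separated by a 2-pixel gap.
labels = make_column_labels(12, [(1, 4), (6, 10)])
paper_labels = invert_labels(labels)
print(labels)
print(paper_labels)
```

With this convention the positive class covers the character bodies, and the paper's labels are recovered by inverting the vector.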

architecture of the network

Heuristic Rules for balanced_Binary_CrossEntropy
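The heuristic rules themselves are not reproduced in this text. As general background only, a class-balanced binary cross-entropy is commonly written as below: the positive term is weighted by the fraction of negative labels and the negative term by the fraction of positive labels, so the rarer class is not drowned out. This is a plain-Python sketch of that generic form, not the loss implemented in train_char_seg.py; the function name and the eps parameter are assumptions.

```python
import math


def balanced_bce(labels, probs, eps=1e-7):
    """Generic class-balanced binary cross-entropy (illustrative only).

    labels: list of 0/1 targets; probs: predicted probabilities of class 1.
    """
    n = len(labels)
    if n == 0 or n != len(probs):
        raise ValueError("labels and probs must be non-empty and equal length")
    beta = (n - sum(labels)) / n  # fraction of negative (0) labels
    total = 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1.0 - eps)  # keep log() away from 0
        if y == 1:
            total += -beta * math.log(p)
        else:
            total += -(1.0 - beta) * math.log(1.0 - p)
    return total / n


# One positive among three negatives, with an uninformative prediction.
loss = balanced_bce([1, 0, 0, 0], [0.5, 0.5, 0.5, 0.5])
print(loss)
```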

make training images and labels

python3 make_train_images.py

train

python3 train_char_seg.py

test

python3 test_char_seg.py

other_things

You can either generate the training images first and then train on those saved images, or generate images on the fly while training. To switch between the two, change the lines below in data_generator.py:

enqueuer = GeneratorEnqueuer(generator_on_the_fly(**kwargs), use_multiprocessing=False)
#enqueuer = GeneratorEnqueuer(generator_from_folder(**kwargs), use_multiprocessing=False)
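Instead of commenting lines in and out, the choice between the two generators could be made through a small lookup. The sketch below is self-contained, so generator_on_the_fly and generator_from_folder are simplified stand-ins for the repository's functions, and build_generator, GENERATORS, and the mode names are hypothetical. In the real code the selected generator would be passed to GeneratorEnqueuer as shown above.

```python
import itertools


def generator_on_the_fly(batch_size=2):
    """Stand-in: synthesizes a fresh batch on every step."""
    for step in itertools.count():
        yield [("synthetic", step, i) for i in range(batch_size)]


def generator_from_folder(samples=("a.png", "b.png", "c.png"), batch_size=2):
    """Stand-in: cycles over samples that were generated beforehand."""
    pool = itertools.cycle(samples)
    while True:
        yield [next(pool) for _ in range(batch_size)]


# Hypothetical mode names mapped to the generator factories.
GENERATORS = {
    "on_the_fly": generator_on_the_fly,
    "from_folder": generator_from_folder,
}


def build_generator(mode, **kwargs):
    """Return the generator for the requested mode, or fail clearly."""
    if mode not in GENERATORS:
        raise ValueError("unknown mode: %r" % (mode,))
    return GENERATORS[mode](**kwargs)


gen = build_generator("from_folder", batch_size=2)
first_batch = next(gen)
second_batch = next(gen)
print(first_batch, second_batch)
```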
