Handwritten Number Recognition using CNN and Character Segmentation

Sparsha Saha

Last update: Aug 25, 2022

Related tags

Overview

Handwritten-Number-Recognition-With-Image-Segmentation

Info About this repository

This Repository is aimed at reading handwritten images of numbers and identifying the number written in it using Image Segmentation and Convolutional Neural Networks

Image Segmentation

The image segmentation algorithm uses contour tracing algorithm to separate the characters from a handwritten number. For Example :- 1567 should be separated as 1, 5, 6 and 7.

Digit Recognition

The digit recognition part is done by using a trained Convolutional Neural Network. Details about training and dataset can be found in my repository https://github.com/SparshaSaha/Neural-Nets/blob/master/Coursera%20Handwritten%20digits%20Recognition/Coursera_digit_recog_using_CNN.ipynb . This trained network is used to predict each digit and then ultimately predict the number.

Segmentation API for each number

Segmentation_utilities.py is the utility API for Image Segmentation

Requirements

Python3

TensorFlow

TFlearn

PIL

Scipy

Numpy

Matplotlib

Jupyter-Notebook

Running the project

The project can be run by using the CNN Classifier.ipnb file and providing the name of the image file which contains the image of the number to be classified.

You might also like...

Character Segmentation using TensorFlow

Character Segmentation Segment characters and spaces in one text line,from this paper Chinese English mixed Character Segmentation as Semantic Segment

26 Aug 25, 2022

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

VistaOCR ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data Publications "How to Efficiently Increase Resolutio

ISI Center for Vision, Image, Speech, and Text Analytics

21 Dec 8, 2021

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

OCR Resources This repository contains a collection of resources (including the papers and datasets) of OCR (Optical Character Recognition). Contents

363 Jan 3, 2023

make a better chinese character recognition OCR than tesseract

deep ocr See README_en.md for English installation documentation. 只在ubuntu下面测试通过，需要virtualenv安装，安装路径可自行调整： git clone https://github.com/JinpengLI/deep

1.5k Dec 28, 2022

Provides OCR (Optical Character Recognition) services through web applications

OCR4all As suggested by the name one of the main goals of OCR4all is to allow basically any given user to independently perform OCR on a wide variety

174 Dec 31, 2022

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

75 Oct 21, 2022

Optical character recognition for Japanese text, with the main focus being Japanese manga

Manga OCR Optical character recognition for Japanese text, with the main focus being Japanese manga. It uses a custom end-to-end model built with Tran

327 Jan 1, 2023

Detect handwritten words in a text-line (classic image processing method).

Word segmentation Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is fro

190 Jan 3, 2023

This is used to convert a string to an Image with Handwritten Characters.

Text-to-Handwriting-using-python This is used to convert a string to an Image with Handwritten Characters. text_to_handwriting(string: str, save_to: s

3 Aug 15, 2022

Comments

Can't Running

Running Version 2022/3/25 Python 3.9 PIP 22.0.4 Jupyter 1.0.0

TFlearn 0.5.0 Matplotlib 3.5.1

TensorFlow 2.8.0 Scipy 1.8.0 Numpy 1.22.3

PIL ( it's gone )

Error Code AttributeError: module 'tensorflow' has no attribute 'reset_default_graph' imread is deprecated in SciPy 1.0.0, and will be removed in 1.2.0. imsave is deprecated in SciPy 1.0.0, and will be removed in 1.2.0. AttributeError: module 'scipy.misc' has no attribute 'imread'

opened by bitterysz 0

Handwritten Number Recognition using CNN and Character Segmentation

Related tags

Overview

Handwritten-Number-Recognition-With-Image-Segmentation

Info About this repository

Image Segmentation

Digit Recognition

Segmentation API for each number

Requirements

Running the project

You might also like...

Character Segmentation using TensorFlow

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

make a better chinese character recognition OCR than tesseract

Provides OCR (Optical Character Recognition) services through web applications

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Optical character recognition for Japanese text, with the main focus being Japanese manga

Detect handwritten words in a text-line (classic image processing method).

This is used to convert a string to an Image with Handwritten Characters.

Comments

Can't Running

Owner

Sparsha Saha

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

Text recognition (optical character recognition) with deep learning methods.

Handwritten Text Recognition (HTR) using TensorFlow 2.x

Handwritten Text Recognition (HTR) system implemented with TensorFlow (TF) and trained on the IAM off-line HTR dataset. This Neural Network (NN) model recognizes the text contained in the images of segmented words.

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

OCR software for recognition of handwritten text

Apply different text recognition services to images of handwritten documents.

Use Convolutional Recurrent Neural Network to recognize the Handwritten line text image without pre segmentation into words or characters. Use CTC loss Function to train.

Extract tables from scanned image PDFs using Optical Character Recognition.

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library