Handwritten Number Recognition using CNN and Character Segmentation

Overview

Handwritten-Number-Recognition-With-Image-Segmentation

Info About this repository

This Repository is aimed at reading handwritten images of numbers and identifying the number written in it using Image Segmentation and Convolutional Neural Networks

Image Segmentation

The image segmentation algorithm uses contour tracing algorithm to separate the characters from a handwritten number. For Example :- 1567 should be separated as 1, 5, 6 and 7.

Digit Recognition

The digit recognition part is done by using a trained Convolutional Neural Network. Details about training and dataset can be found in my repository https://github.com/SparshaSaha/Neural-Nets/blob/master/Coursera%20Handwritten%20digits%20Recognition/Coursera_digit_recog_using_CNN.ipynb . This trained network is used to predict each digit and then ultimately predict the number.

Segmentation API for each number

Segmentation_utilities.py is the utility API for Image Segmentation

Requirements

Python3

TensorFlow

TFlearn

PIL

Scipy

Numpy

Matplotlib

Jupyter-Notebook

Running the project

The project can be run by using the CNN Classifier.ipnb file and providing the name of the image file which contains the image of the number to be classified.

You might also like...
Character Segmentation using TensorFlow
Character Segmentation using TensorFlow

Character Segmentation Segment characters and spaces in one text line,from this paper Chinese English mixed Character Segmentation as Semantic Segment

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data
ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

VistaOCR ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data Publications "How to Efficiently Increase Resolutio

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

OCR Resources This repository contains a collection of resources (including the papers and datasets) of OCR (Optical Character Recognition). Contents

make a better chinese character recognition OCR than tesseract
make a better chinese character recognition OCR than tesseract

deep ocr See README_en.md for English installation documentation. 只在ubuntu下面测试通过,需要virtualenv安装,安装路径可自行调整: git clone https://github.com/JinpengLI/deep

Provides OCR (Optical Character Recognition) services through web applications

OCR4all As suggested by the name one of the main goals of OCR4all is to allow basically any given user to independently perform OCR on a wide variety

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

Optical character recognition for Japanese text, with the main focus being Japanese manga
Optical character recognition for Japanese text, with the main focus being Japanese manga

Manga OCR Optical character recognition for Japanese text, with the main focus being Japanese manga. It uses a custom end-to-end model built with Tran

Detect handwritten words in a text-line (classic image processing method).
Detect handwritten words in a text-line (classic image processing method).

Word segmentation Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is fro

This is used to convert a string to an Image with Handwritten Characters.

Text-to-Handwriting-using-python This is used to convert a string to an Image with Handwritten Characters. text_to_handwriting(string: str, save_to: s

Comments
  • Can't Running

    Can't Running

    Running Version 2022/3/25 Python 3.9 PIP 22.0.4 Jupyter 1.0.0

    TFlearn 0.5.0 Matplotlib 3.5.1

    TensorFlow 2.8.0 Scipy 1.8.0 Numpy 1.22.3

    PIL ( it's gone )

    Error Code AttributeError: module 'tensorflow' has no attribute 'reset_default_graph' imread is deprecated in SciPy 1.0.0, and will be removed in 1.2.0. imsave is deprecated in SciPy 1.0.0, and will be removed in 1.2.0. AttributeError: module 'scipy.misc' has no attribute 'imread'

    opened by bitterysz 0
Owner
Sparsha Saha
Software Development Engineer @microsoft
Sparsha Saha
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

awesome-deep-text-detection-recognition A curated list of awesome deep learning based papers on text detection and recognition. Text Detection Papers

null 2.4k Jan 8, 2023
Text recognition (optical character recognition) with deep learning methods.

What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis | paper | training and evaluation data | failure cases and cle

Clova AI Research 3.2k Jan 4, 2023
Handwritten Text Recognition (HTR) using TensorFlow 2.x

Handwritten Text Recognition (HTR) system implemented using TensorFlow 2.x and trained on the Bentham/IAM/Rimes/Saint Gall/Washington offline HTR data

Arthur Flôr 160 Dec 21, 2022
Handwritten Text Recognition (HTR) system implemented with TensorFlow (TF) and trained on the IAM off-line HTR dataset. This Neural Network (NN) model recognizes the text contained in the images of segmented words.

Handwritten-Text-Recognition Handwritten Text Recognition (HTR) system implemented with TensorFlow (TF) and trained on the IAM off-line HTR dataset. T

null 27 Jan 8, 2023
Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Handwritten Text Recognition with TensorFlow Update 2021: more robust model, faster dataloader, word beam search decoder also available for Windows Up

Harald Scheidl 1.5k Jan 7, 2023
OCR software for recognition of handwritten text

Handwriting OCR The project tries to create software for recognition of a handwritten text from photos (also for Czech language). It uses computer vis

Břetislav Hájek 562 Jan 3, 2023
Apply different text recognition services to images of handwritten documents.

Handprint The Handwritten Page Recognition Test is a command-line program that invokes HTR (handwritten text recognition) services on images of docume

Caltech Library 117 Jan 2, 2023
Use Convolutional Recurrent Neural Network to recognize the Handwritten line text image without pre segmentation into words or characters. Use CTC loss Function to train.

Handwritten Line Text Recognition using Deep Learning with Tensorflow Description Use Convolutional Recurrent Neural Network to recognize the Handwrit

sushant097 224 Jan 7, 2023
Extract tables from scanned image PDFs using Optical Character Recognition.

ocr-table This project aims to extract tables from scanned image PDFs using Optical Character Recognition. Install Requirements Tesseract OCR sudo apt

Abhijeet Singh 209 Dec 6, 2022
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

gosseract OCR Golang OCR package, by using Tesseract C++ library. OCR Server Do you just want OCR server, or see the working example of this package?

Hiromu OCHIAI 1.9k Dec 28, 2022