Deep Learning Project – Handwritten Digit Recognition using Python

Free Machine Learning courses with 130+ real-time projects Start Now!!

Python Deep Learning Project

To make machines more intelligent, the developers are diving into machine learning and deep learning techniques. A human learns to perform a task by practicing and repeating it again and again so that it memorizes how to perform the tasks. Then the neurons in his brain automatically trigger and they can quickly perform the task they have learned. Deep learning is also very similar to this. It uses different types of neural network architectures for different types of problems. For example – object recognition, image and sound classification, object detection, image segmentation, etc.

This is the 11th project in the DataFlair’s series of 20 Python projects. I suggest you to bookmark the previous projects:

What is Handwritten Digit Recognition?

The handwritten digit recognition is the ability of computers to recognize human handwritten digits. It is a hard task for the machine because handwritten digits are not perfect and can be made with many different flavors. The handwritten digit recognition is the solution to this problem which uses the image of a digit and recognizes the digit present in the image.

About the Python Deep Learning Project

In this article, we are going to implement a handwritten digit recognition app using the MNIST dataset. We will be using a special type of deep neural network that is Convolutional Neural Networks. In the end, we are going to build a GUI in which you can draw the digit and recognize it straight away.

Prerequisites

The interesting Python project requires you to have basic knowledge of Python programming, deep learning with Keras library and the Tkinter library for building GUI.

Install the necessary libraries for this project using this command:

pip install numpy, tensorflow, keras, pillow,

The MNIST dataset

This is probably one of the most popular datasets among machine learning and deep learning enthusiasts. The MNIST dataset contains 60,000 training images of handwritten digits from zero to nine and 10,000 images for testing. So, the MNIST dataset has 10 different classes. The handwritten digits images are represented as a 28×28 matrix where each cell contains grayscale pixel value.

Download the full source code for the project

Building Python Deep Learning Project on Handwritten Digit Recognition

Below are the steps to implement the handwritten digit recognition project:

1. Import the libraries and load the dataset

First, we are going to import all the modules that we are going to need for training our model. The Keras library already contains some datasets and MNIST is one of them. So we can easily import the dataset and start working with it. The mnist.load_data() method returns us the training data, its labels and also the testing data and its labels.

import keras
from keras.datasets import mnist
from keras.models import Sequential
from keras.layers import Dense, Dropout, Flatten
from keras.layers import Conv2D, MaxPooling2D
from keras import backend as K

# the data, split between train and test sets
(x_train, y_train), (x_test, y_test) = mnist.load_data()

print(x_train.shape, y_train.shape)

2. Preprocess the data

The image data cannot be fed directly into the model so we need to perform some operations and process the data to make it ready for our neural network. The dimension of the training data is (60000,28,28). The CNN model will require one more dimension so we reshape the matrix to shape (60000,28,28,1).

x_train = x_train.reshape(x_train.shape[0], 28, 28, 1)
x_test = x_test.reshape(x_test.shape[0], 28, 28, 1)
input_shape = (28, 28, 1)

# convert class vectors to binary class matrices
y_train = keras.utils.to_categorical(y_train, num_classes)
y_test = keras.utils.to_categorical(y_test, num_classes)

x_train = x_train.astype('float32')
x_test = x_test.astype('float32')
x_train /= 255
x_test /= 255
print('x_train shape:', x_train.shape)
print(x_train.shape[0], 'train samples')
print(x_test.shape[0], 'test samples')

3. Create the model

Now we will create our CNN model in Python data science project. A CNN model generally consists of convolutional and pooling layers. It works better for data that are represented as grid structures, this is the reason why CNN works well for image classification problems. The dropout layer is used to deactivate some of the neurons and while training, it reduces offer fitting of the model. We will then compile the model with the Adadelta optimizer.

batch_size = 128
num_classes = 10
epochs = 10

model = Sequential()
model.add(Conv2D(32, kernel_size=(3, 3),activation='relu',input_shape=input_shape))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))
model.add(Flatten())
model.add(Dense(256, activation='relu'))
model.add(Dropout(0.5))
model.add(Dense(num_classes, activation='softmax'))

model.compile(loss=keras.losses.categorical_crossentropy,optimizer=keras.optimizers.Adadelta(),metrics=['accuracy'])

4. Train the model

The model.fit() function of Keras will start the training of the model. It takes the training data, validation data, epochs, and batch size.

It takes some time to train the model. After training, we save the weights and model definition in the ‘mnist.h5’ file.

hist = model.fit(x_train, y_train,batch_size=batch_size,epochs=epochs,verbose=1,validation_data=(x_test, y_test))
print("The model has successfully trained")

model.save('mnist.h5')
print("Saving the model as mnist.h5")

5. Evaluate the model

We have 10,000 images in our dataset which will be used to evaluate how good our model works. The testing data was not involved in the training of the data therefore, it is new data for our model. The MNIST dataset is well balanced so we can get around 99% accuracy.

score = model.evaluate(x_test, y_test, verbose=0)
print('Test loss:', score[0])
print('Test accuracy:', score[1])

6. Create GUI to predict digits

Now for the GUI, we have created a new file in which we build an interactive window to draw digits on canvas and with a button, we can recognize the digit. The Tkinter library comes in the Python standard library. We have created a function predict_digit() that takes the image as input and then uses the trained model to predict the digit.

Then we create the App class which is responsible for building the GUI for our app. We create a canvas where we can draw by capturing the mouse event and with a button, we trigger the predict_digit() function and display the results.

Here’s the full code for our gui_digit_recognizer.py file:

from keras.models import load_model
from tkinter import *
import tkinter as tk
import win32gui
from PIL import ImageGrab, Image
import numpy as np

model = load_model('mnist.h5')

def predict_digit(img):
    #resize image to 28x28 pixels
    img = img.resize((28,28))
    #convert rgb to grayscale
    img = img.convert('L')
    img = np.array(img)
    #reshaping to support our model input and normalizing
    img = img.reshape(1,28,28,1)
    img = img/255.0
    #predicting the class
    res = model.predict([img])[0]
    return np.argmax(res), max(res)

class App(tk.Tk):
    def __init__(self):
        tk.Tk.__init__(self)

        self.x = self.y = 0

        # Creating elements
        self.canvas = tk.Canvas(self, width=300, height=300, bg = "white", cursor="cross")
        self.label = tk.Label(self, text="Thinking..", font=("Helvetica", 48))
        self.classify_btn = tk.Button(self, text = "Recognise", command =         self.classify_handwriting) 
        self.button_clear = tk.Button(self, text = "Clear", command = self.clear_all)

        # Grid structure
        self.canvas.grid(row=0, column=0, pady=2, sticky=W, )
        self.label.grid(row=0, column=1,pady=2, padx=2)
        self.classify_btn.grid(row=1, column=1, pady=2, padx=2)
        self.button_clear.grid(row=1, column=0, pady=2)

        #self.canvas.bind("<Motion>", self.start_pos)
        self.canvas.bind("<B1-Motion>", self.draw_lines)

    def clear_all(self):
        self.canvas.delete("all")

    def classify_handwriting(self):
        HWND = self.canvas.winfo_id() # get the handle of the canvas
        rect = win32gui.GetWindowRect(HWND) # get the coordinate of the canvas
        im = ImageGrab.grab(rect)

        digit, acc = predict_digit(im)
        self.label.configure(text= str(digit)+', '+ str(int(acc*100))+'%')

    def draw_lines(self, event):
        self.x = event.x
        self.y = event.y
        r=8
        self.canvas.create_oval(self.x-r, self.y-r, self.x + r, self.y + r, fill='black')

app = App()
mainloop()

Screenshots:

Summary

In this article, we have successfully built a Python deep learning project on handwritten digit recognition app. We have built and trained the Convolutional neural network which is very effective for image classification purposes. Later on, we build the GUI where we draw a digit on the canvas then we classify the digit and show the results.

Want to get hired as a Python expert? Practice the 150+ Python Interview Questions by DataFlair

Do share your views regarding the intermediate Python project in the comment section.

If you are Happy with DataFlair, do not forget to make us happy with your positive feedback on Google

Tags: deep learning project Handwritten digit recognition learning python project project based on python python deep learning project Python project python project example python project idea

uzmah says:
May 26, 2021 at 6:14 pm
ModuleNotFoundError: No module named ‘keras’
how to link that .h5 file in python
Reply
- Ritika Budhiraja says:
  July 14, 2021 at 1:35 am
  Same problem
  Reply
  - Pranav says:
    November 9, 2021 at 3:33 pm
    what to do now? somebody help
    Reply
- DataFlair says:
  November 10, 2021 at 5:52 pm
  You got this error becase the ‘Keras’ module is not installed. You can use the command ‘pip install keras’ in the promt. Hope this helps!
  Reply
kn187 says:
May 31, 2021 at 8:03 pm
Hi! Recently I downloaded your Handwritten Digit Recognition python project, but when I run it, it never recognizes digit correctly (f.e., when I draw a 6, it says that it’s a 2 with ~70% accuracy). I was wondering if anyone else had this problem, as I’m trying something similar for a college project, and if you know where the problem might be.
Thanks in advance!
Reply
- arjun says:
  June 16, 2021 at 11:31 pm
  same problem with me also
  go and search in GitHub for this project, I got it and it’s working fine for me
  Reply
  - aashna says:
    July 17, 2021 at 1:47 pm
    can you share the link of the github where you find the project
    thanks
    Reply
- aslam says:
  July 19, 2021 at 4:19 pm
  try scaling your training data
  Reply
- Justin says:
  August 16, 2021 at 6:00 am
  For anyone else, please read this!
  To put it simply, the image thats hand drawn (by you), is inverted, the training data has BG of black, and drawing of white, your drawing has BG of white and writing of black,
  Change import on line 5 to: from PIL import ImageGrab, ImageOps
  and function predict digit to:
  def predict_digit(img):
  #resize image to 28×28 pixels
  img = img.resize((28,28))
  #convert rgb to grayscale
  img = img.convert(‘L’)
  img = ImageOps.invert(img)
  img = np.array(img)
  #reshaping to support our model input and normalizing
  img = img.reshape(1,28,28,1)
  img = img/255.0
  #predicting the class
  res = model.predict([img])[0]
  return np.argmax(res), max(res)
  Basically adding 1 line of code on line 14/15, img = ImageOps.invert(img) this inverts the image
  Hope this helps!
  Reply
  - gatsu says:
    May 7, 2022 at 6:29 pm
    thanks dude, this works, everyone should add that single line of code
    Reply
Bui Le Ngoc Min says:
June 3, 2021 at 11:21 pm
I have run this model, I very disapointed, this model can’t recognition exactly my handwrite :((((
Reply
Bui Le Ngoc Min says:
June 4, 2021 at 12:44 am
I find the problem, I add this line before predict: img = 1 – img
It work very well
Reply
- arjun says:
  June 14, 2021 at 9:54 pm
  can you tell me which version of python and TensorFlow you used?
  Reply
- narendra says:
  May 20, 2022 at 3:59 pm
  bro is it worked
  Reply
Uzmah says:
June 4, 2021 at 11:55 am
For every hand drawn digit it is showing zero no other digit.
Plzz help to resolve tbis issue its important.
Reply
- Yogesh says:
  June 13, 2021 at 1:53 am
  Yup I’m having the same. No luck so far…
  Reply
Carina says:
June 7, 2021 at 1:45 pm
I have the seem issue with you, look forward the answer. If you have found the solution, could you please share with me?
Reply
kn187 says:
June 9, 2021 at 4:00 pm
@Bui Le Ngoe Min hi, can you send me your solution. It still doesn’t work for me with “img = 1 – img”
Reply
Allwyn Vincent says:
June 12, 2021 at 3:12 pm
Hello, I tried the source code, but when I ran the GUI part it gives an error saying “ModuleNotFoundError: No module named ‘win32gui’ “
Reply
- Yogesh says:
  June 13, 2021 at 1:31 am
  just use pip install pywin32
  Reply
- DataFlair says:
  November 10, 2021 at 5:53 pm
  You can install the pywin32 module to solve this issue. For this, you ca sue the command ‘pip install pywin32’. Hope this helps!
  Reply
  - nugie says:
    February 2, 2022 at 9:43 pm
    Hello, I’m using Ubuntu 18.04, when I tried the “pip install pywin32” command I got the following error:
    ERROR: Could not find a version that satisfies the requirement pywin32 (from versions: none)
    ERROR: No matching distribution found for pywin32
    Any solution for this? Thanks a lot for a great tutorial!
    Reply
Vamsi Krishna says:
June 15, 2021 at 10:21 am
it is always showing 2 with different accuracy values
Reply
uzmah says:
June 15, 2021 at 5:27 pm
same issue
Reply
uzmah says:
June 15, 2021 at 5:33 pm
please check like this we add this line …. it’s really urgent!!!!
before this???
img=1-img
def predict_digit(img):
#resize image to 28×28 pixels
img = img.resize((28,28))
#convert rgb to grayscale
img = img.convert(‘L’)
img = np.array(img)
#reshaping to support our model input and normalizing
img = img.reshape(1,28,28,1)
img = img/255.0
#predicting the class
res = model.predict([img])[0]
return np.argmax(res), max(res)
Reply
Mamta says:
July 12, 2021 at 8:41 pm
Very good information.
Reply
Ritika Budhiraja says:
July 14, 2021 at 1:37 am
ModuleNotFoundError: No module named ‘keras’
how to link that .h5 file in python
Please tell me how to import the module, I am unable to run the code!!
Reply
- Justin says:
  August 16, 2021 at 5:57 am
  from tensorflow import keras
  from keras.datasets import mnist
  from keras.models import Sequential
  from keras.layers import Dense, Dropout, Flatten, Conv2D, MaxPooling2D
  all needed imports for training.py
  Reply
SHAFIN JUNAYED says:
July 29, 2021 at 12:01 am
I am facing this error ” module ‘keras.utils.generic_utils’ has no attribute “. will anyone help me out?
Reply
Nguyen Tuan Dat says:
August 9, 2021 at 3:05 pm
Why???
“C:\Users\DAT_PC\anaconda3\envs\Handwritten digit recognizer\python.exe” “D:/Cong viec/Hoc tap/Hoc SPKTHY/Thay Hoan/4 Image Processing/Bai tap/Handwritten digit recognizer/train_digit_recognizer.py”
(60000, 28, 28) (60000,)
x_train shape: (60000, 28, 28, 1)
60000 train samples
10000 test samples
2021-08-09 16:32:45.919014: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-08-09 16:32:45.921326: I tensorflow/core/common_runtime/process_util.cc:146] Creating new thread pool with default inter op setting: 2. Tune using inter_op_parallelism_threads for best performance.
2021-08-09 16:32:46.138644: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:176] None of the MLIR Optimization Passes are enabled (registered 2)
Epoch 1/10
469/469 [==============================] – 28s 56ms/step – loss: 2.3054 – accuracy: 0.1000 – val_loss: 2.2957 – val_accuracy: 0.1011
Epoch 2/10
469/469 [==============================] – 26s 56ms/step – loss: 2.2952 – accuracy: 0.1102 – val_loss: 2.2854 – val_accuracy: 0.1017
Epoch 3/10
469/469 [==============================] – 31s 66ms/step – loss: 2.2861 – accuracy: 0.1230 – val_loss: 2.2750 – val_accuracy: 0.1052
Epoch 4/10
469/469 [==============================] – 30s 63ms/step – loss: 2.2764 – accuracy: 0.1369 – val_loss: 2.2639 – val_accuracy: 0.1136
Epoch 5/10
469/469 [==============================] – 27s 57ms/step – loss: 2.2667 – accuracy: 0.1486 – val_loss: 2.2518 – val_accuracy: 0.1296
Epoch 6/10
469/469 [==============================] – 27s 58ms/step – loss: 2.2557 – accuracy: 0.1612 – val_loss: 2.2382 – val_accuracy: 0.1483
Epoch 7/10
469/469 [==============================] – 26s 56ms/step – loss: 2.2435 – accuracy: 0.1764 – val_loss: 2.2231 – val_accuracy: 0.1685
Epoch 8/10
469/469 [==============================] – 27s 57ms/step – loss: 2.2302 – accuracy: 0.1907 – val_loss: 2.2059 – val_accuracy: 0.1919
Epoch 9/10
469/469 [==============================] – 26s 55ms/step – loss: 2.2156 – accuracy: 0.2042 – val_loss: 2.1867 – val_accuracy: 0.2298
Epoch 10/10
469/469 [==============================] – 26s 55ms/step – loss: 2.1998 – accuracy: 0.2178 – val_loss: 2.1653 – val_accuracy: 0.2797
The model has successfully trained
Test loss: 2.165282964706421
Test accuracy: 0.27970001101493835
Saving the model as mnist.h5
Process finished with exit code 0
Reply
Lukas says:
August 10, 2021 at 2:04 am
AttributeError Traceback (most recent call last)
in
18
19 # convert class vectors to binary class matrices
—> 20 y_train = keras.utils.to_categorical(y_train, num_classes)
21 y_test = keras.utils.to_categorical(y_test, num_classes)
22
AttributeError: module ‘keras.utils’ has no attribute ‘to_categorical’
I am getting this error message, not sure how to fix. Can anyone help
Reply
- DataFlair says:
  November 10, 2021 at 5:54 pm
  This problem might occurs in latest versions of Keras. To solve this problem, you can import using the line from keras.utils import np_utils or from keras import utils as np_utils. And then replace keras.utils.to_categorical with keras.utils.np_utils.to_categorical.
  Reply
Justin says:
August 16, 2021 at 5:53 am
The code for opening the model is wrong!!!!
def predict_digit(img):
#resize image to 28×28 pixels
img = img.resize((28,28))
#convert rgb to grayscale
img = img.convert(‘L’)
img = ImageOps.invert(img)
img = np.array(img)
#reshaping to support our model input and normalizing
img = img.reshape(1,28,28,1)
img = img/255.0
#predicting the class
res = model.predict([img])[0]
return np.argmax(res), max(res)
Use this instead!!
(Import statement) from PIL import ImageGrab, ImageOps
Reason: The image is inverted, the black is white and the white is black, in the training data the writing is white, but in actual testing with real hand writing, the writing is black, which is confusing the neural network!
Reply
- Rajeshwari says:
  December 13, 2022 at 8:25 pm
  when I run it, it never recognizes digit correctly .. please help
  Reply
Jens Nordmark says:
September 14, 2021 at 1:14 pm
Apart from the issue with inverting the image, the learning parameters differ significantly between the code in the Google drive repository and the text on this webpage. Check that out in order to get better results.
Reply
tho says:
January 21, 2022 at 10:59 am
should change the classify_handwriting method:
def classify_handwriting(self):
HWND = self.canvas.winfo_id() # get the handle of the canvas
rect = win32gui.GetWindowRect(HWND) # get the coordinate of the canvas
a,b,c,d = rect
rect=(a+4,b+4,c+100,d+100)
im = ImageGrab.grab(rect)
digit, acc = predict_digit(im)
self.label.configure(text= str(digit)+’, ‘+ str(int(acc*100))+’%’)
Reply
nugie says:
February 2, 2022 at 9:44 pm
Hello, I’m using Ubuntu 18.04, when I tried the “pip install pywin32” command I got the following error:
ERROR: Could not find a version that satisfies the requirement pywin32 (from versions: none)
ERROR: No matching distribution found for pywin32
Any solution for this? Thanks a lot for a great tutorial!
Reply
Pankaj Sharma says:
February 3, 2022 at 8:35 pm
Could not find “from keras.optimizers import Adadelta” what should i do for this ?
Reply
- Bin Mohammed Alahmady says:
  March 7, 2022 at 2:57 am
  The same proplem with my
  Reply
Prajanya Sahu says:
April 8, 2022 at 12:18 pm
what should i do
AttributeError Traceback (most recent call last)
d:\COSING VSC C\gp1.ipynb Cell 2′ in ()
3 input_shape = (28, 28, 1)
5 # convert class vectors to binary class matrices
—-> 6 y_train = keras.utils.to_categorical(y_train, num_classes)
7 y_test = keras.utils.to_categorical(y_test, num_classes)
9 x_train = x_train.astype(‘float32’)
AttributeError: module ‘keras.utils’ has no attribute ‘to_categorical’
Reply
Owais Imtiyaz Mirajkar says:
April 13, 2022 at 4:45 pm
This should work put this at the start in train_digit
” import tensorflow as tf
from tensorflow import keras “
Reply
vijay says:
May 1, 2022 at 4:16 pm
I am getting just display…with clear and recognize command. not even space for drawing…Just tk diaply is opening .
I have no errors, but still not getting output.
shows msg like this in the end:
2022-05-01 16:12:57.705049: I tensorflow/core/platform/cpu_feature_guard.cc:151] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX AVX2
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
Pls someone help in solving this.
tried on Visual studio code.
Reply
vishwajeet says:
May 17, 2022 at 8:35 pm
error showing win32gui is not run
Reply
SIROJIDDIN says:
May 20, 2022 at 9:09 am
GUI PACKAGE IS NOT INSTALLED
PLEASE HELP
HOW TO INSTALL
Reply
dineshgupta says:
May 28, 2022 at 9:17 pm
Hi All,
I am running the provided sample on the Oracle Linux8 server & I am getting the below issue
`(.venv) [dinesh@localhost handwritten-digit-recognition]$ /home/dinesh/pythonp/.venv/bin/python /home/dinesh/pythonp/handwritten-digit-recognition/gui-digit-recognizer.py
2022-05-28 21:08:26.460381: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library ‘libcudart.so.11.0’; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory
2022-05-28 21:08:26.460414: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
Traceback (most recent call last):
File “/home/dinesh/pythonp/handwritten-digit-recognition/gui-digit-recognizer.py”, line 2, in
from tkinter import *
ModuleNotFoundError: No module named ‘tkinter’`
Please help me on this
Reply
Mohan das says:
June 14, 2022 at 10:27 pm
I’m getting this NameError on line no 6 of Preprocess the data;
num_classes is not defined
could someone help me over this please
Reply
- sarthak ray says:
  February 2, 2023 at 1:34 pm
  same problem bro help us
  Reply
adarsh srivastava says:
October 16, 2022 at 2:23 am
No file or directory found at mnist.h5
how to fix this problem?
Reply
Smriti Upadhyay says:
December 22, 2022 at 2:18 pm
hey which algorithm is used here?
Reply

Deep Learning Project – Handwritten Digit Recognition using Python

What is Handwritten Digit Recognition?

About the Python Deep Learning Project

Prerequisites

The MNIST dataset

Building Python Deep Learning Project on Handwritten Digit Recognition

1. Import the libraries and load the dataset

2. Preprocess the data

3. Create the model

4. Train the model

5. Evaluate the model

6. Create GUI to predict digits

Summary

163 Responses

Leave a Reply Cancel reply

About DataFlair

Trending Courses

Trending Data Science Courses

Free Big Data Courses

Trending Programming Courses

Trending Data Science Tutorials

Trending Projects

Trending Programming Tutorials

Trending Tutorials