
sign-----text

A convolutional neural network based project that converts hand gestures to the English alphabet and words as per the American Sign Language standard.



A very simple CNN (Convolutional Neural Network) project.

Introduction

This is a simple CNN project that helps its users convert sign language directly into text. We used different gestures that can be converted into text on the basis of the standard American Sign Language. The model is trained with Keras. The project also supports word identification and working mathematical operations. The major technologies used are image processing and CNNs (Convolutional Neural Networks).

Sign Language

Sign language is often the only medium of communication for people with hearing and speech disabilities. Even when they need to present their thoughts or ideas to another individual, they do so using gestures. These may not be understood accurately and efficiently by that individual, which can result in misunderstandings that lead to greater problems.

What we did here

  1. At first, we created 44 gesture samples using OpenCV. For each gesture we captured 1200 images, each 50x50 pixels. All these images are in grayscale and are stored in the gestures folder. The pictures were then flipped using flip_images.py. This script flips every image along the vertical axis (see the sketch after this list), so each gesture ends up with 2400 images.

  2. While creating this project, we learned what a CNN is and how it works. The best resources were Tensorflow's official website and machinelearningmastery.com.

  3. Created a CNN that looks a lot like the MNIST classifying model, using both Tensorflow and Keras. This project is easily editable, and one can add more gestures.

  4. Then we used the model that was trained with Keras.

  5. As of today, we have stored 44 gestures: the 26 alphabets and 10 numbers of American Sign Language, plus some other gestures. The model was trained on these images.
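
For reference, here is a minimal sketch of what the flip step can look like. It assumes a gestures/<gesture_id>/<n>.jpg layout; the exact flip_images.py in the repo may differ:

    import cv2
    import os

    GESTURES_DIR = 'gestures'  # assumed layout: gestures/<gesture_id>/<n>.jpg

    # Mirror every stored image along the vertical axis, doubling each
    # gesture's sample count from 1200 to 2400.
    for gesture_id in os.listdir(GESTURES_DIR):
        folder = os.path.join(GESTURES_DIR, gesture_id)
        images = os.listdir(folder)
        count = len(images)
        for i, name in enumerate(images):
            img = cv2.imread(os.path.join(folder, name), cv2.IMREAD_GRAYSCALE)
            flipped = cv2.flip(img, 1)  # flip code 1 = around the vertical axis
            cv2.imwrite(os.path.join(folder, str(count + i + 1) + '.jpg'), flipped)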

Requirements

  1. Python 3.x
  2. Tensorflow 1.5
  3. Keras
  4. OpenCV 3.4
  5. h5py
  6. pyttsx3
  7. A good CPU (preferably with a GPU).
  8. A background that contrasts with your hand (for better results)

Installing the requirements

  1. Start your terminal or cmd, depending on your OS.
  2. If you have an Nvidia GPU, make sure you have the prerequisites for the Tensorflow GPU installation, then use this command:

    pip install -r requirements_gpu.txt

  3. If you do not have a GPU, use this command:

    pip install -r requirements_cpu.txt

Creating a gesture

  1. Watch the video guide for setting the hand histogram here.
  2. First set your hand histogram. You do not need to do it again if you have already done it, but you do need to redo it if the lighting conditions change. To do so, type the command given below and follow the instructions. (A sketch of how such a histogram step can work follows this list.)

    python set_hand_hist.py

  • A window called "Set hand histogram" will appear.
  • "Set hand histogram" will have 50 squares (5x10).
  • Put your hand in those squares. Make sure your hand covers all the squares.
  • Press 'c'. Another window, "Thresh", will appear.
  • On pressing 'c', only white patches corresponding to the parts of the image that have your skin color should appear on the "Thresh" window.
  • Make sure all the squares are covered by your hand.
  • If you are not successful, move your hand a little and press 'c' again. Repeat until you get a good histogram.
  • After you get a good histogram, press 's' to save it. All the windows will close.
  3. We have already added 44 gestures (0-43). It is up to the user whether to add even more gestures or replace the existing ones, so this step is OPTIONAL. To create your own gestures or replace the given ones, use the command given below. On starting the program, you will have to enter the gesture number and gesture name/text. An OpenCV window called "Capturing gestures" will then appear. In the webcam feed you will see a green box (inside which you will have to make your gesture) and a counter that counts the number of pictures stored.

    python create_gestures.py

  4. Press 'c' when you are ready with your gesture. Capturing will begin after a few seconds. Move your hand around a little. You can pause capturing by pressing 'c' and resume it by pressing 'c' again; capturing resumes after a few seconds. After the counter reaches 1200, the window closes automatically. (A hypothetical sketch of this capture loop also follows this list.)

    After capturing all the gestures you can flip the images using

    python flip_images.py

  5. When you are done adding new gestures, run the load_images.py file once. You do not need to run this file again unless you add a new gesture.

    python load_images.py
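
For the curious, here is one plausible way the histogram step from step 2 can work internally. This is a sketch of the standard OpenCV back-projection technique, not necessarily the exact code in set_hand_hist.py:

    import cv2
    import numpy as np

    # Build an HSV colour histogram from the pixels inside the sampling
    # squares, then back-project it onto each frame so skin-coloured regions
    # show up as white patches (the "Thresh" window).
    def build_histogram(hand_pixels_bgr):
        hsv = cv2.cvtColor(hand_pixels_bgr, cv2.COLOR_BGR2HSV)
        hist = cv2.calcHist([hsv], [0, 1], None, [180, 256], [0, 180, 0, 256])
        return cv2.normalize(hist, hist, 0, 255, cv2.NORM_MINMAX)

    def threshold_frame(frame_bgr, hist):
        hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
        back_proj = cv2.calcBackProject([hsv], [0, 1], hist, [0, 180, 0, 256], 1)
        disc = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (10, 10))
        back_proj = cv2.filter2D(back_proj, -1, disc)  # smooth the back-projection
        _, thresh = cv2.threshold(back_proj, 150, 255, cv2.THRESH_BINARY)
        return thresh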
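And a hypothetical sketch of the capture loop from step 4, assuming a fixed green-box position and plain grayscale crops (the real create_gestures.py also asks for a gesture name and may store thresholded images instead):

    import cv2
    import os

    # Hypothetical capture loop: grab frames, crop the green-box region, and
    # save 50x50 grayscale images until 1200 are stored.
    gesture_id = input('Enter gesture number: ')
    folder = os.path.join('gestures', gesture_id)
    os.makedirs(folder, exist_ok=True)

    cam = cv2.VideoCapture(0)
    x, y, w, h = 300, 100, 300, 300   # green-box position; actual values may differ
    count, capturing = 0, False
    while count < 1200:
        ok, frame = cam.read()
        if not ok:
            break
        roi = cv2.cvtColor(frame[y:y+h, x:x+w], cv2.COLOR_BGR2GRAY)
        cv2.rectangle(frame, (x, y), (x+w, y+h), (0, 255, 0), 2)
        cv2.imshow('Capturing gestures', frame)
        if capturing:
            count += 1
            cv2.imwrite(os.path.join(folder, str(count) + '.jpg'),
                        cv2.resize(roi, (50, 50)))
        if cv2.waitKey(10) == ord('c'):   # 'c' toggles pause/resume, as described above
            capturing = not capturing
    cam.release()
    cv2.destroyAllWindows()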

Displaying all gestures

  1. To see all the gestures that are stored in the 'gestures/' folder, run this command:

    python display_all_gestures.py
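
A rough sketch of how such a display script can tile the samples, assuming the gesture folders are named 0-43 so that 44 images split evenly into rows of 11:

    import cv2
    import os
    import numpy as np

    # Take one 50x50 sample per gesture folder and tile them into one grid.
    samples = []
    for gesture_id in sorted(os.listdir('gestures'), key=int):  # folders assumed 0-43
        folder = os.path.join('gestures', gesture_id)
        first = sorted(os.listdir(folder))[0]
        samples.append(cv2.imread(os.path.join(folder, first), cv2.IMREAD_GRAYSCALE))

    cols = 11                                   # 44 gestures -> 4 rows of 11
    rows = [np.hstack(samples[i:i+cols]) for i in range(0, len(samples), cols)]
    grid = np.vstack(rows)
    cv2.imshow('All gestures', grid)
    cv2.waitKey(0)
    cv2.destroyAllWindows()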

Training a model

  1. Training can be done with either Tensorflow or Keras. To train with Tensorflow, run the cnn_tf.py file; to train with Keras, run the cnn_keras.py file.

    python cnn_tf.py
    python cnn_keras.py

    1. If you use Tensorflow, you will have the checkpoints and the metagraph file in the tmp/cnn_model3 folder.
    2. If you use Keras, you will have the model in the root directory by the name cnn_model_keras2.h5.

You do not need to retrain your model every time; you only need to retrain it if you add or remove a gesture.
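
For orientation, here is a hedged sketch of an MNIST-style Keras classifier matching the setup described above (50x50 grayscale input, 44 classes). The layer sizes are assumptions, not the repo's exact architecture:

    from keras.models import Sequential
    from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout

    # A small MNIST-style CNN: two conv/pool stages, then dense layers.
    model = Sequential([
        Conv2D(32, (3, 3), activation='relu', input_shape=(50, 50, 1)),
        MaxPooling2D((2, 2)),
        Conv2D(64, (3, 3), activation='relu'),
        MaxPooling2D((2, 2)),
        Flatten(),
        Dense(128, activation='relu'),
        Dropout(0.5),
        Dense(44, activation='softmax'),   # one output per stored gesture
    ])
    model.compile(optimizer='adam', loss='categorical_crossentropy',
                  metrics=['accuracy'])
    # model.fit(train_images, train_labels, epochs=..., batch_size=...)
    model.save('cnn_model_keras2.h5')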

Get model reports

  1. To get the classification reports about the model, make sure you have the test_images and test_labels files generated by load_images.py. If you do not have them, run load_images.py again. Then run this file:

    python get_model_reports.py

  2. You will get the confusion matrix, F-scores, precision, and recall for the predictions made by the model.
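
A sketch of how such a report script can work, assuming test_images and test_labels are pickled arrays produced by load_images.py and using scikit-learn for the metrics (an assumption; it is not in the requirements list):

    import pickle
    import numpy as np
    from keras.models import load_model
    from sklearn.metrics import classification_report, confusion_matrix

    # Load the pickled test split (file names assumed from load_images.py).
    with open('test_images', 'rb') as f:
        test_images = np.array(pickle.load(f)).reshape(-1, 50, 50, 1)
    with open('test_labels', 'rb') as f:
        test_labels = np.array(pickle.load(f))

    model = load_model('cnn_model_keras2.h5')
    predictions = np.argmax(model.predict(test_images), axis=1)
    print(confusion_matrix(test_labels, predictions))
    print(classification_report(test_labels, predictions))  # precision, recall, F-scores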

Testing gestures

  1. First set your hand histogram. Watch the video guide for setting the hand histogram here. You do not need to do it again if you have already done it, but you do need to redo it if the lighting conditions change. To do so, type the command given below and follow the instructions.

    python set_hand_hist.py

    • A window called "Set hand histogram" will appear.
    • "Set hand histogram" will have 50 squares (5x10).
    • Put your hand in those squares. Make sure your hand covers all the squares.
    • Press 'c'. Another window, "Thresh", will appear.
    • On pressing 'c', only white patches corresponding to the parts of the image that have your skin color should appear on the "Thresh" window.
    • Make sure all the squares are covered by your hand.
    • If you are not successful, move your hand a little and press 'c' again. Repeat until you get a good histogram.
    • After you get a good histogram, press 's' to save it. All the windows will close.
  2. For recognition, start the recognize_gesture.py file.

    python recognize_gesture.py

  3. You will have a small green box inside which you need to make your gestures.
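
The core classification step can be sketched as follows, reusing the 50x50 grayscale format and the Keras model from the training step. The pre-processing in the actual recognize_gesture.py may differ:

    import cv2
    import numpy as np
    from keras.models import load_model

    model = load_model('cnn_model_keras2.h5')

    # Classify one thresholded hand image cropped from the green box.
    def predict_gesture(thresh_img):
        img = cv2.resize(thresh_img, (50, 50))
        batch = img.reshape(1, 50, 50, 1).astype(np.float32) / 255.0
        probs = model.predict(batch)[0]
        return int(np.argmax(probs)), float(np.max(probs))  # (gesture id, confidence)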

Using fun_util.py

Here is where you will have all the fun.

  1. First set your hand histogram. You do not need to do it again if you have already done it, but you do need to redo it if the lighting conditions change. To do so, type the command given below and follow the instructions.

    python set_hand_hist.py
  • A window called "Set hand histogram" will appear.
  • "Set hand histogram" will have 50 squares (5x10).
  • Put your hand in those squares.
  • Press 'c'. Two other windows, "res" and "Thresh", will appear.
  • On pressing 'c', only the parts of the image that have your skin color should appear on the "res" window; white patches corresponding to them should appear on the "Thresh" window.
  • If you are not successful, move your hand a little and press 'c' again. Repeat until you get a good histogram.
  • After you get a good histogram, press 's' to save it. All the windows will close.
  2. Start the file.

    python fun_util.py

Text Mode (Press 't' to go to text mode)

  1. In text mode you can create your own words using fingerspelling, or use the predefined gestures.
  2. The text on screen will be converted to speech when you remove your hand from the green box. (A minimal speech sketch follows this list.)
  3. Make sure you keep the same gesture in the green box for 15 frames, or the gesture will not be converted to text.
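
The speech step presumably uses pyttsx3 from the requirements list; here is a minimal usage sketch (the actual wiring in fun_util.py may differ):

    import pyttsx3

    # pyttsx3 synthesizes speech offline, with no network call.
    engine = pyttsx3.init()

    def speak(text):
        engine.say(text)
        engine.runAndWait()

    speak('hello world')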

Calculator Mode (Press 'c' to go to calculator mode)

  1. To confirm a digit make sure you keep the same gesture for 20 frames. On successful confirmation, the number will appear in the vertical center of the black part of the window.
  2. To confirm a number, make the "best of luck" gesture and keep it in the green box for 25 frames. You will get used to the timing :P.
  3. You can have any number of digits for both first number and second number.
  4. Currently there are 10 operators.
  5. During operator selection: 1 means '+', 2 means '-', 3 means '*', 4 means '/', 5 means '%', 6 means '**' (power), 7 means '>>' (right shift), 8 means '<<' (left shift), 9 means '&' (bitwise AND), and 0 means '|' (bitwise OR).
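
For illustration, the operator table above maps cleanly onto Python's operator module. This is a hypothetical helper, not code from the repo:

    import operator

    # Digit -> operator, matching the table above.
    OPERATORS = {
        1: operator.add, 2: operator.sub, 3: operator.mul, 4: operator.truediv,
        5: operator.mod, 6: operator.pow, 7: operator.rshift, 8: operator.lshift,
        9: operator.and_, 0: operator.or_,
    }

    def calculate(first, op_digit, second):
        return OPERATORS[op_digit](first, second)

    print(calculate(7, 1, 3))  # 7 + 3 -> 10

Using a lookup of operator functions avoids building and eval-ing an expression string from user-controlled input.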

Conclusion

A fully working sign-to-text translator has been created. The translator can be made more effective by adding more gestures. Although the system fails under fast hand motion, it works well for the 44 uploaded gestures at a normal pace.

Shreyash Gupta
Visal Ahmad
Aditi Sharma
Vaidik Jhawar
