Image to Text Converter (OCR)

A simple open-source Optical Character Recognition (OCR) system that extracts text from images without relying on external OCR APIs.

Project Description: Image to Text Converter (FOSS Hackathon)

Overview

This project is an open-source Optical Character Recognition (OCR) system that extracts text from images without relying on external OCR engines or APIs such as Tesseract. It was built for the FOSS Hackathon and focuses on self-contained text recognition using image processing techniques.

Key Features

Converts images to grayscale, reducing them to a single intensity channel.

Applies thresholding to separate text pixels from the background.

Segments individual characters from the image.

Provides a foundation for custom OCR models using machine learning.

Technology Stack

Python for scripting.

OpenCV for image processing.

NumPy for numerical operations.
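
As one illustration of where NumPy fits in this stack, a segmented character crop could be resampled to a fixed size and flattened into a feature vector for a custom model. The sketch below is an assumption about how that might look, not the project's implementation: the helper name to_feature_vector and the 16 x 16 target size are hypothetical.

```python
import numpy as np


def to_feature_vector(crop, size=16):
    """Resample a 2-D binary crop to size x size and flatten it to floats in [0, 1].

    Hypothetical helper for illustration; the name and default size are assumptions.
    """
    h, w = crop.shape
    # Nearest-neighbour sampling: pick `size` evenly spaced source rows and columns.
    rows = np.arange(size) * h // size
    cols = np.arange(size) * w // size
    resized = crop[np.ix_(rows, cols)]
    # Scale 0/255 pixel values to 0.0/1.0 and flatten to one dimension.
    return (resized.astype(np.float32) / 255.0).ravel()


# A 40 x 20 crop containing a single vertical stroke.
crop = np.zeros((40, 20), dtype=np.uint8)
crop[:, 8:12] = 255

vec = to_feature_vector(crop)
print(vec.shape)  # (256,)
```

This resampling ignores aspect ratio, so narrow and wide characters are stretched to the same square. Padding the crop to a square first would be one way to avoid that distortion.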

Project Goals

Enable text extraction from images without third-party OCR libraries.

Serve as a base model for training a custom OCR system.

Encourage open-source contributions for further development.

Potential Applications

Automating data extraction from scanned documents.

Building offline OCR solutions for privacy-focused applications.

Enhancing accessibility by converting images into readable text.

Team Members

Mohd Hasnain (mohd_hasnain)

MUDIT GARG (mudit__garg)

ABHAY KUMAR (abhay__kumar)

Yash Dubey (yash_dubey)