Image to Text Converter (OCR)

Project Description: Image to Text Converter (FOSS Hackathon)

Overview

This project is an open-source Optical Character Recognition (OCR) system that extracts text from images without relying on external OCR APIs like Tesseract. It is designed for the FOSS Hackathon, focusing on self-contained text recognition using image processing techniques.

Key Features

Converts images to grayscale for better contrast.

Applies thresholding to enhance text visibility.

Segments individual characters from the image.

Provides a foundation for custom OCR models using machine learning.

Technology Stack

Python for scripting.

OpenCV for image processing.

NumPy for numerical operations.

Project Goals

Enable text extraction from images without third-party OCR libraries.

Serve as a base model for training a custom OCR system.

Encourage open-source contributions for further development.

Potential Applications

Automating data extraction from scanned documents.

Building offline OCR solutions for privacy-focused applications.

Enhancing accessibility by converting images into readable text.

No Issues, PRs or Discussions added.

Mohd Hasnain

mohd_hasnain

MUDIT GARG

mudit__garg

ABHAY KUMAR

abhay__kumar

Yash Dubey

yash_dubey