Project Description: Image to Text Converter (FOSS Hackathon)
Overview
This project is an open-source Optical Character Recognition (OCR) system that extracts text from images without relying on external OCR APIs like Tesseract. It is designed for the FOSS Hackathon, focusing on self-contained text recognition using image processing techniques.
Key Features
Converts images to grayscale for better contrast.
Applies thresholding to enhance text visibility.
Segments individual characters from the image.
Provides a foundation for custom OCR models using machine learning.
Technology Stack
Python for scripting.
OpenCV for image processing.
NumPy for numerical operations.
Project Goals
Enable text extraction from images without third-party OCR libraries.
Serve as a base model for training a custom OCR system.
Encourage open-source contributions for further development.
Potential Applications
Automating data extraction from scanned documents.
Building offline OCR solutions for privacy-focused applications.
Enhancing accessibility by converting images into readable text.