OCR system for Arabic language that converts images of typed text to machine-encoded text.

Last update: Jan 05, 2023

Overview

Arabic OCR

An OCR system for the Arabic language that converts images of typed text into machine-encoded text.
The system currently supports letters only (29 letters): ا-ى and لا.
It targets a simplified OCR problem: images that contain only Arabic characters (see the dataset link below for sample images).

Setup

Install Python, then run this command:

```
pip install -r requirements.txt
```

Run

Put the images in the src/test directory.
Go to the src directory and run the following command:
```
python OCR.py
```
An output folder will be created containing:
- a text folder with one text file per input image.
- a running_time file with the time taken to process each image.

Pipeline

Dataset

Link to the dataset of images and the corresponding text: here.
We used 1000 images to generate the character dataset used for training.

Examples

Line Segmentation
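
This README does not describe how lines are separated, so the following is only a minimal sketch of one common approach, a horizontal projection profile, and not necessarily the method used in OCR.py. It assumes the page is already binarized into rows of 0/1 values (1 = ink); the function name `segment_lines` and the sample `page` are hypothetical, and plain Python lists are used only to keep the sketch dependency-free.

```python
def segment_lines(image):
    """Return (top, bottom) row ranges for each text line; bottom is exclusive.

    `image` is a list of rows, each a list of 0/1 ints where 1 means ink.
    """
    # Horizontal projection: number of ink pixels in every row.
    profile = [sum(row) for row in image]

    lines = []
    start = None  # first row of the line currently being tracked
    for y, ink in enumerate(profile):
        if ink > 0 and start is None:
            start = y                 # a text line begins
        elif ink == 0 and start is not None:
            lines.append((start, y))  # a blank row ends the line
            start = None
    if start is not None:             # line touches the bottom edge
        lines.append((start, len(profile)))
    return lines


# Hypothetical 6x5 binary page with two text lines.
page = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
]
print(segment_lines(page))  # [(1, 3), (4, 5)]
```

On noisy scans a threshold slightly above zero may work better than the strict `ink == 0` test used here.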

Word Segmentation
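
As a rough illustration only, words inside one text line can be separated by looking for runs of blank columns that are wider than the gaps between letters. The sketch below is an assumption about how this step could work, not the repository's code. The function name `segment_words`, the `min_gap` parameter and its default are hypothetical, and the input is again a binarized line given as rows of 0/1 values.

```python
def segment_words(line, min_gap=2):
    """Return (left, right) column ranges of words; right is exclusive.

    Words are returned rightmost first, matching Arabic reading order.
    `line` is a non-empty list of rows of 0/1 ints where 1 means ink.
    """
    width = len(line[0])
    # Vertical projection: number of ink pixels in every column.
    profile = [sum(row[x] for row in line) for x in range(width)]

    words = []
    start = None  # left edge of the word currently being tracked
    gap = 0       # consecutive blank columns seen since the last ink column
    for x, ink in enumerate(profile):
        if ink > 0:
            if start is None:
                start = x
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                # The word ended just before this run of blank columns.
                words.append((start, x - gap + 1))
                start = None
                gap = 0
    if start is not None:
        # Word reaches the end of the line (minus any trailing blanks).
        words.append((start, width - gap))
    return words[::-1]


# Hypothetical one-row line: a 1-column gap stays inside a word,
# a 3-column gap separates two words.
sample = [[1, 1, 0, 1, 0, 0, 0, 1, 1, 0]]
print(segment_words(sample))  # [(7, 9), (0, 4)]
```

The value of `min_gap` would need tuning per font and resolution, since it decides whether a gap counts as a space between words or a break inside a word.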

Character Segmentation

Performance

Average accuracy: 95%.
Average time per image: 16 seconds.

NOTE

We achieved these results using only the flattened image as the feature.
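
To make "flattened image" concrete, here is a minimal sketch of turning a 2D character image into a 1D feature vector by concatenating its rows. The function name `flatten_image` and the tiny `glyph` are hypothetical; the actual image size and pixel format used by this project are not stated here.

```python
def flatten_image(char_image):
    """Concatenate the rows of a 2D image into one flat list (row-major order)."""
    return [pixel for row in char_image for pixel in row]


# Hypothetical 2x3 binary character image.
glyph = [
    [0, 1, 0],
    [1, 1, 1],
]
features = flatten_image(glyph)
print(features)  # [0, 1, 0, 1, 1, 1]
```

For a classifier that expects fixed-length input, every character image would have to be resized to the same dimensions before flattening so that all feature vectors have equal length.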


Owner

Hussein Youssef
