Detect and fix skew in images containing text

Last update: Dec 21, 2022

Overview

Alyn

Skew detection and correction in images containing text

Image with skew

Image after deskew

Install and use via pip!

Recommended way(using virtualenv):

mkdir alyn-test
cd alyn test
mkvirtualenv .
pip install alyn
source bin/activate

To detect skew angle in image:

from alyn import SkewDetect
sd = SkewDetect(
	input_file='path_to_file',
	batch_path='optional_batch_processing_path',
	output_file='optional_text_file_output_path',
	display_output='Yes/No')
sd.run()

Extra options:

sigma:canny edge detection blurring
plot_hough: display hough lines detected
num_peaks: control the number of hough line peaks

To deskew image:

from alyn import Deskew
d = Deskew(
	input_file='path_to_file',
	display_image='preview the image on screen',
	output_file='path_for_deskewed image',
	r_angle='offest_angle_in_degrees_to_control_orientation')`
d.run()

Requires

numpy
matplotlib
scipy
scikit-image

Techniques used

Canny Edge Detection
Hough Transform

Features

Detect the skew in given images
Display the output
Save the output to txt file
Batch process files in a directory
View Hough Transform of a given image
Set the number of peaks for Hough Transform and Sigma for Canny Edge detection
Rotate the image to remove the skew

How the skew detection works

The skew detection script takes image file as input, then performs the following steps:

Converts the image to greyscale
Performs Canny Edge Detection on the Image
Calculates the Hough Transform values
Determines the peaks
Determines the deviation of each peaks from 45 degree angle
Segregates the detected peaks into bins
Chooses the probable skew angle using the value in the bins

The deskew script uses the skew angle determined using skew detection script to remove the skew from the image.

Using scripts directly(older method)

Image skew calculation using skew_detect.py

To calculate the skew angle for a given image file, use -i option followed by the path to file:

./skew_detect.py -i image.jpg

To save output in a text file add -o option followed by the output file name:

./skew_detect.py -i image.jpg -o output.txt

To display output information add -d option followed by a string Yes:

./skew_detect.py -i image.jpg -d Yes

To batch process files in a directory, use -b option followed by the path to directory:

./skew_detect.py  -b examples

To display Hough Transform plot for an image,:

./skew_detect.py -i image.jpg -p Yes

Output of the Hough Transform:

To set the value of sigma for Gaussian blurring in Canny Edge Detection, use -s option followed by the desired value:

./skew_detect.py -i image.jpg -s 3

To set the number of peaks collected from Hough Transform, use -n option followed by the desired value:

./skew_detect.py -i image.jpg -n 10

Image Deskew using deskew.py

To perform a simple deskew and display the output:

./deskew.py -i image.jpg -d Yes

To save the deskewed image, use the following:

./deskew.py -i image.jpg -o rotated.jpg

In some cases the result image might be upside down or the text may be running vertical, To fix this, use -r followed by the desired angle in int:

./deskew.py -i image.jpg -o rotated.jpg -r 90

To generate data for experimental purposes, run the test_img_gen.py in test_data folder. This will generate images containing a white line having angle between 0 to 180 degrees.

Detect and fix skew in images containing text

Related tags

Overview

Alyn

Skew detection and correction in images containing text

Image with skew

Image after deskew

Install and use via pip!

To detect skew angle in image:

Extra options:

To deskew image:

Requires

Techniques used

Features

How the skew detection works

Using scripts directly(older method)

Image skew calculation using skew_detect.py

Output of the Hough Transform:

Image Deskew using deskew.py

Owner

Kakul

Some Boring Research About Products Recognition 、Duplicate Img Detection、Img Stitch、OCR

かの有名なあの東方二次創作ソング、「bad apple!」のMVをPythonでやってみたって話

Make OpenCV camera loops less of a chore by skipping the boilerplate and getting right to the interesting stuff

Convert scans of handwritten notes to beautiful, compact PDFs

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight'

OCR powered screen-capture tool to capture information instead of images

A simple demo program for using OpenCV on Android

A document scanner application for laptops/desktops developed using python, Tkinter and OpenCV.

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Motion Detection Squid Game with OpenCV Python

A simple python program to record security cam footage by detecting a face and body of a person in the frame.

SRA's seminar on Introduction to Computer Vision Fundamentals

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

Table Extraction Tool

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

a deep learning model for page layout analysis / segmentation.