Document Blur Detection

For general blurred images, the variance of the Laplacian is a good blur measure. For documents, however, and especially for document images with blurred text, text detection should be used first to locate the text regions, so that blur is measured where the text actually is.
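To make the variance-of-Laplacian idea concrete, here is a minimal sketch using only NumPy. It is not the code in main.py; the function name `laplacian_variance` and the synthetic test images are assumptions made for illustration. With OpenCV, a comparable score is commonly obtained from `cv2.Laplacian(gray, cv2.CV_64F).var()`.

```python
import numpy as np

def laplacian_variance(gray):
    """Variance of a 4-neighbour Laplacian over the image interior.

    Hypothetical helper for illustration; higher values mean more edges,
    which usually indicates a sharper image.
    """
    g = np.asarray(gray, dtype=np.float64)
    # Discrete Laplacian: up + down + left + right - 4 * centre.
    # Border pixels are skipped, so the result is (H-2) x (W-2).
    lap = (g[:-2, 1:-1] + g[2:, 1:-1] + g[1:-1, :-2] + g[1:-1, 2:]
           - 4.0 * g[1:-1, 1:-1])
    return float(lap.var())

# Synthetic inputs: a pixel-level checkerboard is full of edges,
# a constant image has none.
sharp = (np.indices((32, 32)).sum(axis=0) % 2) * 255.0
flat = np.full((32, 32), 128.0)

sharp_score = laplacian_variance(sharp)
flat_score = laplacian_variance(flat)
```

A low score suggests blur, but the useful cut-off depends on image content and resolution, so it has to be tuned per data set.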

This package depends mainly on OpenCV and PaddlePaddle. To install them from requirements.txt, run:

pip install -r requirements.txt

The PaddleOCR text detection inference model is used to locate text. You can download it from https://paddleocr.bj.bcebos.com/dygraph_v2.0/ch/ch_ppocr_mobile_v2.0_det_infer.tar. The text detection code in this project is based on the PaddleOCR project; for more information, see https://github.com/PaddlePaddle/PaddleOCR.

To run main.py, use the following command.

python ./main.py --image './text_blur.jpg' --thresh_v 300 --thresh_d 0.7
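The README does not document what --thresh_v and --thresh_d mean. One plausible reading, sketched below, is that a text region counts as blurred when its Laplacian variance falls below thresh_v, and the document counts as blurred when the fraction of blurred regions reaches thresh_d. This interpretation, the function `is_document_blurred`, and the axis-aligned `(x0, y0, x1, y1)` box format are all assumptions, not the behaviour of main.py (PaddleOCR's detector returns quadrilaterals, which would first need to be cropped or rectified).

```python
import numpy as np

def laplacian_variance(gray):
    """Variance of a 4-neighbour Laplacian over the region interior."""
    g = np.asarray(gray, dtype=np.float64)
    lap = (g[:-2, 1:-1] + g[2:, 1:-1] + g[1:-1, :-2] + g[1:-1, 2:]
           - 4.0 * g[1:-1, 1:-1])
    return float(lap.var())

def is_document_blurred(gray, boxes, thresh_v=300.0, thresh_d=0.7):
    """Assumed decision rule, not taken from main.py.

    boxes: iterable of (x0, y0, x1, y1) text regions, each at least 3x3 pixels.
    """
    scores = [laplacian_variance(gray[y0:y1, x0:x1])
              for x0, y0, x1, y1 in boxes]
    if not scores:
        return False  # no text regions found, nothing to judge
    # Fraction of text regions whose sharpness score is below thresh_v.
    blurred_fraction = sum(s < thresh_v for s in scores) / len(scores)
    return blurred_fraction >= thresh_d

# Synthetic page: left half is an edge-rich checkerboard, right half is flat.
page = np.full((40, 80), 128.0)
page[:, :40] = (np.indices((40, 40)).sum(axis=0) % 2) * 255.0
boxes = [(0, 0, 40, 40), (40, 0, 80, 40)]

default_result = is_document_blurred(page, boxes)               # half the boxes are blurred
lenient_result = is_document_blurred(page, boxes, thresh_d=0.5)
```

Scoring only the detected text regions keeps large blank margins, which have almost no edges, from dragging the score down on an otherwise sharp page.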

To produce blurred document images for testing, run blur_ops.py, which simulates motion blur and Gaussian blur. Set --blur_type to either 'gaussian blur' or 'motion blur', for example:

python blur_ops.py --image_path './bean-license.png' --output_path './gaussian_blur.jpg' --blur_type 'gaussian blur'
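For readers who want to see what the two blur types do, the sketch below simulates them with plain NumPy convolutions. It is not the implementation in blur_ops.py; the function names, the horizontal motion direction, and the default `length` and `sigma` values are assumptions chosen for illustration.

```python
import numpy as np

def convolve_rows(img, kernel):
    """Convolve every row with a 1-D kernel, keeping the image size.

    Assumes each row is longer than the kernel. Borders are zero-padded,
    so pixels near the edges come out darker.
    """
    return np.apply_along_axis(
        lambda row: np.convolve(row, kernel, mode="same"), 1, img)

def motion_blur(gray, length=9):
    """Horizontal motion blur: average `length` neighbouring pixels per row."""
    kernel = np.ones(length) / length
    return convolve_rows(np.asarray(gray, dtype=np.float64), kernel)

def gaussian_blur(gray, sigma=2.0):
    """Separable Gaussian blur: filter the rows, then the columns."""
    radius = int(3 * sigma)
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-(x ** 2) / (2 * sigma ** 2))
    kernel /= kernel.sum()  # normalise so overall brightness is preserved
    g = np.asarray(gray, dtype=np.float64)
    # Transposing lets the row filter act on columns in the second pass.
    return convolve_rows(convolve_rows(g, kernel).T, kernel).T

# Synthetic input: a hard vertical edge, which both blurs should soften.
edge = np.zeros((32, 32))
edge[:, 16:] = 255.0

moved = motion_blur(edge)
smoothed = gaussian_blur(edge)
```

Motion blur smears detail along one direction only, while Gaussian blur softens it equally in all directions, which is why the two are useful as separate test cases.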

Some results:

Document blur detection based on Laplacian operator and text detection.


Owner

JoeyLr
