A post-processing tool for scanned sheets of paper.

Last update: Dec 07, 2022

Related tags

Overview

unpaper

Originally written by Jens Gulden — see AUTHORS for more information. Licensed under GNU GPL v2 — see COPYING for more information.

Overview

unpaper is a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies. The main purpose is to make scanned book pages better readable on screen after conversion to PDF. Additionally, unpaper might be useful to enhance the quality of scanned pages before performing optical character recognition (OCR).

unpaper tries to clean scanned images by removing dark edges that appeared through scanning or copying on areas outside the actual page content (e.g. dark areas between the left-hand-side and the right-hand-side of a double- sided book-page scan).

The program also tries to detect misaligned centering and rotation of pages and will automatically straighten each page by rotating it to the correct angle. This process is called "deskewing".

Note that the automatic processing will sometimes fail. It is always a good idea to manually control the results of unpaper and adjust the parameter settings according to the requirements of the input. Each processing step can also be disabled individually for each sheet.

See further documentation for the supported file formats notes.

Dependencies

The only hard dependency of unpaper is ffmpeg, which is used for file input and output.

Building instructions

unpaper uses GNU Autotools for its build system, so you should be able to execute the same commands used for other software packages:

./configure
make
sudo make install

There are, though, some recommendations about the way you build the code. Since the tasks are calculation-intensive, it is important to build with optimizations turned on:

./configure CFLAGS="-O2 -march-native -pipe"

Even better, if your compiler supports it, is to use Link-Time Optimizations, as that has shown that execution time can improve sensibly:

./configure CFLAGS="-O2 -march=native -pipe -flto"

Further optimizations such as -ftracer and -ftree-vectorize are thought to work, but their effect has not been evaluated so your mileage may vary.

Further Information

You can find more information on the basic concepts and the image processing in the available documentation.

A post-processing tool for scanned sheets of paper.

Related tags

Overview

unpaper

Overview

Dependencies

Building instructions

Further Information

Owner

keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》；欢迎试用，关注，并反馈问题...

Distort a video using Seam Carving (video) and Vibrato effect (sound)

Automatically remove the mosaics in images and videos, or add mosaics to them.

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

Automatically fishes for you while you are afk :)

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

PyQT5 app that colorize black & white pictures using CNN(use pre-trained model which was made with OpenCV)

Corner-based Region Proposal Network

Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

Discord QR Scam Code Generator + Token grab mobile device.

Extract tables from scanned image PDFs using Optical Character Recognition.

An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

BoxToolBox is a simple python application built around the openCV library

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

Deep learning based page layout analysis

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).

OCR of Chicago 1909 Renumbering Plan

FOTS Pytorch Implementation