The CIS OCR Post Correction Tool PoCoTo

Source code for the Java-based PoCoTo client enabling fast interactive batch corrections of complete OCR error series in OCR'ed historical documents. For a detailed description see the PoCoTo Manual.

The lastest compiled binary can be downloaded here.

References

PoCoTo has originally been written by Thorsten Vobl as part of his master's thesis in computational linguistics at CIS during the IMPACT project.

It has been further developed as a CLARIN-D Kurationsprojekt by Florian Fink and Uwe Springmann at CIS.

Its underlying technology is described in the following publication:

Vobl, Thorsten, Annette Gotscharek, Uli Reffle, Christoph Ringlstetter, and Klaus U. Schulz. 2014. “PoCoTo - an Open Source System for Efficient Interactive Postcorrection of OCRed Historical Texts.” In Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage, 57–61. DATeCH ’14. New York, NY, USA: ACM. doi:http://doi.org/10.1145/2595188.2595197.

The CIS OCR PostCorrectionTool

Related tags

Overview

The CIS OCR Post Correction Tool PoCoTo

References

Owner

CIS OCR Group

chineseocr/table_line 表格线检测模型pytorch版

An organized collection of tutorials and projects created for aspriring computer vision students.

Vietnamese Language Detection and Recognition

STEFANN: Scene Text Editor using Font Adaptive Neural Network

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

Read Japanese manga inside browser with selectable text.

Text Detection from images using OpenCV

Code for AAAI 2021 paper: Sequential End-to-end Network for Efficient Person Search

Code for the head detector (HeadHunter) proposed in our CVPR 2021 paper Tracking Pedestrian Heads in Dense Crowd.

M-LSDを用いて四角形を検出し、射影変換を行うサンプルプログラム

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

Document Image Dewarping

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.

SRA's seminar on Introduction to Computer Vision Fundamentals

This repository provides train＆test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

The first open-source library that detects the font of a text in a image.

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

Automatically remove the mosaics in images and videos, or add mosaics to them.