docstrum

Last update: Dec 13, 2022

Related tags

Computer Vision docstrum

Overview

Docstrum Algorithm

Getting Started

This repo is for developing a Docstrum algorithm presented by O’Gorman (1993).

Disclaimer

This source code is built on top of the work by Chadoliver. Please find the original code from here (https://github.com/chadoliver/cosc428-structor).

Objective

This project aims at segmenting a document image into meaningful components. The domain of image is specified on historical machine-printed/hand-written document image.

Dependencies

python 2.7
Packages:
- numpy
- cv2

Process

Pre-processing Optional for vertical-line removal
- Blurring Bilateral Filtering
- Otsu's thresholding
- Morphological erosion & dilation
- Smoothing (Averaging)
- Static thresholding
Nearest-Neighbor Clustering and Docstrum Plot
Spacing and Orientation Estimation
Determination of Text-lines
Structural Block Determination
Post-processing
- TBD

Evaluation

Citing Docstrum

O'Gorman, L., 1993. The document spectrum for page layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11), pp.1162-1173. pdf.

@article{o1993document,
  title={The document spectrum for page layout analysis},
  author={O'Gorman, Lawrence},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  volume={15},
  number={11},
  pages={1162--1173},
  year={1993},
  publisher={IEEE}
}

Notes

How to remove .DS_Store

find . -name '.DS_Store' -type f -delete

docstrum

Related tags

Overview

Docstrum Algorithm

Getting Started

Disclaimer

Objective

Dependencies

Process

Evaluation

Citing Docstrum

Notes

How to remove .DS_Store

Owner

Chulwoo Mike Pack

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Document Layout Analysis Projects

Learning Camera Localization via Dense Scene Matching, CVPR2021

scene-linear test images

Read Japanese manga inside browser with selectable text.

OCR powered screen-capture tool to capture information instead of images

Fine tuning keras-ocr python package with custom synthetic dataset from scratch

Localization of thoracic abnormalities model based on VinBigData (top 1%)

Indonesian ID Card OCR using tesseract OCR

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

A simple python program to record security cam footage by detecting a face and body of a person in the frame.

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

ERQA - Edge Restoration Quality Assessment

QuanTaichi: A Compiler for Quantized Simulations (SIGGRAPH 2021)

7th place solution

This tool will help you convert your text to handwriting xD

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes