TedEval: A Fair Evaluation Metric for Scene Text Detectors

Last update: Nov 20, 2022

Official Python 3 implementation of TedEval | paper | slides

Chae Young Lee, Youngmin Baek, and Hwalsuk Lee.

Clova AI Research, NAVER Corp.

Overview

We propose TedEval, a new evaluation metric for scene text detectors. Through a separate instance-level matching policy and character-level scoring policy, TedEval addresses the drawbacks of previous metrics such as IoU and DetEval. This code is based on the official ICDAR15 evaluation code.

Methodology

1. Matching Policy

Non-exclusively gathers all possible matches, covering not only one-to-one but also one-to-many and many-to-one cases.
The thresholds for both area recall and area precision are set to 0.4.
Multiline matches are identified and rejected when |min(theta, 180 - theta)| > 45 degrees (see Fig. 2).
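
The thresholding and multiline rules can be sketched in a few lines of Python. This is a minimal illustration, not the code in script.py: it assumes axis-aligned (LTRB) boxes, checks a single GT/detection pair only, and treats the comparison against 0.4 as inclusive, which is an assumption. The helper names (area, is_candidate_match, is_multiline) are hypothetical, and theta is taken to be the angle from Fig. 2, in degrees.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)

AREA_THRESHOLD = 0.4    # applied to both area recall and area precision
MULTILINE_ANGLE = 45.0  # degrees


def area(box: Box) -> float:
    """Area of an axis-aligned box; degenerate boxes have area 0."""
    xmin, ymin, xmax, ymax = box
    return max(0.0, xmax - xmin) * max(0.0, ymax - ymin)


def intersection_area(a: Box, b: Box) -> float:
    """Area of the overlap between two axis-aligned boxes."""
    overlap = (max(a[0], b[0]), max(a[1], b[1]), min(a[2], b[2]), min(a[3], b[3]))
    return area(overlap)


def is_candidate_match(gt: Box, det: Box) -> bool:
    """Pairwise check (assumed inclusive): both ratios must reach the threshold."""
    gt_area, det_area = area(gt), area(det)
    if gt_area == 0 or det_area == 0:
        return False
    inter = intersection_area(gt, det)
    area_recall = inter / gt_area      # share of the GT box that is covered
    area_precision = inter / det_area  # share of the detection that lies on the GT
    return area_recall >= AREA_THRESHOLD and area_precision >= AREA_THRESHOLD


def is_multiline(theta_deg: float) -> bool:
    """Rejection rule from the list above: |min(theta, 180 - theta)| > 45."""
    return abs(min(theta_deg, 180.0 - theta_deg)) > MULTILINE_ANGLE


print(is_candidate_match((0, 0, 10, 10), (0, 0, 10, 5)))  # True
print(is_multiline(90.0))                                  # True
```

Because matches are gathered non-exclusively, a check like this would be applied to every GT/detection pair rather than stopping at the first hit.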

2. Scoring Policy

We compute Pseudo Character Centers (PCCs) from word-level bounding boxes and penalize matches in which PCCs are missing or overlapping.
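
As a rough illustration of the idea, the sketch below derives PCCs for an axis-aligned word box by splitting it into equal-width cells, one per character, and then counts how many PCCs a detection covers. The equal-width split, the LTRB box format, and the function names are assumptions made for this example; the scoring in script.py is more involved.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)
Point = Tuple[float, float]


def pseudo_character_centers(box: Box, num_chars: int) -> List[Point]:
    """Return one center per character, assuming equal-width characters."""
    xmin, ymin, xmax, ymax = box
    if num_chars <= 0:
        return []
    cell_width = (xmax - xmin) / num_chars
    center_y = (ymin + ymax) / 2
    return [(xmin + (i + 0.5) * cell_width, center_y) for i in range(num_chars)]


def covered_pccs(pccs: List[Point], det: Box) -> int:
    """Count the PCCs that fall inside a detection box (borders included)."""
    xmin, ymin, xmax, ymax = det
    return sum(1 for x, y in pccs if xmin <= x <= xmax and ymin <= y <= ymax)


gt_box = (0, 0, 40, 10)
pccs = pseudo_character_centers(gt_box, len("TEXT"))
print(pccs)                                # [(5.0, 5.0), (15.0, 5.0), (25.0, 5.0), (35.0, 5.0)]
print(covered_pccs(pccs, (0, 0, 20, 10)))  # 2
```

Here a detection covering only the left half of the word captures two of the four PCCs, which is the kind of partial coverage a character-level score can reflect.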

Sample Evaluation

Experiments

We evaluated state-of-the-art scene text detectors with TedEval on two benchmark datasets: ICDAR 2013 Focused Scene Text (IC13) and ICDAR 2015 Incidental Scene Text (IC15). Detectors are listed in order of publication date.

ICDAR 2013

Detector	Date (YY/MM/DD)	Recall (%)	Precision (%)	H-mean (%)
CTPN	16/09/12	82.1	92.7	87.6
RRPN	17/03/03	89.0	94.2	91.6
SegLink	17/03/19	65.6	74.9	70.0
EAST	17/04/11	77.7	87.1	82.5
WordSup	17/08/22	87.5	92.2	90.2
PixelLink	18/01/04	84.0	87.2	86.1
FOTS	18/01/05	91.5	93.0	92.6
TextBoxes++	18/01/09	87.4	92.3	90.0
MaskTextSpotter	18/07/06	90.2	95.4	92.9
PMTD	19/03/28	94.0	95.2	94.7
CRAFT	19/04/03	93.6	96.5	95.1

ICDAR 2015

Detector	Date (YY/MM/DD)	Recall (%)	Precision (%)	H-mean (%)
CTPN	16/09/12	85.0	81.1	67.8
RRPN	17/03/03	79.5	85.9	82.6
SegLink	17/03/19	77.1	83.9	80.6
EAST	17/04/11	82.5	90.0	86.3
WordSup	17/08/22	83.2	87.1	85.2
PixelLink	18/01/04	85.7	86.1	86.0
FOTS	18/01/05	89.0	93.4	91.2
TextBoxes++	18/01/09	82.4	90.8	86.5
MaskTextSpotter	18/07/06	82.5	91.8	86.9
PMTD	19/03/28	89.2	92.8	91.0
CRAFT	19/04/03	88.5	93.1	90.9
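
H-mean is the harmonic mean of recall and precision. The helper below (the function name is illustrative) recomputes it for one row of the table. The table values are rounded and may have been derived from unrounded recall and precision, so small differences in the last digit are possible.

```python
def h_mean(recall: float, precision: float) -> float:
    """Harmonic mean of recall and precision (same unit as the inputs)."""
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# RRPN on ICDAR 2015: Recall 79.5, Precision 85.9
print(round(h_mean(79.5, 85.9), 1))  # 82.6
```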

Frequency

Getting Started

Clone repository

git clone https://github.com/clovaai/TedEval.git

Requirements

Python 3
Python 3 packages: Polygon3, Bottle, Pillow

# install
pip3 install Polygon3 bottle Pillow

Supported Annotation Type

LTRB(xmin, ymin, xmax, ymax)
QUAD(x1, y1, x2, y2, x3, y3, x4, y4)
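
For reference, the two annotation types can be converted as sketched below. The function names are hypothetical, and the clockwise-from-top-left corner order is an assumption based on the common ICDAR convention; check it against your data before relying on it.

```python
from typing import Sequence, Tuple


def ltrb_to_quad(xmin: int, ymin: int, xmax: int, ymax: int) -> Tuple[int, ...]:
    """Expand an LTRB box into 4 corners: top-left, top-right, bottom-right, bottom-left."""
    return (xmin, ymin, xmax, ymin, xmax, ymax, xmin, ymax)


def quad_to_ltrb(quad: Sequence[int]) -> Tuple[int, int, int, int]:
    """Axis-aligned bounding box of a quadrilateral (lossy for rotated text)."""
    xs, ys = quad[0::2], quad[1::2]
    return (min(xs), min(ys), max(xs), max(ys))


print(ltrb_to_quad(10, 20, 110, 60))
# (10, 20, 110, 20, 110, 60, 10, 60)
print(quad_to_ltrb((644, 101, 932, 113, 932, 168, 643, 156)))
# (643, 101, 932, 168)
```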

Evaluation

Prepare data

The ground truth and the result data should be text files, one per sample. Each filename must contain img_{number}, where the number identifies the image sample.

# gt/gt_img_38.txt
644,101,932,113,932,168,643,156,{Transcription}
477,138,487,139,488,149,477,148,###
344,131,398,130,398,149,344,149,###
1195,148,1277,138,1277,177,1194,187,###
23,270,128,267,128,282,23,284,###

# result/res_img_38.txt
644,101,932,113,932,168,643,156,{Transcription},{Confidence}
477,138,487,139,488,149,477,148
344,131,398,130,398,149,344,149
1195,148,1277,138,1277,177,1194,187
23,270,128,267,128,282,23,284
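
The file layout above can be read with a few lines of Python. This is a sketch for QUAD lines only, not the parser used by script.py. The function names are hypothetical, and everything after the eighth coordinate is returned as one raw string, since a result line may carry a transcription, a confidence, or both.

```python
import re
from typing import List, Optional, Tuple


def sample_id(filename: str) -> Optional[int]:
    """Extract the number from the img_{number} part of a filename."""
    match = re.search(r"img_(\d+)", filename)
    return int(match.group(1)) if match else None


def parse_quad_line(line: str) -> Tuple[List[int], Optional[str]]:
    """Split a QUAD line into 8 integer coordinates and the optional remainder."""
    parts = line.strip().split(",", 8)  # at most 8 splits: 8 coordinates + remainder
    if len(parts) < 8:
        raise ValueError(f"expected at least 8 comma-separated values: {line!r}")
    coords = [int(value) for value in parts[:8]]
    rest = parts[8] if len(parts) == 9 else None
    return coords, rest


print(sample_id("gt/gt_img_38.txt"))                           # 38
print(parse_quad_line("477,138,487,139,488,149,477,148,###"))
# ([477, 138, 487, 139, 488, 149, 477, 148], '###')
print(parse_quad_line("23,270,128,267,128,282,23,284"))
# ([23, 270, 128, 267, 128, 282, 23, 284], None)
```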

Compress these text files.

zip gt.zip gt/*
zip result.zip result/*
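
Where the zip command is not available, the same archives can be built with Python's standard zipfile module. The sketch below is one possible equivalent: the function name is hypothetical, and it keeps the directory prefix in the archive (for example gt/gt_img_38.txt) to mirror what the zip commands above produce.

```python
import zipfile
from pathlib import Path
from typing import List


def zip_text_files(src_dir: str, zip_path: str) -> List[str]:
    """Write every .txt file in src_dir into zip_path and return the archive names."""
    src = Path(src_dir)
    names = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for txt in sorted(src.glob("*.txt")):
            arcname = f"{src.name}/{txt.name}"  # same layout as `zip gt.zip gt/*`
            archive.write(txt, arcname=arcname)
            names.append(arcname)
    return names


# Usage, run from the directory that contains gt/ and result/:
#   zip_text_files("gt", "gt.zip")
#   zip_text_files("result", "result.zip")
```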

Refer to gt/result.zip and gt/gt_*.zip for examples.

Run stand-alone evaluation

python script.py -g=gt/gt.zip -s=result/result.zip

Specify the paths of the GT and submission files with the -g and -s flags, respectively.
QUAD is the default annotation type. To switch between QUAD and LTRB, add -p='{"LTRB" = False}' to the command, or modify the default_evaluation_params() function in script.py directly.
If your submission file contains confidence or transcription values, add -p='{"CONFIDENCES" = True}' or -p='{"TRANSCRIPTION" = True}', respectively.

Run Visualizer

python web.py

Place the zip files of the dataset's images and GTs, named images.zip and gt.zip, respectively, in the gt directory.
Create an empty directory named output. This is where the DB, submission files, and result files will be created.
You can change the host and port number in the final line of web.py.

The file structure should then be:

.
├── gt
│   ├── gt.zip
│   └── images.zip
├── output   # empty dir
├── script.py
├── web.py
├── README.md
└── ...

Citation

@article{lee2019tedeval,
  title={TedEval: A Fair Evaluation Metric for Scene Text Detectors},
  author={Lee, Chae Young and Baek, Youngmin and Lee, Hwalsuk},
  journal={arXiv preprint arXiv:1907.01227},
  year={2019}
}

Contact us

We welcome any feedback on our metric. Please contact the authors via {cylee7133, youngmin.baek, hwalsuk.lee}@gmail.com. In case of code errors, open an issue and we will get back to you.

License

Copyright (c) 2019-present NAVER Corp.

 Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

 The above copyright notice and this permission notice shall be included in
all copies or substantial portions of the Software.

 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
THE SOFTWARE.
