This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure recognition.

Last update: Dec 29, 2022

Related tags

Deep Learning WTW-Dataset

Overview

WTW-Dataset

This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on ICCV 2021. Here, you can download the paper, and Supplementary materials.

WTW-Dataset is the first wild table dataset for table detection and table structure recongnition tasks, which is constructed from photoing, scanning and web pages, covers 7 challenging cases like: (1)Inclined tables, (2) Curved tables, (3) Occluded tables or blurredtables (4) Extreme aspect ratio tables (5) Overlaid tables, (6) Multi-color tables and (7) Irregular tables in table structure recognition.

It contains 14581 images with the following ground-truths:

- data
 - train
  - images
  - xml (including image name, table id, table cell bbox(four vertices), start col/row, end col/row)
 - test
  - images
  - xml
  - class (7 .txt files include image names for 7 different challenging cases)

Download link is here

To be updated

Our results on WTW-dataset

Evaluation code

Data to other forms:

If you want to change to other common forms, you can do followings :

run the xmltococo.py to change the xml to json form.(To be updated)
run the xmltohtml.py to change the xml to html form.(To be updated)

Model link

Our model Cycle-Centernet has been used as Alibaba's online business software, so we can't open the model code. If you need to test, you can use the following online test link to try the different table images.

Citation:

If you use the dataset, please consider citing our work-

@InProceedings{Long_2021_ICCV,
	author = {Rujiao, Long and Wen, Wang and Nan, Xue and Feiyu, Gao and Zhibo, Yang and Yongpan, Wang and Gui-Song, Xia},
	title = {Parsing Table Structures in the Wild},
	booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
	month = {October},
	year = {2021}
}

This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure recognition.

Related tags

Overview

WTW-Dataset

To be updated

Data to other forms:

Model link

Citation:

Owner

PyTorch Code for NeurIPS 2021 paper Anti-Backdoor Learning: Training Clean Models on Poisoned Data.

For the paper entitled ''A Case Study and Qualitative Analysis of Simple Cross-Lingual Opinion Mining''

DISTIL: Deep dIverSified inTeractIve Learning.

HyDiff: Hybrid Differential Software Analysis

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

General neural ODE and DAE modules for power system dynamic modeling.

Mitsuba 2: A Retargetable Forward and Inverse Renderer

Anomaly Detection Based on Hierarchical Clustering of Mobile Robot Data

Intent parsing and slot filling in PyTorch with seq2seq + attention

Boosted CVaR Classification (NeurIPS 2021)

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Quantile Regression DQN a Minimal Working Example, Distributional Reinforcement Learning with Quantile Regression

The official code for paper "R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling".

A set of tools to pre-calibrate and calibrate (multi-focus) plenoptic cameras (e.g., a Raytrix R12) based on the libpleno.

deep learning for image processing including classification and object-detection etc.

Library to enable Bayesian active learning in your research or labeling work.

we propose EfficientDerain for high-efficiency single-image deraining

SARS-Cov-2 Recombinant Finder for fasta sequences

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search