A demo for end-to-end English and Chinese text spotting using ABCNet.

Last update: Oct 04, 2022

Related tags

Overview

ABCNet_Chinese

A demo for end-to-end English and Chinese text spotting using ABCNet. This is an old model that was trained a long ago, which serves as a base setting for others to train their own model on Chinese or other language. Official ABCNet_v2 models will be updated in AdelaiDet.

Installation

Install detectron2 using the provided version (support visualizing Chinese text):

python -m pip install -e d2

Install this repo:

python setup.py build develop

If the above succeed, you can now run the demo using the provided model.

Model

This is our model that can be used for evaluation or pretraining.

wget https://drive.google.com/file/d/1iWX2n_BmyltVwQmfj8_oM9z7cJlq1P0m/view?usp=sharing -O model_chn.pth

Simply put the model in the root directory of the repo.

Demo

bash demo.sh

Example results

If you successfully run the demo, you will get the output below:

Other results (same project but not using the provide model):

Document-like Ancient words, e.g., “彝文”:

Cite

If you find this repo useful, please cite:

@article{liu2021abcnet,
  title={ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting},
  author={Liu, Yuliang and Shen, Chunhua and Jin, Lianwen and He, Tong and Chen, Peng and Liu, Chongyu and Chen, Hao},
  journal={arXiv preprint arXiv:2105.03620},
  year={2021}
}

Data

We provide the converted json files of ArT, LSVT, and ReCTS that we have used for training ABCNet_Chinese.

ReCTs [images&label](1.7G) [Origin_of_dataset]
LSVT [images&label](8.2G) [Origin_of_dataset]
ArT [images&label](1.5G) [Origin_of_dataset]
SynChinese130k [images&label](25G) [Origin_of_dataset]

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact Chunhua Shen.

A demo for end-to-end English and Chinese text spotting using ABCNet.

Related tags

Overview

ABCNet_Chinese

Installation

Model

Demo

Example results

Cite

Data

License

Owner

Yuliang Liu

This script just scrapes the most recent Nepali news from Kathmandu Post and notifies the user about current events at regular intervals.It sends out the most recent news at random!

NLP library designed for reproducible experimentation management

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

A framework for implementing federated learning

Model parallel transformers in JAX and Haiku

Yet Another Neural Machine Translation Toolkit

Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning

Repositório do trabalho de introdução a NLP

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

A Plover python dictionary allowing for consistent symbol input with specification of attachment and capitalisation in one stroke.

AutoGluon: AutoML for Text, Image, and Tabular Data

The code from the whylogs workshop in DataTalks.Club on 29 March 2022

A list of NLP(Natural Language Processing) tutorials built on Tensorflow 2.0.

Malaya-Speech is a Speech-Toolkit library for bahasa Malaysia, powered by Deep Learning Tensorflow.

End-to-End Speech Processing Toolkit

Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Translators - is a library which aims to bring free, multiple, enjoyable translation to individuals and students in Python