Vietnamese Language Detection and Recognition

Last update: May 27, 2022

Overview

Table of Content

Introduction (Khôi viết)
Dataset (đổi link thui thành 3k5 ảnh mình)
Getting Started (An Viết)
- Requirements
- Usage Example
Training & Evaluation (Tấn + Quỳnh viết)
Acknowledgement (đổi link thui)

Dictionary-guided Scene Text Recognition

We propose a novel dictionary-guided sense text recognition approach that could be used to improve many state-of-the-art models.


Comparison between the traditional approach and our proposed approach.

Details of the dataset construction, model architecture, and experimental results can be found in our following paper:

@inproceedings{m_Nguyen-etal-CVPR21,
      author = {Nguyen Nguyen and Thu Nguyen and Vinh Tran and Triet Tran and Thanh Ngo and Thien Nguyen and Minh Hoai},
      title = {Dictionary-guided Scene Text Recognition},
      year = {2021},
      booktitle = {Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition (CVPR)},
    }

Please CITE our paper whenever our dataset or model implementation is used to help produce published results or incorporated into other software.

Dataset

We introduce ✨ a new VinText dataset.

By downloading this dataset, USER agrees:

to use this dataset for research or educational purposes only

to not distribute or part of this dataset in any original or modified form.

and to cite our paper whenever this dataset are employed to help produce published results.

Name	#imgs	#text instances	Examples
VinText	2000	About 56000

Detail about ✨ VinText dataset can be found in our paper. Download Converted dataset to try with our model

Dataset variant	Input format	Link download
Original	x1,y1,x2,y2,x3,y3,x4,y4,TRANSCRIPT	Download here
Converted dataset	COCO format	Download here

VinText

Extract data and copy folder to folder datasets/

datasets
└───vintext
	└───test.json
		│train.json
		|train_images
		|test_images
└───evaluation
	└───gt_vintext.zip

Getting Started

Requirements

python=3.7
torch==1.4.0
detectron2==0.2

Installation

conda create -n dict-guided -y python=3.7
conda activate dict-guided
conda install -y pytorch torchvision cudatoolkit=10.0 -c pytorch
python -m pip install ninja yacs cython matplotlib tqdm opencv-python shapely scipy tensorboardX pyclipper Polygon3 weighted-levenshtein editdistance

# Install Detectron2
python -m pip install detectron2==0.2 -f \
  https://dl.fbaipublicfiles.com/detectron2/wheels/cu100/torch1.4/index.html

Check out the code and install:

git clone https://github.com/nguyennm1024/dict-guided.git
cd dict-guided
python setup.py build develop

Download vintext pre-trained model

trained_model.

Usage

Prepare folders

mkdir sample_input
mkdir sample_output

Copy your images to sample_input/. Output images would result in sample_output/

python demo/demo.py --config-file configs/BAText/VinText/attn_R_50.yaml --input sample_input/ --output sample_output/ --opts MODEL.WEIGHTS path-to-trained_model-checkpoint


Qualitative Results on VinText.

Training and Evaluation

Training

For training, we employed the pre-trained model tt_attn_R_50 from the ABCNet repository for initialization.

python tools/train_net.py --config-file configs/BAText/VinText/attn_R_50.yaml MODEL.WEIGHTS path_to_tt_attn_R_50_checkpoint

Example:

python tools/train_net.py --config-file configs/BAText/VinText/attn_R_50.yaml MODEL.WEIGHTS ./tt_attn_R_50.pth

Trained model output will be saved in the folder output/batext/vintext/ that is then used for evaluation

Evaluation

python tools/train_net.py --eval-only --config-file configs/BAText/VinText/attn_R_50.yaml MODEL.WEIGHTS path_to_trained_model_checkpoint

Example:

python tools/train_net.py --eval-only --config-file configs/BAText/VinText/attn_R_50.yaml MODEL.WEIGHTS ./output/batext/vintext/trained_model.pth

Acknowledgement

This repository is built based-on ABCNet

Vietnamese Language Detection and Recognition

Related tags

Overview

Table of Content

Dictionary-guided Scene Text Recognition

Dataset

VinText

Getting Started

Requirements

Installation

Check out the code and install:

Download vintext pre-trained model

Usage

Training and Evaluation

Training

Evaluation

Acknowledgement

Owner

天池2021"全球人工智能技术创新大赛"【赛道一】：医学影像报告异常检测 - 第三名解决方案

Msos searcher - A half-hearted attempt at finding a magic square of squares

Learning Camera Localization via Dense Scene Matching, CVPR2021

Toolbox for OCR post-correction

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.

Image Smoothing and Blurring Using OpenCV

Markup for note taking

FOTS Pytorch Implementation

Character Segmentation using TensorFlow

Text page dewarping using a "cubic sheet" model

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Automatically fishes for you while you are afk :)

Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

Some codes from PyImageSearch course's and external projects.

MeshToGeotiff - A fast Python algorithm to convert a 3D mesh into a GeoTIFF

Generates a message from the infamous Jerma Impostor image

Convert Text-to Handwriting Using Python

Hiiii this is the Spanish for Linux and win 10 and in the near future the english version of PortScan my new tool on which you can see what ports are Open only with the IP adress.

Source code of RRPN ---- Arbitrary-Oriented Scene Text Detection via Rotation Proposals