A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Last update: Jan 05, 2023

Related tags

Overview

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Jianqi Ma, Zhetong Liang, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China & OPPO Research

Recovering TextZoom samples

Environment:

Other possible python packages like pyyaml, cv2, Pillow and imgaug

Main idea

The pipeline

TP Interpreter

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN:  https://github.com/Canjie-Luo/MORAN_v2  
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the codes and walk into the ' $TATT_ROOT$ /', place the pretrained weights from recognizer in ' $TATT_ROOT$ /'.

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TATT.sh
./train_TATT.sh

Run the test-prefixed shell to test the corresponding model.

Adding '--go_test' in the shell file

Cite this paper:

@article{ma2021text,
title={A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution},
author={Ma, Jianqi and Zhetong, Liang and Zhang, Lei},
journal={},
year={2022}
}

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Related tags

Overview

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Recovering TextZoom samples

Environment:

Main idea

The pipeline

TP Interpreter

Configure your training

Download the pretrained recognizer from:

Download the TextZoom dataset:

Train the corresponding model (e.g. TPGSR-TSRN):

Run the test-prefixed shell to test the corresponding model.

Cite this paper:

Owner

MA Jianqi, shiki

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

RCD: Relation Map Driven Cognitive Diagnosis for Intelligent Education Systems

Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

A simple program for training and testing vit

[ICLR'19] Trellis Networks for Sequence Modeling

Codeflare - Scale complex AI/ML pipelines anywhere

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Syntax-Aware Action Targeting for Video Captioning

A Framework for Encrypted Machine Learning in TensorFlow

FwordCTF 2021 Infrastructure and Source code of Web/Bash challenges

A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API

Tensorflow 2.x implementation of Vision-Transformer model

[TIP2020] Adaptive Graph Representation Learning for Video Person Re-identification

Problem-943.-ACMP - Problem 943. ACMP

Build and run Docker containers leveraging NVIDIA GPUs

PyTorch experiments with the Zalando fashion-mnist dataset

Code for "Localization with Sampling-Argmax", NeurIPS 2021

A python-image-classification web application project, written in Python and served through the Flask Microframework. This Project implements the VGG16 covolutional neural network, through Keras and Tensorflow wrappers, to make predictions on uploaded images.

Tensorflow Tutorials using Jupyter Notebook

Keras code and weights files for popular deep learning models.