Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)

Last update: Jan 02, 2023

Related tags

Deep Learning pren

Overview

Primitive Representation Learning Network (PREN)

This repository contains the code for our paper accepted by CVPR 2021

Primitive Representation Learning for Scene Text Recognition

Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao

For now we only provide code for PREN.

Requirements

python 3.7.9, pytorch 1.4.0, and torchvision 0.5.0
other libraries can be installed by

pip install -r requirements.txt

Recognition with pretrained model

We provide code for using our pretrained model to recognize text images.

The pretrained model can be downloaded via Baidu net disk: download_link key: 2txt
After downloading the pretrained model (pren.pth), put it in the "models" folder.
To recognize three samples in the "samples" folder, just run

python recog.py

The results would be

[Info] Load model from ./models/pren.pth
samples/001.jpg: ronaldo
samples/002.png: leaves
samples/003.jpg: salmon

Training

Two simple steps to train your own model:

Modify training configurations in Configs/trainConf.py
Run python train.py

To run the training code, please modify image_dir and train_list to your own training data.

image_dir is the path of training data root.

train_list is the path of a text file containing image paths (relative to image_dir) and corresponding labels.

For example, image_dir could be './samples', and train_list could be a text file with the following content

001.jpg RONALDO
002.png LEAVES
003.jpg SALMON

Evaluation

Similar to train, one can modify Configs/testConf.py and run python test.py to evaluate a model.

Acknowledgement

The code of EfficientNet is modified from EfficientNet-PyTorch, where we output multi-scale feature maps.

Citation

If you find this project helpful for your research, please cite our paper

@inproceedings{yan2021primitive,
  author    = {Yan, Ruijie and
               Peng, Liangrui and
               Xiao, Shanyu and
               Yao, Gang},
  title     = {Primitive Representation Learning for Scene Text Recognition},
  booktitle = {CVPR},
  year      = {2021}
}

Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)

Related tags

Overview

Primitive Representation Learning Network (PREN)

Requirements

Recognition with pretrained model

Training

Evaluation

Acknowledgement

Citation

Owner

Ruijie Yan

Semantic Segmentation of images using PixelLib with help of Pascalvoc dataset trained with Deeplabv3+ framework.

A state-of-the-art semi-supervised method for image recognition

CTRL-C: Camera calibration TRansformer with Line-Classification

Code for "R-GCN: The R Could Stand for Random"

Implementation of ProteinBERT in Pytorch

Shape Matching of Real 3D Object Data to Synthetic 3D CADs (3DV project @ ETHZ)

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

This repository contains code used to audit the stability of personality predictions made by two algorithmic hiring systems

A lightweight tool to get an AI Infrastructure Stack up in minutes not days.

Implementation of Self-supervised Graph-level Representation Learning with Local and Global Structure (ICML 2021).

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

Skipgram Negative Sampling in PyTorch

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

hySLAM is a hybrid SLAM/SfM system designed for mapping

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

DP-CL(Continual Learning with Differential Privacy)

Official code for ICCV2021 paper "M3D-VTON: A Monocular-to-3D Virtual Try-on Network"

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

An Unpaired Sketch-to-Photo Translation Model

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥

Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)

Related tags

Overview

Primitive Representation Learning Network (PREN)

Requirements

Recognition with pretrained model

Training

Evaluation

Acknowledgement

Citation

Owner

Ruijie Yan

Semantic Segmentation of images using PixelLib with help of Pascalvoc dataset trained with Deeplabv3+ framework.

A state-of-the-art semi-supervised method for image recognition

CTRL-C: Camera calibration TRansformer with Line-Classification

Code for "R-GCN: The R Could Stand for Random"

Implementation of ProteinBERT in Pytorch

Shape Matching of Real 3D Object Data to Synthetic 3D CADs (3DV project @ ETHZ)

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

This repository contains code used to audit the stability of personality predictions made by two algorithmic hiring systems

A lightweight tool to get an AI Infrastructure Stack up in minutes not days.

Implementation of Self-supervised Graph-level Representation Learning with Local and Global Structure (ICML 2021).

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

Skipgram Negative Sampling in PyTorch

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

hySLAM is a hybrid SLAM/SfM system designed for mapping

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

DP-CL(Continual Learning with Differential Privacy)

Official code for ICCV2021 paper "M3D-VTON: A Monocular-to-3D Virtual Try-on Network"

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

An Unpaired Sketch-to-Photo Translation Model

FinRL­-Meta: A Universe for Data­-Driven Financial Reinforcement Learning. 🔥

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥