CNN+Attention+Seq2Seq

Last update: Jul 14, 2022

Related tags

Computer Vision Attention_OCR

Overview

Attention_OCR

CNN+Attention+Seq2Seq

The model and its tensor transformations are shown in the figure below.
The image paths in the ch_train and ch_test text files must be changed to match your own file path format.
One image referenced by the originally provided test set is missing, and the image dataset contains one empty image.
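
As a minimal sketch of the path change described above, the following rewrites the directory part of every image path in a label file while keeping the file name and the label text. It assumes each line has the form "image path, one space, label" and that the image path itself contains no spaces. The function names and the example root /data/AttentionData are hypothetical and not part of the repository.

```python
from pathlib import PurePosixPath


def rewrite_label_line(line: str, new_root: str) -> str:
    """Swap the directory of the image path for new_root; keep file name and label."""
    line = line.rstrip("\n")
    if not line.strip():
        return line  # leave blank lines untouched
    # Assumption: the first space separates the image path from the label.
    path, sep, label = line.partition(" ")
    new_path = PurePosixPath(new_root) / PurePosixPath(path).name
    return f"{new_path}{sep}{label}"


def rewrite_label_file(src: str, dst: str, new_root: str) -> None:
    """Write a copy of the label file src to dst with every path moved under new_root."""
    with open(src, encoding="utf-8") as fin, open(dst, "w", encoding="utf-8") as fout:
        for line in fin:
            fout.write(rewrite_label_line(line, new_root) + "\n")
```

Writing to a separate output file keeps the original label file intact, so the rewrite can be repeated with a different root if the data is moved again.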

The paths in the text files look like this:

/mnt/disk2/std2021/hejiabang-data/OCR/attention_img/AttentionData/59041171_106970752.jpg 项链付出了十年的苦役
/mnt/disk2/std2021/hejiabang-data/OCR/attention_img/AttentionData/38115031_1485663711.jpg 。直到台“国防部长”
/mnt/disk2/std2021/hejiabang-data/OCR/attention_img/AttentionData/22905328_1196841476.jpg 有惊无险地以21比1
/mnt/disk2/std2021/hejiabang-data/OCR/attention_img/AttentionData/41681796_2460379288.jpg 尼在门前两米处上演“
....
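
The lines above can be read into (path, label) pairs with a few lines of Python. The sketch below also skips entries whose image file is missing or has zero bytes, which is one possible way to handle the missing and empty images mentioned earlier. It assumes that "empty" means a zero-byte file and that the first space separates path and label; the function names are hypothetical.

```python
import os
from typing import Iterable, List, Optional, Tuple


def parse_label_line(line: str) -> Optional[Tuple[str, str]]:
    """Split one label line into (image_path, label); return None if malformed."""
    line = line.rstrip("\n")
    path, sep, label = line.partition(" ")
    if not sep or not path:
        return None  # no separator found, e.g. a blank or placeholder line
    return path, label


def usable_samples(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Keep only samples whose image file exists and is not zero bytes."""
    samples = []
    for line in lines:
        parsed = parse_label_line(line)
        if parsed is None:
            continue
        path, label = parsed
        # Assumption: an "empty picture" is a zero-byte file on disk.
        if not os.path.isfile(path) or os.path.getsize(path) == 0:
            continue
        samples.append((path, label))
    return samples
```

Filtering once before training avoids a crash in the data loader partway through an epoch. An image that has a valid size on disk but cannot be decoded would need an additional check with an image library.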

The training results are as follows

Owner

Tsukinousag1

An advanced 2D image manipulation with features such as edge detection and image segmentation built using OpenCV

OpenCV-ToothPaint3-Advanced-Digital-Image-Editor This application named ‘Tooth Paint’ version TP_2020.3 (64-bit) or version 3 was developed within a w

1 Nov 05, 2021

Sign language recognition project based on openpose and image classification

Sign Language Recognition 0. Models used (1). openpose, author: CMU-Perceptual-Computing-Lab https://github.com/CMU-Perceptual-Computing-Lab/openpose (2). Image classification, author: Bubbl

20 Dec 15, 2022

Sign Language Recognition service utilizing a deep learning model with Long Short-Term Memory to perform sign language recognition.

Sign Language Recognition Service This is a Sign Language Recognition service utilizing a deep learning model with Long Short-Term Memory to perform s

1 Jan 08, 2022

a micro OCR network with 0.07mb params.

MicroOCR a micro OCR network with 0.07mb params. Layer (type) Output Shape Param # Conv2d-1 [-1, 64, 8,

29 Aug 06, 2022

Converts an image into funny, smaller amongus characters

SussyImage Converts an image into funny, smaller amongus characters Demo Mona Lisa | Lona Misa (Made up of AmongUs characters) API I've also added an

14 Aug 18, 2022

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels" Please refer to htt

1 Feb 14, 2022

Aloception is a set of packages for computer vision: aloscene, alodataset, alonet.

86 Dec 28, 2022

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

Virtual Mouse Using OpenCV In this project we will be using the live feed coming from the webcam to create a virtual mouse using hand tracking. Projec

8 Dec 20, 2022

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

TableNet Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from

243 Dec 30, 2022

Pixel art search engine for opengameart

Pixel Art Reverse Image Search for OpenGameArt What does the final search look like? The final search with an example can be found here. It looks like

92 Nov 06, 2022

A version of nrsc5-gui that merges the interface developed by cmnybo with the architecture developed by zefie in order to start a new baseline that is not heavily dependent upon Python processing.

NRSC5-DUI is a graphical interface for nrsc5. It makes it easy to play your favorite FM HD radio stations using an RTL-SDR dongle. It will also displa

61 Dec 22, 2022

An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

PyTorch implementation of Learning by Aligning (ICCV 2021) This is an official PyTorch implementation of the paper "Learning by Aligning: Visible-Infr

30 Nov 05, 2022

A tool to make dumpy among us GIFS

Among Us Dumpy Gif Maker Made by ThatOneCalculator & Pixer415 With help from Telk, karl-police, and auguwu! Please credit this repository when you use

535 Jan 07, 2023

This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"

StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image

840 Dec 26, 2022

OCR, Scene-Text-Understanding, Text Recognition

Scene-Text-Understanding Survey [2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper [2014-Front.Comput.Sci] Scene Text Detection and

354 Dec 12, 2022

Generate text images for training deep learning ocr model

New version release: https://github.com/oh-my-ocr/text_renderer Text Renderer Generate text images for training deep learning OCR model (e.g. CRNN). Su

1.2k Jan 04, 2023

QED-C: The Quantum Economic Development Consortium provides these computer programs and software for use in the fields of quantum science and engineering.

Application-Oriented Performance Benchmarks for Quantum Computing This repository contains a collection of prototypical application- or algorithm-cent

67 Nov 30, 2022

SemTorch

Deep learning based page layout analysis

FastOCR is a desktop application for OCR API.