PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Last update: Jan 05, 2023

Related tags

Text Data & NLP SITT

Overview

SITT

The repo contains official PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation.

Authors:

Overview

Recent advances in image synthesis enables one to translate images by learning the mapping between a source domain and a target domain. Existing methods tend to learn the distributions by training a model on a variety of datasets, with results evaluated largely in a subjective manner. Relatively few works in this area, however, study the potential use of semantic image translation methods for image recognition tasks. In this paper, we explore the use of Single Image Texture Translation (SITT) for data augmentation. We first propose a lightweight model for translating texture to images based on a single input of source texture, allowing for fast training and testing. Based on SITT, we then explore the use of augmented data in long-tailed and few-shot image classification tasks. We find the proposed method is capable of translating input data into a target domain, leading to consistent improved image recognition performance. Finally, we examine how SITT and related image translation methods can provide a basis for a data-efficient, augmentation engineering approach to model training.

Usage

Environment

CUDA 10.1, pytorch 1.3.1

Dataset Preparation

	dataset	url
0	SITT leaves images from Plant Pathology 2020	download

Running

bash run.sh

More will be updated

If you find this repo useful, please cite:

@article{li2021single,
  title={Single Image Texture Translation for Data Augmentation},
  author={Li, Boyi and Cui, Yin and Lin, Tsung-Yi and Belongie, Serge},
  journal={arXiv preprint arXiv:2106.13804},
  year={2021}
}

PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Related tags

Overview

SITT

Authors:

Overview

Usage

Environment

Dataset Preparation

Running

More will be updated

Owner

Boyi Li

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Based on 125GB of data leaked from Twitch, you can see their monthly revenues from 2019-2021

Code for paper Multitask-Finetuning of Zero-shot Vision-Language Models

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

Enterprise Scale NLP with Hugging Face & SageMaker Workshop series

NLP Overview

Pre-training BERT masked language models with custom vocabulary

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

OpenAI CLIP text encoders for multiple languages!

Extracting Summary Knowledge Graphs from Long Documents

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Utilities for preprocessing text for deep learning with Keras

The aim of this task is to predict someone's English proficiency based on a text input.

Python3 to Crystal Translation using Python AST Walker

a test times augmentation toolkit based on paddle2.0.

Code release for "COTR: Correspondence Transformer for Matching Across Images"

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !