Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Last update: Jan 06, 2023

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Splice is a method for semantic appearance transfer, as described in Splicing ViT Features for Semantic Appearance Transfer (link to paper).

Given two input images—a source structure image and a target appearance image–our method generates a new image in which the structure of the source image is preserved, while the visual appearance of the target image is transferred in a semantically aware manner. That is, objects in the structure image are “painted” with the visual appearance of semantically related objects in the appearance image. Our method leverages a self-supervised, pre-trained ViT model as an external semantic prior. This allows us to train our generator only on a single input image pair, without any additional information (e.g., segmentation/correspondences), and without adversarial training. Thus, our framework can work across a variety of objects and scenes, and can generate high quality results in high resolution (e.g., HD).

Getting Started

Installation

git clone https://github.com/omerbt/Splice.git
pip install -r requirements.txt

Run examples

Run the following command to start training

python train.py --dataroot datasets/cows

Intermediate results will be saved to /out/output.png during optimization. The frequency of saving intermediate results is indicated in the save_epoch_freq flag of the configuration.

Sample Results

Citation

@article{Splice2022,
    author = {Tumanyan, Narek
              and Bar-Tal, Omer
              and Bagon, Shai
              and Dekel, Tali
              },
    title = {Splicing ViT Features for Semantic Appearance Transfer}, 
    journal = {arXiv preprint arXiv:2201.00424},
    year  = {2022}
}

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Related tags

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Getting Started

Installation

Run examples

Sample Results

Citation

Owner

Omer Bar Tal

Annotate with anyone, anywhere.

Matching python environment code for Lux AI 2021 Kaggle competition, and a gym interface for RL models.

Extending JAX with custom C++ and CUDA code

BLEURT is a metric for Natural Language Generation based on transfer learning.

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS.

Shape-Adaptive Selection and Measurement for Oriented Object Detection

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

mmfewshot is an open source few shot learning toolbox based on PyTorch

PlenOctrees: NeRF-SH Training & Conversion

This repo contains research materials released by members of the Google Brain team in Tokyo.

DIVeR: Deterministic Integration for Volume Rendering

Pytorch Implementations of large number classical backbone CNNs, data enhancement, torch loss, attention, visualization and some common algorithms.

Learning Temporal Consistency for Low Light Video Enhancement from Single Images (CVPR2021)

Bald-to-Hairy Translation Using CycleGAN

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Navigating StyleGAN2 w latent space using CLIP

Syllabic Quantity Patterns as Rhythmic Features for Latin Authorship Attribution

Official repository of IMPROVING DEEP IMAGE MATTING VIA LOCAL SMOOTHNESS ASSUMPTION.

Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity

Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models