PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Correspondence) and pre-trained model on ImageNet dataset

Last update: Jul 28, 2022

Overview

Reference-Based-Sketch-Image-Colorization-ImageNet

This is a PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Correspondence)

We will provide pre-trained model on ImageNet dataset shortly

1 Training

Prepare the ImageNet dataset (i.e., upload ILSVRC2012_train_256 folder to your server)
Download the PyTorch official pre-trained VGG-16 model, and then rename it to 'vgg16_pretrained.pth'

(torchvision webpage: https://github.com/pytorch/vision/blob/main/torchvision/models/vgg.py)

(download webpage: https://download.pytorch.org/models/vgg16-397923af.pth) (this is good)

Change the parameter in yaml file and run

(--vgg_name -> your VGG-16 model path)

(--baseroot_train -> your ImageNet dataset path, i.e., ILSVRC2012_train_256 path)

sh sbatch_run.sh or sh local_run.sh

By the way, I use 8 Titan GPUs to train the network with batch size of 32, epoch of 40. It takes approximately 16 days!

The forward of GAN discriminator and VGG-16 take a lot of time, which are used to compute GAN loss and perceptual loss, etc.

2 Validation

Prepare the references with same names to ImageNet test10k
Change the parameter in yaml file and run

sh val_run.sh or sh validation.sh

PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Correspondence) and pre-trained model on ImageNet dataset

Related tags

Overview

Reference-Based-Sketch-Image-Colorization-ImageNet

1 Training

2 Validation

Owner

Yuzhi ZHAO

Attentive Implicit Representation Networks (AIR-Nets)

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

[CVPR 2021] Released code for Counterfactual Zero-Shot and Open-Set Visual Recognition

A visualisation tool for Deep Reinforcement Learning

GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

[ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.

A motion tracking system for any arbitaray points in a video frame.

Implementation of Shape Generation and Completion Through Point-Voxel Diffusion

Code for Two-stage Identifier: "Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition"

Active learning for Mask R-CNN in Detectron2

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

This is the code for our KILT leaderboard submission to the T-REx and zsRE tasks. It includes code for training a DPR model then continuing training with RAG.

Prior-Guided Multi-View 3D Head Reconstruction

Anchor-free Oriented Proposal Generator for Object Detection

Project Aquarium is a SUSE-sponsored open source project aiming at becoming an easy to use, rock solid storage appliance based on Ceph.

Joint Gaussian Graphical Model Estimation: A Survey

Liver segmentation using MONAI and pytorch

Official PyTorch Implementation of Embedding Transfer with Label Relaxation for Improved Metric Learning, CVPR 2021

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"