This project helps to colorize grayscale images using multiple exemplars.

Last update: Aug 05, 2022

Overview

Multiple Exemplar-based Deep Colorization (Pytorch Implementation)

Pretrained Model

Jitendra Chautharia (IIT Jodhpur)

Prerequisites

Python 3.6+
Nvidia GPU + CUDA, CuDNN

Installation

First use the following commands to prepare the environment:

conda create -n ColorVid python=3.6
source activate ColorVid
pip install -r requirements.txt

Then, download the pretrained models from this link, unzip the file and place the files into the corresponding folders:

video_moredata_l1 under the checkpoints folder
vgg19_conv.pth and vgg19_gray.pth under the data folder
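
A quick way to confirm that the files landed in the right places is a small check like the one below. This is a minimal sketch, not part of the project: it assumes the paths are relative to the repository root, and the helper name `missing_pretrained_files` is hypothetical.

```python
from pathlib import Path

# Expected locations, taken from the placement list above.
EXPECTED = (
    Path("checkpoints") / "video_moredata_l1",
    Path("data") / "vgg19_conv.pth",
    Path("data") / "vgg19_gray.pth",
)


def missing_pretrained_files(repo_root="."):
    """Return the expected paths (POSIX style) that do not exist under repo_root."""
    root = Path(repo_root)
    return [p.as_posix() for p in EXPECTED if not (root / p).exists()]


if __name__ == "__main__":
    missing = missing_pretrained_files()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All pretrained files found.")
```

Running the script from the repository root lists whatever is still missing, so a misplaced file shows up before the first test run.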

Data Preparation

To colorize your own video, extract the video frames and provide at least one reference image as an exemplar.

Place your target grayscale images into one folder, e.g., ./exp_sample/target
Place your reference images into another folder, e.g., ./exp_sample/references
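
Frame extraction can be sketched as follows. The project does not say which tool to use; this example assumes `ffmpeg` is installed and on the PATH, and the function names, the `input.mp4` file name, and the `%05d.jpg` naming pattern are all hypothetical. Whether test.py expects a particular file extension or numbering is not stated, so adjust the pattern if needed.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path, out_dir, pattern="%05d.jpg"):
    """Build an ffmpeg command that writes one numbered image per frame."""
    return ["ffmpeg", "-i", str(video_path), str(Path(out_dir) / pattern)]


def extract_frames(video_path, out_dir):
    """Create out_dir and run ffmpeg; raises CalledProcessError on failure."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_ffmpeg_command(video_path, out_dir), check=True)


if __name__ == "__main__":
    # "input.mp4" is a placeholder; print the command instead of running it.
    print(" ".join(build_ffmpeg_command("input.mp4", "./exp_sample/target")))
```

Calling `extract_frames("input.mp4", "./exp_sample/target")` would fill the target folder with one image per frame.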

To retrieve color reference images automatically, you can try the retrieval algorithm from this link, which retrieves similar images from the ImageNet dataset, or try this link on your own image database.

Test

python test.py --image-size [image-size] \
               --clip_path [path-to-target-grayscale-image] \
               --ref_path [path-to-reference] \
               --output_path [path-to-output]

We provide several sample video clips with corresponding references. For example, one can colorize one sample legacy video using:

python test.py --clip_path ./exp_sample/target \
               --ref_path ./exp_sample/references \
               --output_path ./exp_sample/output
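
To colorize several clips in one go, the same command can be driven from a short script. This is a sketch under an assumed layout that the project does not prescribe: one subfolder per clip, each containing `target/` and `references/`. The helper names are hypothetical; only the three flags come from the command above.

```python
import subprocess
import sys
from pathlib import Path


def build_test_command(clip_dir, ref_dir, out_dir):
    """Assemble the test.py invocation shown above for one clip."""
    return [
        sys.executable, "test.py",
        "--clip_path", str(clip_dir),
        "--ref_path", str(ref_dir),
        "--output_path", str(out_dir),
    ]


def colorize_all(samples_root):
    """Run test.py once per subfolder that has target/ and references/."""
    for sample in sorted(Path(samples_root).iterdir()):
        target, refs = sample / "target", sample / "references"
        if target.is_dir() and refs.is_dir():
            cmd = build_test_command(target, refs, sample / "output")
            subprocess.run(cmd, check=True)
```

`colorize_all` has to be called from the directory that contains test.py, since the script name is passed as a relative path.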

Note that we use 216*384 images for training, which have an aspect ratio of 9:16. During inference, we scale the input to this size and then rescale the output back to the original size.
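
The resize round trip can be illustrated with a small pure-Python sketch. It uses nearest-neighbor sampling on nested lists purely for illustration; the interpolation the project actually uses is not stated here, and `model` is a hypothetical callable standing in for the colorization network.

```python
from fractions import Fraction

TRAIN_H, TRAIN_W = 216, 384  # training resolution from the note above


def resize_nearest(img, out_h, out_w):
    """Nearest-neighbor resize of a 2D list-of-lists image."""
    in_h, in_w = len(img), len(img[0])
    return [
        [img[(y * in_h) // out_h][(x * in_w) // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]


def colorize_at_training_size(gray, model):
    """Scale to the training size, apply the model, scale back."""
    orig_h, orig_w = len(gray), len(gray[0])
    small = resize_nearest(gray, TRAIN_H, TRAIN_W)
    result = model(small)  # hypothetical stand-in for the network
    return resize_nearest(result, orig_h, orig_w)


if __name__ == "__main__":
    # Reduce 216/384 to lowest terms to get the aspect ratio.
    print(Fraction(TRAIN_H, TRAIN_W))
    frame = [[row * 8 + col for col in range(8)] for row in range(4)]
    restored = colorize_at_training_size(frame, lambda x: x)
    print(len(restored), len(restored[0]))
```

Because every input is forced to the training size, a frame whose aspect ratio differs from the training one is stretched before colorization and unstretched afterwards.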

Train

We also provide training code for reference. The training can be started by running:

python [training script] --data_root [root of video samples] \
       --data_root_imagenet [root of image samples] \
       --gpu_ids [gpu ids]

We do not provide the full video dataset due to copyright issues. For image samples, we retrieve semantically similar images from ImageNet using this repository. The code still shows the detailed procedure for augmenting the image dataset to mimic video frames.
