ArcaneGAN by Alex Spirin

Last update: Dec 28, 2022

Related tags

Deep Learning ArcaneGAN

Overview

ArcaneGAN by Alex Spirin

Changelog

2021-12-12 ArcaneGAN v0.3 is live
2021-12-09 Thanks to ak92501 we now have a huggingface demo

ArcaneGAN v0.3

Videos processed by the huggingface video inference colab.

obama2.mp4

ryan2.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

ArcaneGAN v0.2

The release is here

Implementation Details

It does something, but not much at the moment.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Comments

How to convert the FastAI model to Pytorch JIT

Hi,

I trained a model with unet_learner but I can't convert it to jit.

I run the following code: torch.jit.save(torch.jit.script(learn.model), 'jit.pt')

Here is the error:

UnsupportedNodeError: GeneratorExp aren't supported: File "/usr/local/lib/python3.7/dist-packages/fastai/callbacks/hooks.py", line 21 "Applieshook_functomodule,input,output." if self.detach: input = (o.detach() for o in input ) if is_listy(input ) else input.detach() ~ <--- HERE output = (o.detach() for o in output) if is_listy(output) else output.detach() self.stored = self.hook_func(module, input, output)

May I know how you convert it to a jit model? Thanks

opened by ramtiin 2
Ошибка

Добрый вечер.В ArcaneGAN на colab for videos,выдаёт ошибку:

RuntimeError: CUDA out of memory. Tried to allocate 2.80 GiB (GPU 0; 11.17 GiB total capacity; 5.74 GiB already allocated; 2.21 GiB free; 8.44 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Помогите пожалуйста!

opened by Zzip7 2
How do you change the style of the whole image

Nice work! My only confusion is how you change the style of the whole image instead of just the face. Usually, StyleGAN generates aligned face images by fine-tuning the FFHQ checkpoint. How does the pix2pix model trained with these face image pairs work with the full image or frame.

opened by zhanglonghao1992 2
Architecture for video

Hi, what does the architecture look like? Is it similar to Pix2Pix? And for processing of the video, are you doing anything extra to make sure the frames are consistent?

opened by unography 2
How to prevent eyes occur in nose?

Hello, I try your model and it's amazing, but I find in some pictures if the nose is too big, there will be eyes in the nose. I try to lower the 'target_face' and it can work. But the details like the light of the eyes and background will also lose when I lower the 'target_face'. So I wonder is there a way to prevent the eyes occurs in the nose and keep the details in the meantime?

opened by Folkfive 1
support arbitrary image size?

Great work!

The unet prediction result will be cropped to be the same size as the training input, e.g. 256 or 512. For arbitrary image size (e.g. 1280*720), how to config or set the model to output the same size of the input image as your colab did? Thank you.

opened by foobarhe 1
RuntimeError: CUDA out of memory

Добрый вечер.Извините,это опять я.Снова эта ошибка появляется.Можно ли,самому эту ошибку решать?Или исправлять можете только вы?Обьясните пожалуйста подробно.

opened by Zzip7 1
about the paired datasets generated by stylegan

how do you make sure the background and expression similarity between the generated input(face) and target(style face) ? I find that the style is too weak when less finetune and the similarity is too weak when more finetune, how do you solve it ? Would you like to share the paired datasets generated code with me ? thanks a lot ~

opened by Leocien 1
Any news for training code?

Interesting topic... I wonder how you trained the model, especially the augmentation part. Fixed crop limitation is a well-known problem and would like to know how you handle it. :)

opened by dongyun-kim-arch 0
tuple issue

Was trying the ArcaneGan video colab but I am having a tuple issue can you please help, i am really excited to try the Arcane video can you please help out

opened by mau021 0
What GPU is used for training?

Hi,

I want to train the Fastai u-net model. However, when I try to train the critic (learn_critic.fit_one_cycle(6, 1e-3)), I get the following error:

CUDA out of memory. Tried to allocate 4.00 GiB (GPU 0; 14.76 GiB total capacity; 9.78 GiB already allocated; 891.75 MiB free; 12.57 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

The GPU is a Tesla T4 with 16 GB of VRAM. My batch size is 4 and the training images size is 512*512. I also tried with lower numbers, but I'm still getting the same error.

opened by ramtiin 2
How to make the style stronger?

The following are input image, my training output from pair label supervision, and the output from your test model。 I trained my model (Super-Resolution model) on the images from your model outputs, I find it difficult to change the facial features。 Like the eyes and face texture are changed, how to do it ? I use L1Loss (weight is 1) + PerceptualLoss (weight is 1)+ GANLoss (weight is 0.1),

opened by xuanandsix 1

Releases(v0.4)

v0.4(Dec 25, 2021)
ArcaneGAN v0.4

The main differences are:

lighter styling (closer to original input)

sharper result

happier faces

reduced childish eyes effect

reduced stubble on feminine faces

increased temporal stability on videos

reduced mouth\teeth artifacts

Image samples

v0.3 vs v0.4

Video samples

https://user-images.githubusercontent.com/11751592/146966428-f4e27929-19dd-423f-a772-8aee709d2116.mp4

https://user-images.githubusercontent.com/11751592/146966462-6511998e-77f5-4fd2-8ad9-5709bf0cd172.mp4
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.4.jit(59.75 MB)
v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

This is a stronger-styled version. It performs okay on videos, though visible flickering is present. Here are some video examples.

https://user-images.githubusercontent.com/11751592/145702737-c02b8b00-ad30-4358-98bf-97c8ad7fefdf.mp4

https://user-images.githubusercontent.com/11751592/145702740-afd3377d-d117-467d-96ca-045e25d85ac6.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.3.jit(79.40 MB)
v0.2(Dec 7, 2021)

ArcaneGAN v0.2 This version is a bit better at doing something other than making images darker :D

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.2.jit(79.52 MB)
v0.1(Dec 6, 2021)

ArcaneGAN v0.1 This is a proof of concept release. The model is in beta (which means it's beta than nothin')

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

It does something, but not much at the moment.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.1.jit(79.53 MB)

Owner

Alex

GitHub Repository

Composing methods for ML training efficiency

MosaicML Composer contains a library of methods, and ways to compose them together for more efficient ML training.

2.8k Jan 08, 2023

🏖 Keras Implementation of Painting outside the box

Keras implementation of Image OutPainting This is an implementation of Painting Outside the Box: Image Outpainting paper from Standford University. So

1.1k Dec 10, 2022

TorchMetrics is a collection of 25+ PyTorch metrics implementations and an easy-to-use API to create custom metrics.

Machine learning metrics for distributed, scalable PyTorch applications.

1.2k Jan 06, 2023

Medical image analysis framework merging ANTsPy and deep learning

ANTsPyNet A collection of deep learning architectures and applications ported to the python language and tools for basic medical image processing. Bas

118 Dec 24, 2022

ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical Representation

ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical Representation (Accepted by BMVC'21) Abstract: Images acquir

10 Dec 08, 2022

Posterior temperature optimized Bayesian models for inverse problems in medical imaging

Posterior temperature optimized Bayesian models for inverse problems in medical imaging Max-Heinrich Laves*, Malte Tölle*, Alexander Schlaefer, Sandy

6 Sep 19, 2022

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

Exploring Classification Equilibrium in Long-Tailed Object Detection (LOCE, ICCV 2021) Paper Introduction The conventional detectors tend to make imba

52 Nov 21, 2022

Spam your friends and famly and when you do your famly will disown you and you will have no friends.

SpamBot9000 Spam your friends and family and when you do your family will disown you and you will have no friends. Terms of Use Disclaimer: Please onl

0 Jun 09, 2022

《DeepViT: Towards Deeper Vision Transformer》(2021)

DeepViT This repo is the official implementation of "DeepViT: Towards Deeper Vision Transformer". The repo is based on the timm library (https://githu

109 Dec 02, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image.

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

7 May 29, 2022

Neural-fractal - Create Fractals Using Complex-Valued Neural Networks!

Neural Fractal Create Fractals Using Complex-Valued Neural Networks! Home Page Features Define Dynamical Systems Using Complex-Valued Neural Networks

10 Dec 17, 2022

Testbed of AI Systems Quality Management

qunomon Description A testbed for testing and managing AI system qualities. Demo Sorry. Not deployment public server at alpha version. Requirement Ins

15 Nov 27, 2021

PyTorch implementation for 3D human pose estimation

Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach This repository is the PyTorch implementation for the network presented in:

579 Dec 22, 2022

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

DCSR: Dual Camera Super-Resolution Implementation for our ICCV 2021 oral paper: Dual-Camera Super-Resolution with Aligned Attention Modules paper | pr

110 Dec 20, 2022

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation

76 Dec 05, 2022

Gradient Inversion with Generative Image Prior

Gradient Inversion with Generative Image Prior This repository is an implementation of "Gradient Inversion with Generative Image Prior", accepted to N

25 Jan 09, 2023

Winning Solution in NTIRE19 Challenges on Video Restoration and Enhancement (CVPR19 Workshops) - Video Restoration with Enhanced Deformable Convolutional Networks. EDVR has been merged into BasicSR and this repo is a mirror of BasicSR.

EDVR has been merged into BasicSR. This GitHub repo is a mirror of BasicSR. Recommend to use BasicSR, and open issues, pull requests, etc in BasicSR.

1.4k Dec 25, 2022

ArcaneGAN by Alex Spirin

Related tags

Overview

ArcaneGAN by Alex Spirin

ArcaneGAN v0.3

Image samples

ArcaneGAN v0.2

Implementation Details

Comments

Releases(v0.4)

v0.4(Dec 25, 2021)

ArcaneGAN v0.4

Image samples

Video samples

v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

Image samples

v0.2(Dec 7, 2021)

v0.1(Dec 6, 2021)

Owner

Alex

Composing methods for ML training efficiency

🏖 Keras Implementation of Painting outside the box

TorchMetrics is a collection of 25+ PyTorch metrics implementations and an easy-to-use API to create custom metrics.

Medical image analysis framework merging ANTsPy and deep learning

ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical Representation

Posterior temperature optimized Bayesian models for inverse problems in medical imaging

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

Spam your friends and famly and when you do your famly will disown you and you will have no friends.

《DeepViT: Towards Deeper Vision Transformer》(2021)

This project uses Template Matching technique for object detecting by detection of template image over base image.

Neural-fractal - Create Fractals Using Complex-Valued Neural Networks!

Testbed of AI Systems Quality Management

PyTorch implementation for 3D human pose estimation

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

Gradient Inversion with Generative Image Prior

Winning Solution in NTIRE19 Challenges on Video Restoration and Enhancement (CVPR19 Workshops) - Video Restoration with Enhanced Deformable Convolutional Networks. EDVR has been merged into BasicSR and this repo is a mirror of BasicSR.

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

A Home Assistant custom component for Lobe. Lobe is an AI tool that can classify images.

Interpretation of T cell states using reference single-cell atlases