Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Last update: Sep 13, 2022

Overview

Train aug_clip against the laion400m embeddings, available here: https://laion.ai/laion-400-open-dataset/. Note that this used the base ViT-B/32 CLIP model.
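
As a rough illustration of what "training a simple model to predict image embeddings from text embeddings" can look like, here is a minimal sketch. It is not the aug_clip training code: it swaps in a plain least-squares linear map fitted with NumPy, and it uses synthetic stand-in data instead of the real laion400m embeddings. All function names below are hypothetical; the only fact taken from CLIP itself is that ViT-B/32 embeddings are 512-dimensional.

```python
import numpy as np

EMBED_DIM = 512  # ViT-B/32 CLIP embeddings are 512-dimensional


def l2_normalize(x):
    """Scale each row to unit length, as is usual for CLIP embeddings."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def fit_linear_map(text_emb, image_emb):
    """Least-squares W such that text_emb @ W approximates image_emb."""
    W, *_ = np.linalg.lstsq(text_emb, image_emb, rcond=None)
    return W


def predict_image_emb(text_emb, W):
    """Map text embeddings through W and renormalize the result."""
    return l2_normalize(text_emb @ W)


def mean_cosine(a, b):
    """Average row-wise cosine similarity between two embedding batches."""
    return float(np.mean(np.sum(l2_normalize(a) * l2_normalize(b), axis=1)))


def make_synthetic_pairs(n, dim=EMBED_DIM, seed=0):
    """Synthetic stand-in for paired embeddings (assumption: NOT real data)."""
    rng = np.random.default_rng(seed)
    text = l2_normalize(rng.standard_normal((n, dim)))
    true_map = rng.standard_normal((dim, dim)) / np.sqrt(dim)
    noise = 0.01 * rng.standard_normal((n, dim))
    image = l2_normalize(text @ true_map + noise)
    return text, image


if __name__ == "__main__":
    # Small dimension here only to keep the demo fast.
    text, image = make_synthetic_pairs(500, dim=32)
    W = fit_linear_map(text[:400], image[:400])
    pred = predict_image_emb(text[400:], W)
    print("held-out mean cosine similarity:", mean_cosine(pred, image[400:]))
```

For the "vice versa" direction, the same sketch applies with the arguments swapped, i.e. fit_linear_map(image_emb, text_emb). With the real dataset, the synthetic arrays would be replaced by the text and image embedding arrays loaded from the laion400m embedding files.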

Sample notebook adapted from Sadnow's 360Diffusion repo, thanks to all involved!

Latest revision: Beta 1.52 (10/11/21): https://colab.research.google.com/github/sadnow/360Diffusion/blob/main/360Diffusion_Public.ipynb

Latest highlights: Full compatibility with both the 256 and 512 models for upscaling to 256, 512, 1024, 2048, and 4096 px.

Note that 4096 px outputs aren’t quite as pretty as 2048 px ones, and the files are massive. 2048 px is appealing in most cases. If you intend to upscale to anything higher than 1024 px, I recommend using the 512 diffusion model found in the settings.

Credits & Acknowledgements

Katherine Crowson (https://github.com/crowsonkb, https://twitter.com/RiversHaveWings): Founder of OG Diffusion Notebook. Original notebook founder; [I think] has a large involvement in both VQGAN and Diffusion!
Daniel Russell (https://github.com/russelldc, https://twitter.com/danielrussruss): Fast Diffusion Fork Founder. Made the OG Fast Diffusion notebook.
Dango233 and nsheppard: Contributed to Daniel’s Fast Diffusion Notebook.
Sadnow (twitter.com/sadly_existent): 360Diffusion Fork Founder. Forked Daniel Russell’s Fast Diffusion Notebook to include Real-ESRGAN integration.
airguitararchon (steven): Init Research.
Everyone else on the VQLIPSE Discord (https://www.patreon.com/sportsracer48): Support & Research.

Prior release(s): Implemented Daniel Russ’s Perlin revisions, fixed init_bug, 4096 double-pass, VRAM fixes, practical debug_mode (set to higher skip_timestep)

All edits & additions are welcome and appreciated~

Owner

Peter Baylies
