iftopt

An Implicit Function Theorem (IFT) optimizer for bi-level optimizations.

Requirements

Python 3.7+
PyTorch 1.x

Installation

$ pip install git+https://github.com/money-shredder/iftopt.git

Usage

Consider a bi-level optimization of the form:

y* = argmin_{y} val_loss(x*, y), where x* = argmin_{x} train_loss(x, y).
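For reference, the hyper-gradient that an IFT-based method targets can be written out as below. This is the standard implicit function theorem result, not a description of iftopt's internals. It assumes train_loss is twice differentiable and that its Hessian with respect to x is invertible at x*. The symbols L_T and L_V are shorthand introduced here for train_loss and val_loss.

```latex
\begin{aligned}
L_T(x, y) &= \texttt{train\_loss}(x, y), \qquad L_V(x, y) = \texttt{val\_loss}(x, y), \\
\frac{\partial L_T}{\partial x}\big(x^*(y), y\big) &= 0
\;\Longrightarrow\;
\frac{d x^*}{d y} = -\left[\frac{\partial^2 L_T}{\partial x\,\partial x^\top}\right]^{-1}
\frac{\partial^2 L_T}{\partial x\,\partial y^\top}, \\
\frac{d L_V}{d y} &= \frac{\partial L_V}{\partial y}
+ \frac{\partial L_V}{\partial x}\,\frac{d x^*}{d y}.
\end{aligned}
```

The second term is what lets y receive a gradient even when val_loss does not depend on y directly.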

To solve for the optimal x* and y* in the optimization problem, we can implement the following with iftopt:

import torch
from iftopt import HyperOptimizer
train_lr = val_lr = 0.1
# parameter to minimize the training loss
x = torch.nn.Parameter(...)
# hyper-parameter to minimize the validation loss
y = torch.nn.Parameter(...)
# training loss optimizer
opt = torch.optim.SGD([x], lr=train_lr)
# validation loss optimizer
hopt = HyperOptimizer(
    [y], torch.optim.SGD([y], lr=val_lr), vih_lr=0.1, vih_iterations=5)
# outer optimization loop for y
for _ in range(...):
    # inner optimization loop for x
    for _ in range(...):
        z = train_loss(x, y)
        # inner optimization step for x
        opt.zero_grad()
        z.backward()
        opt.step()
    # outer optimization step for y
    hopt.set_train_parameters([x])
    z = train_loss(x, y)
    hopt.train_step(z)
    v = val_loss(x, y)
    hopt.val_step(v)
    hopt.grad()
    hopt.step()

For a concrete simple example, please check out and run demo.py, where

train_loss = lambda x, y: (x + y) ** 2
val_loss = lambda x, y: x ** 2

with x = y = 1.0 initially. Running it generates a video, demo.mp4, showing the optimization trajectory. Note that although the validation loss has no direct gradient with respect to the hyper-parameter y, iftopt can still minimize the validation loss by computing the hyper-gradient via the implicit function theorem.
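To make the hyper-gradient concrete for these two losses, here is a dependency-free sketch with every derivative written out by hand. It does not use iftopt or PyTorch and is not how demo.py is implemented. The helper names `hyper_gradient` and `solve`, and the step counts, are assumptions chosen for illustration; only the losses, the initial values, and the learning rate of 0.1 come from the text above.

```python
# Hand-derived sketch of the IFT hyper-gradient for the demo losses.
# Not iftopt's code: all derivatives are written analytically.

def train_loss(x, y):
    return (x + y) ** 2

def val_loss(x, y):
    return x ** 2

def hyper_gradient(x, y):
    """d val_loss / dy through x*(y), via the implicit function theorem."""
    dval_dx = 2.0 * x        # partial of val_loss w.r.t. x
    dval_dy = 0.0            # val_loss has no direct dependence on y
    d2train_dxdx = 2.0       # second derivative of train_loss w.r.t. x
    d2train_dxdy = 2.0       # mixed second derivative of train_loss
    # IFT: dx*/dy = -(d2train/dx2)^-1 * d2train/dxdy, which is -1 here.
    dxstar_dy = -d2train_dxdy / d2train_dxdx
    return dval_dy + dval_dx * dxstar_dy

def solve(x=1.0, y=1.0, train_lr=0.1, val_lr=0.1,
          outer_steps=200, inner_steps=20):
    # outer_steps and inner_steps are illustrative assumptions.
    for _ in range(outer_steps):
        # Inner loop: gradient descent on train_loss w.r.t. x.
        for _ in range(inner_steps):
            x -= train_lr * 2.0 * (x + y)
        # Outer step: gradient descent on val_loss w.r.t. y
        # using the implicit hyper-gradient.
        y -= val_lr * hyper_gradient(x, y)
    return x, y

x_opt, y_opt = solve()
print(x_opt, y_opt, val_loss(x_opt, y_opt))
```

Since x*(y) = -y for this training loss, the hyper-gradient reduces to -2x, so each outer step moves y toward 0 and x follows it, driving the validation loss toward 0.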


Owner

The Money Shredder Lab
