Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Last update: Dec 25, 2022

Related tags

Deep Learning multi-task_loss_optimizer

Overview

multi-task_losses_optimizer

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

已经实验过了，不会有cuda out of memory情况

##Pareto optimizer

from Pareto_fn import pareto_fn
w_list = [w1,w2,...]
c_list = [c1,c2,...]
[loss1,loss2,...] = model(inputs)
loss_list = [loss1,loss2,...]
# config is the superparameter for training
new_w_list = pareto_fn(w_list,c_list,config,loss_list)
loss = 0
for i in range(len(w_list)):
    loss += new_w_list[i]*loss_list[i]
model.zero_grad()

loss.backward()
optimizer.step()

##pcgrad optimizer

from pcgrad_fn import pcgrad_fn

[loss1,loss2,...] = model(inputs)
loss_list = [loss1,loss2,...]
# config is the superparameter for training

pcgrad_fn(model,loss_list,optimizer)

optimizer.step()

Reference

Please cite as:

@article{yu2020gradient,
  title={Gradient surgery for multi-task learning},
  author={Yu, Tianhe and Kumar, Saurabh and Gupta, Abhishek and Levine, Sergey and Hausman, Karol and Finn, Chelsea},
  journal={arXiv preprint arXiv:2001.06782},
  year={2020}
}

paper: "A Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation". RecSys, 2019, Alibaba

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Related tags

Overview

multi-task_losses_optimizer

Reference

Owner

LibMTL: A PyTorch Library for Multi-Task Learning

Breast-Cancer-Prediction

A really easy-to-use and powerful sudoku solver.

Does Oversizing Improve Prosumer Profitability in a Flexibility Market? - A Sensitivity Analysis using PV-battery System

BRepNet: A topological message passing system for solid models

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings

SPTAG: A library for fast approximate nearest neighbor search

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

nanodet_plus,yolov5_v6.0

Repository of best practices for deep learning in Julia, inspired by fastai

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning

PyTorch Personal Trainer: My framework for deep learning experiments

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

Benchmark datasets, data loaders, and evaluators for graph machine learning

PyTorch implementation of "Simple and Deep Graph Convolutional Networks"

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

StackRec: Efficient Training of Very Deep Sequential Recommender Models by Iterative Stacking

How to use TensorLayer

Pytorch implement of 'Unmixing based PAN guided fusion network for hyperspectral imagery'