Code for the Lovász-Softmax loss (CVPR 2018)

Last update: Jan 04, 2023

Overview

The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks

Maxim Berman, Amal Rannen Triki, Matthew B. Blaschko

ESAT-PSI, KU Leuven, Belgium.

Published in CVPR 2018. See project page, arxiv paper, paper on CVF open access.

PyTorch implementation of the loss layer (pytorch folder)

Files included:

lovasz_losses.py: Standalone PyTorch implementation of the Lovász hinge and Lovász-Softmax for the Jaccard index
demo_binary.ipynb: Jupyter notebook showcasing binary training of a linear model, with the Lovász Hinge and with the Lovász-Sigmoid.
demo_multiclass.ipynb: Jupyter notebook showcasing multiclass training of a linear model with the Lovász-Softmax

The binary lovasz_hinge expects real-valued scores (positive scores correspond to foreground pixels).

The multiclass lovasz_softmax expect class probabilities (the maximum scoring category is predicted). First use a Softmax layer on the unnormalized scores.

TensorFlow implementation of the loss layer (tensorflow folder)

Files included:

lovasz_losses_tf.py: Standalone TensorFlow implementation of the Lovász hinge and Lovász-Softmax for the Jaccard index
demo_binary_tf.ipynb: Jupyter notebook showcasing binary training of a linear model, with the Lovász Hinge and with the Lovász-Sigmoid.
demo_multiclass_tf.ipynb: Jupyter notebook showcasing the application of the multiclass loss with the Lovász-Softmax

Warning: the losses values and gradients have been tested to be the same as in PyTorch (see notebooks), however we have not used the TF implementation in a training setting.

Usage

See the demos for simple proofs of principle.

FAQ

How should I use the Lovász-Softmax loss?

The loss can be optimized on its own, but the optimal optimization hyperparameters (learning rates, momentum) might be different from the best ones for cross-entropy. As discussed in the paper, optimizing the dataset-mIoU (Pascal VOC measure) is dependent on the batch size and number of classes. Therefore you might have best results by optimizing with cross-entropy first and finetuning with our loss, or by combining the two losses.

See for example how the work Land Cover Classification From Satellite Imagery With U-Net and Lovasz-Softmax Loss by Alexander Rakhlin et al. used our loss in the CVPR 18 DeepGlobe challenge.

Inference in Tensorflow is very slow...

Compiling from Tensorflow master (or using a future distribution that includes commit tensorflow/[email protected]) should solve this problem; see issue #6.

Citation

Please cite

@inproceedings{berman2018lovasz,
  title={The Lov{\'a}sz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks},
  author={Berman, Maxim and Rannen Triki, Amal and Blaschko, Matthew B},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={4413--4421},
  year={2018}
}

Code for the Lovász-Softmax loss (CVPR 2018)

Related tags

Overview

The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks

PyTorch implementation of the loss layer (pytorch folder)

TensorFlow implementation of the loss layer (tensorflow folder)

Usage

FAQ

Citation

Owner

Maxim Berman

City Surfaces: City-scale Semantic Segmentation of Sidewalk Surfaces

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Generative Autoregressive, Normalized Flows, VAEs, Score-based models (GANVAS)

Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

FlowTorch is a PyTorch library for learning and sampling from complex probability distributions using a class of methods called Normalizing Flows

MarcoPolo is a clustering-free approach to the exploration of bimodally expressed genes along with group information in single-cell RNA-seq data

Magisk module to enable hidden features on Android 12 Developer Preview 1.

A pytorch implementation of faster RCNN detection framework (Use detectron2, it's a masterpiece)

This tutorial aims to learn the basics of deep learning by hands, and master the basics through combination of lectures and exercises

This package contains a PyTorch Implementation of IB-GAN of the submitted paper in AAAI 2021

IGCN : Image-to-graph convolutional network

[Nature Machine Intelligence' 21] "Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence"

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

Implementation of a Transformer using ReLA (Rectified Linear Attention)

Official Implementation of SWAGAN: A Style-based Wavelet-driven Generative Model

A script that trains a model to recognize handwritten digits using the MNIST data set.

Semantically Contrastive Learning for Low-light Image Enhancement

Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT