Reference implementation for Structured Prediction with Deep Value Networks

Last update: Feb 02, 2022

Overview

Deep Value Network (DVN)

This code is a Python reference implementation of DVNs, introduced in:

Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs. Michael Gygli, Mohammad Norouzi, Anelia Angelova. ICML 2017.

Note: This code implements only the multi-layer perceptron version used for the multi-label classification experiments (Section 5.1). The segmentation code was written at Google and is therefore not available.

Requirements

To run this code, you need tensorflow, numpy, liac-arff, scikit-learn and torchfile. Install them with:

pip install -r requirements.txt

Playing around with a pre-trained Value Net

The pre-trained model for the Bibtex dataset is included in this repository, so you can play around with it and its predictions using our Jupyter notebook.
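For intuition about how a value network produces predictions: the paper describes inference as gradient ascent on the output, where a candidate labeling is iteratively refined to increase its predicted value while staying in [0, 1]. The snippet below is a minimal sketch of that loop only. It does not load the pre-trained model; `toy_value`, `toy_value_grad` and `refine` are hypothetical names, and the quadratic value function is a stand-in for a trained network.

```python
def toy_value(y, target):
    """Hypothetical stand-in for a trained value network: higher is better."""
    return -sum((yi - ti) ** 2 for yi, ti in zip(y, target))


def toy_value_grad(y, target):
    """Analytic gradient of toy_value with respect to y."""
    return [-2.0 * (yi - ti) for yi, ti in zip(y, target)]


def refine(y_init, target, step_size=0.1, num_steps=100):
    """Gradient ascent on y, clipping each entry to [0, 1] after every step."""
    y = list(y_init)
    for _ in range(num_steps):
        grad = toy_value_grad(y, target)
        y = [min(1.0, max(0.0, yi + step_size * gi)) for yi, gi in zip(y, grad)]
    return y


# Assumed toy setup: four labels, starting from the all-zeros labeling.
target = [1.0, 0.0, 1.0, 0.0]
y_refined = refine([0.0] * 4, target)

# Threshold the relaxed outputs to obtain a binary labeling.
labels = [int(yi > 0.5) for yi in y_refined]
print(labels)
```

With a real value network the gradient would come from automatic differentiation rather than a closed-form expression, and the step size and number of steps would need tuning.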

Replicating the experiments in the paper

Bibtex

To replicate the numbers for bibtex provided in the paper, run:

import reproduce_results
# Reproduce results on the bibtex dataset
reproduce_results.run_bibtex()

By default, the model weights and logs are stored in ./bibtex_dvn. You can monitor training using TensorBoard with

tensorboard --logdir ./bibtex_dvn/

To understand the training process, two quantities are important:

loss: The loss in estimating the true value of an output hypothesis.
gt_f1_scores: The true F1 scores of the generated output hypotheses.
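To make these two quantities concrete, here is a minimal sketch. It assumes the formulation described in the paper: an F1 score relaxed to continuous outputs (intersection as a sum of element-wise minima, union as a sum of maxima) and a cross-entropy loss between the predicted and the true value. The function names `relaxed_f1` and `value_loss` are hypothetical, and the repository's TensorFlow code may differ in detail.

```python
import math


def relaxed_f1(y, y_true):
    """F1 between a (possibly continuous) labeling y in [0, 1] and binary y_true."""
    intersection = sum(min(a, b) for a, b in zip(y, y_true))
    union = sum(max(a, b) for a, b in zip(y, y_true))
    if intersection + union == 0:
        # Assumption: two empty label sets count as a perfect match.
        return 1.0
    return 2.0 * intersection / (intersection + union)


def value_loss(predicted_value, true_value, eps=1e-7):
    """Cross-entropy between the predicted value and the true value, both in [0, 1]."""
    v = min(1.0 - eps, max(eps, predicted_value))  # avoid log(0)
    return -(true_value * math.log(v) + (1.0 - true_value) * math.log(1.0 - v))


hypothesis = [1, 0, 1, 0]
ground_truth = [1, 1, 0, 0]
true_value = relaxed_f1(hypothesis, ground_truth)  # one shared label out of three
print(true_value, value_loss(0.9, true_value))
```

For binary inputs the relaxed score reduces to the usual F1, since the intersection counts true positives and the union counts true positives, false positives and false negatives.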

As training progresses, the generated output hypotheses should steadily improve. The validation performance reported here closely matches the performance on the test set. The curve should look something like this:

[Figure: example training curve]

Bookmarks

For Bookmarks, the splits are not provided on http://mulan.sourceforge.net/datasets-mlc.html, so we use the splits provided by SPEN. To get the data, run:

cd mlc_datasets
wget http://www.cics.umass.edu/~belanger/icml_mlc_data.tar.gz
tar -xvf icml_mlc_data.tar.gz
cd ..

Then, you can reproduce the results with

import reproduce_results
# Reproduce results on the bookmarks dataset
reproduce_results.run_bookmarks()

The model weights and logs are stored in ./bookmarks_dvn/.

Contributors

Michael Gygli, Mohammad Norouzi, Anelia Angelova

Code by Michael Gygli
