Posterior predictive distributions quantify uncertainties ignored by point estimates.

Last update: Dec 06, 2022

The Neural Testbed

Introduction

Posterior predictive distributions quantify uncertainties ignored by point estimates. The neural_testbed provides tools for the systematic evaluation of agents that generate such predictions. Crucially, these tools assess not only the quality of marginal predictions per input, but also joint predictions given many inputs. Joint distributions are often critical for useful uncertainty quantification, but they have been largely overlooked by the Bayesian deep learning community.
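
To make the distinction between marginal and joint predictions concrete, here is a minimal sketch. It is not part of neural_testbed: the array layout and the function names `marginal_probs` and `joint_prob` are assumptions chosen for illustration. It compares two toy agents whose per-input (marginal) predictions are identical but whose joint predictions over two inputs differ.

```python
import numpy as np


def marginal_probs(ensemble: np.ndarray) -> np.ndarray:
    """Average class probabilities over models, separately for each input."""
    # ensemble has shape [num_models, num_inputs, num_classes] (assumed layout).
    return ensemble.mean(axis=0)


def joint_prob(ensemble: np.ndarray, labels: np.ndarray) -> float:
    """Probability assigned to a whole label sequence, averaged over models."""
    input_idx = np.arange(len(labels))
    # Within one model, inputs are treated as independent, so we multiply.
    per_model = ensemble[:, input_idx, labels].prod(axis=1)
    return float(per_model.mean())


# Agent A: two models that are each certain, but disagree with each other.
agent_a = np.array([[[0.0, 1.0], [0.0, 1.0]],
                    [[1.0, 0.0], [1.0, 0.0]]])
# Agent B: a single model that is uncertain about every input.
agent_b = np.array([[[0.5, 0.5], [0.5, 0.5]]])

labels = np.array([1, 1])
print(marginal_probs(agent_a))      # identical to agent B's marginals
print(marginal_probs(agent_b))
print(joint_prob(agent_a, labels))  # 0.5
print(joint_prob(agent_b, labels))  # 0.25
```

Both agents assign probability 0.5 to each label on each input, so a marginal evaluation cannot tell them apart. Only the joint probability reveals that agent A believes the two labels are perfectly correlated.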

This library automates the evaluation and analysis of learning agents:

A synthetic, neural-network-based generative model.
Evaluation of predictions beyond marginal distributions.
Reference implementations of benchmark agents (with tuning).

For a more comprehensive overview, see the accompanying paper.

Technical overview

We outline the key high-level interfaces for our code in base.py:

EpistemicSampler: Generates a random sample from the agent's predictive distribution.
TestbedAgent: Given data and prior_knowledge, outputs an EpistemicSampler.
TestbedProblem: Reveals train_data and prior_knowledge, and can evaluate the quality of an EpistemicSampler.

If you want to evaluate your algorithm on the testbed, you only need to define your TestbedAgent and then run it with the run function in experiment.py:

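```
from neural_testbed import base as testbed_base

def run(agent: testbed_base.TestbedAgent,
        problem: testbed_base.TestbedProblem) -> testbed_base.ENNQuality:
  """Run an agent on a given testbed problem."""
  # The agent maps training data and prior knowledge to an EpistemicSampler.
  enn_sampler = agent(problem.train_data, problem.prior_knowledge)
  # The problem evaluates the quality of that sampler.
  return problem.evaluate_quality(enn_sampler)
```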

The neural_testbed takes care of evaluation and logging within the TestbedProblem, so each experiment automatically outputs data in a consistent format. This makes it easy to compare results across codebases and frameworks, and lets you focus on agent design.
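
The division of labor between agent and problem can be sketched with a self-contained toy version. Everything below is a hypothetical stand-in written with NumPy only: the `Data` and `EpistemicSampler` aliases, `constant_agent`, and `ToyProblem` are assumptions for illustration and are not the library's definitions in base.py. In particular, the toy `evaluate_quality` returns a plain float rather than an ENNQuality.

```python
from typing import Callable, Dict, Tuple

import numpy as np

# Hypothetical stand-ins for the interfaces described above; the real
# definitions live in neural_testbed's base.py and may differ.
Data = Tuple[np.ndarray, np.ndarray]  # (inputs, labels)
EpistemicSampler = Callable[[np.ndarray, int], np.ndarray]  # (inputs, seed) -> logits


def constant_agent(data: Data, prior_knowledge: Dict[str, int]) -> EpistemicSampler:
    """Toy agent: ignores inputs and predicts noisy class log-frequencies."""
    _, labels = data
    num_classes = prior_knowledge["num_classes"]
    counts = np.bincount(labels, minlength=num_classes) + 1.0  # add-one smoothing
    base_logits = np.log(counts / counts.sum())

    def sampler(inputs: np.ndarray, seed: int) -> np.ndarray:
        # Each seed corresponds to one sampled "model" from the agent.
        rng = np.random.default_rng(seed)
        noise = 0.1 * rng.standard_normal(num_classes)
        return np.tile(base_logits + noise, (len(inputs), 1))

    return sampler


class ToyProblem:
    """Toy problem: reveals training data and scores a sampler on test data."""

    def __init__(self, seed: int = 0):
        rng = np.random.default_rng(seed)
        x = rng.standard_normal((40, 2))
        y = (x[:, 0] > 0).astype(int)
        self.train_data: Data = (x[:20], y[:20])
        self._test_data: Data = (x[20:], y[20:])  # hidden from the agent
        self.prior_knowledge = {"num_classes": 2}

    def evaluate_quality(self, sampler: EpistemicSampler) -> float:
        """Mean test negative log-likelihood of the sample-averaged prediction."""
        inputs, labels = self._test_data
        num_samples = 10
        probs = np.zeros((len(inputs), 2))
        for seed in range(num_samples):
            logits = sampler(inputs, seed)
            logits = logits - logits.max(axis=1, keepdims=True)  # stable softmax
            exp = np.exp(logits)
            probs += exp / exp.sum(axis=1, keepdims=True) / num_samples
        return float(-np.log(probs[np.arange(len(labels)), labels]).mean())


def run(agent, problem) -> float:
    """Same pattern as the run function above: build a sampler, then score it."""
    sampler = agent(problem.train_data, problem.prior_knowledge)
    return problem.evaluate_quality(sampler)


nll = run(constant_agent, ToyProblem())
print(f"toy marginal NLL: {nll:.3f}")
```

Because the agent only ever sees `train_data` and `prior_knowledge`, any agent with this call signature can be swapped in without touching the evaluation code.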

How do I get started?

If you are new to neural_testbed, you can get started with our colab tutorial. This Jupyter notebook is hosted on a free cloud server, so you can start coding right away without installing anything on your machine. After that, you can follow the instructions below to get neural_testbed running on your local machine:

Installation

We have tested neural_testbed on Python 3.7. To install the dependencies:

Optional: We recommend using a Python virtual environment to manage your dependencies, so as not to clobber your system installation:
```
python3 -m venv neural_testbed
source neural_testbed/bin/activate
pip install --upgrade pip setuptools
```

Install neural_testbed directly from GitHub:
```
git clone https://github.com/deepmind/neural_testbed.git
cd neural_testbed
pip install .
```

Optional: run the tests by executing ./test.sh from the neural_testbed main directory.

Baseline agents

In addition to our testbed code, we release a collection of benchmark agents. These include the full sets of hyperparameter sweeps necessary to reproduce the paper's results, and can serve as a great starting point for new research. You can have a look at these implementations in the agents/factories/ folder.

We recommend that you get started with our colab tutorial. After installation, you can also run an agent directly by executing the following command from the main directory of neural_testbed:
```
python -m neural_testbed.experiments.run --agent_name=mlp
```

By default, this saves the results for that agent as CSV at /tmp/neural_testbed. You can change these options with flags in the run file. In particular, you can run the agent on the whole sweep of tasks in the Neural Testbed by specifying the flag --problem_id=SWEEP.
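
Once a run has finished, the saved results can be inspected with the standard library alone. The sketch below is an assumption-laden helper, not part of neural_testbed: `load_results` is a hypothetical name, and since the file names and column layout of the output are not specified here, it simply reads every CSV file it finds under the directory and reports the columns it sees.

```python
import csv
import pathlib
from typing import Dict, List


def load_results(results_dir: str = "/tmp/neural_testbed") -> List[Dict[str, str]]:
    """Read every CSV file under results_dir into a flat list of row dicts."""
    rows: List[Dict[str, str]] = []
    root = pathlib.Path(results_dir)
    if not root.is_dir():
        return rows  # nothing has been written yet
    for path in sorted(root.glob("**/*.csv")):
        with path.open(newline="") as f:
            for row in csv.DictReader(f):
                row["source_file"] = path.name  # remember where the row came from
                rows.append(row)
    return rows


results = load_results()
print(f"loaded {len(results)} rows")
if results:
    print("columns:", sorted(results[0].keys()))
```

All values are returned as strings, so convert numeric columns explicitly before aggregating across a sweep.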

Citing

If you use neural_testbed in your work, please cite the accompanying paper:

```
@misc{osband2021evaluating,
      title={Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?},
      author={Ian Osband and Zheng Wen and Seyed Mohammad Asghari and Vikranth Dwaracherla and Botao Hao and Morteza Ibrahimi and Dieterich Lawson and Xiuyuan Lu and Brendan O'Donoghue and Benjamin Van Roy},
      year={2021},
      eprint={2110.04629},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
```

Owner

DeepMind
