Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right". We provide scripts for downloading and processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility: library versions and GPU hardware may cause small differences, but these should be extremely minor.

Dependencies

We use Python 3 and PyTorch 1.7.0, but we do not use cutting-edge features from either, so we expect the code to be largely forward and backward compatible. That is not a guarantee or promise.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use OpenAI Beta, which is limited access. You can apply for access here. Once you have access, point score.py to your API key with the --key argument, or put your key in api.key, which is the default path.
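The key lookup described above can be sketched as follows. This is a minimal illustration, not the code in score.py: the helper name resolve_api_key is hypothetical, and it assumes that --key takes a file path and that the key file holds the key as plain text.

```python
from pathlib import Path
from typing import Optional

# Default key location named in this README.
DEFAULT_KEY_PATH = "api.key"


def resolve_api_key(key_path: Optional[str] = None) -> str:
    """Hypothetical helper: read an API key from a plain-text file.

    key_path plays the role of the --key argument; when it is omitted,
    the default path api.key is used instead.
    """
    path = Path(key_path or DEFAULT_KEY_PATH)
    if not path.is_file():
        raise FileNotFoundError(f"No API key file found at {path}")
    # Strip surrounding whitespace such as a trailing newline.
    key = path.read_text(encoding="utf-8").strip()
    if not key:
        raise ValueError(f"API key file {path} is empty")
    return key
```

Failing early on a missing or empty key file gives a clearer error than a rejected API request later in a scoring run.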

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. Where possible, we provide automatic downloaders and processors in data_downloaders/, but see DATA_README.md for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py <dataset> --model <model>

where <dataset> is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. <model> is the name of either a GPT-2 or GPT-3 model, e.g. xl, davinci, etc. To speed things up, you can use a larger --batch if you have enough GPU memory.
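For readers who want to see how such a command line fits together, here is a rough argparse sketch of the interface described above: a positional dataset abbreviation plus the --model, --batch, and --key options. It is an assumption-laden illustration, not a copy of score.py; the function name build_parser and every default value except the api.key path are made up for the example.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the options named in this README."""
    parser = argparse.ArgumentParser(
        description="Run zero-shot scoring strategies on one dataset."
    )
    # Positional argument: the dataset abbreviation used for table rows.
    parser.add_argument("dataset", help="dataset abbreviation")
    # Name of a GPT-2 or GPT-3 model, e.g. xl or davinci (default is assumed).
    parser.add_argument("--model", default="xl", help="GPT-2 or GPT-3 model name")
    # Larger batches run faster but need more GPU memory (default is assumed).
    parser.add_argument("--batch", type=int, default=8, help="batch size")
    # Path to the API key file; api.key is the default path per this README.
    parser.add_argument("--key", default="api.key", help="path to API key file")
    return parser


# Parse an explicit argument list instead of sys.argv for illustration.
# "some_dataset" is a placeholder, not a real dataset abbreviation.
example_args = build_parser().parse_args(
    ["some_dataset", "--model", "davinci", "--batch", "32"]
)
```

Here example_args.dataset, example_args.model, and example_args.batch hold the parsed values, and example_args.key falls back to api.key because --key was not given.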

Owner

Peter West
