Code for "Universal inference meets random projections: a scalable test for log-concavity"

Last update: Nov 21, 2021

Overview

How to use this repository

This repository contains code to replicate the results of "Universal inference meets random projections: a scalable test for log-concavity" by Robin Dunn, Larry Wasserman, and Aaditya Ramdas.

Folder contents

batch_scripts: Contains SLURM batch scripts to run the simulations. Scripts are labeled by the figure for which their simulations produce data. These scripts run the code in sim_code, using the parameters in sim_params.
data: Output of simulations.
plot_code: R scripts that read simulation outputs from the data folder and reproduce all figures in the paper. Plots are saved to the plots folder.
plots: Contains all plots in the paper.
sim_code: R code to run simulations. Simulation output is saved to the data folder.
sim_params: Parameters for simulations. Each row contains a single choice of parameters. The scripts in sim_code read in these files, and the scripts in batch_scripts loop through all choices of parameters.

How do I ...

Produce the simulations for a given figure?

In the batch_scripts folder, scripts are labeled by the figure for which they simulate data. Run all batch scripts corresponding to the figure of interest. The run time allocated in each batch script is based on the choice of parameters with the longest run time, so many runs will finish sooner. Each file in sim_code displays a progress bar that estimates the remaining run time. You may wish to run these files outside of a batch submission first to gauge the run time on your computing system.

Alternatively, to run the code without a job submission system, open any .sh file. The Rscript lines can be run in a terminal, replacing $SLURM_ARRAY_TASK_ID with each index in the batch array in turn.
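
As a rough illustration of running the array indices one after another in a terminal, the sketch below defines a hypothetical helper, run_all_tasks, that is not part of this repository. It assumes the R script accepts the task index as its first command-line argument. Check the Rscript line in the corresponding .sh file for the actual invocation and adapt accordingly.

```shell
# Sketch: run every array task of one simulation script sequentially, without SLURM.
# Assumption: the R script takes the task index as its first command-line argument.
# run_all_tasks is a hypothetical helper, not something shipped with the repository.
run_all_tasks() {
  script="$1"             # path to an R script in sim_code
  n_tasks="$2"            # number of indices in the batch array
  runner="${3:-Rscript}"  # command used to launch each task
  i=1
  while [ "$i" -le "$n_tasks" ]; do
    "$runner" "$script" "$i" || return 1  # stop at the first failing task
    i=$((i + 1))
  done
}

# Possible call for the Figure 1 example (30 parameter sets):
# run_all_tasks sim_code/fig01_fully_NP_randproj.R 30
```

Passing echo as the third argument prints the commands instead of running them, which is a cheap way to check the loop before starting a long simulation.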

The simulation output will be stored in the data folder, with one dataset per choice of parameters. To combine these datasets into a single dataset (as they currently appear in data), run the code in sim_code/combine_datasets.R.
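
To make concrete what combining the per-parameter files amounts to, here is a minimal shell sketch. It is a stand-in for illustration only and not the logic of sim_code/combine_datasets.R, which remains the supported route. The helper name combine_csvs is hypothetical, and the sketch assumes files named <prefix>_<i>.csv that all begin with the same single header line.

```shell
# Sketch: concatenate per-parameter CSV files, keeping the header row only once.
# Assumptions: files are named <prefix>_<i>.csv for i = 1..n, and each starts
# with the same one-line header. Not the repository's combine_datasets.R.
combine_csvs() {
  prefix="$1"   # e.g. data/fig01_fully_NP_randproj
  n_files="$2"  # number of per-parameter files
  i=1
  while [ "$i" -le "$n_files" ]; do
    file="${prefix}_${i}.csv"
    [ -f "$file" ] || { echo "missing: $file" >&2; return 1; }
    if [ "$i" -eq 1 ]; then
      cat "$file"          # first file: header plus data rows
    else
      tail -n +2 "$file"   # later files: data rows only
    fi
    i=$((i + 1))
  done
}

# Possible call (the output file name here is made up):
# combine_csvs data/fig01_fully_NP_randproj 30 > combined.csv
```

Looping over explicit indices rather than a glob keeps the rows in numeric order (1, 2, ..., 30) instead of lexicographic order (1, 10, 11, ...).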

Example: batch_scripts/fig01_fully_NP_randproj.sh

This script reproduces the universal test simulations for Figure 1. To do this, it runs the R script at sim_code/fig01_fully_NP_randproj.R, which reads its parameters from sim_params/fig01_fully_NP_randproj_params.csv. There are 30 sets of parameters in total. The results will be stored in the data folder, with names such as fig01_fully_NP_randproj_1.csv, ..., fig01_fully_NP_randproj_30.csv. To combine these files into a single .csv file, run the code at sim_code/combine_datasets.R.
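
Before combining, it can help to confirm that all 30 output files were written. The sketch below is one way to list the indices that are still missing. The helper missing_outputs is hypothetical, and it assumes the output names follow the pattern shown above (prefix, underscore, index, .csv).

```shell
# Sketch: print the task indices whose output file is not yet present.
# Assumption: outputs are named <prefix>_<i>.csv for i = 1..n.
# missing_outputs is a hypothetical helper, not part of the repository.
missing_outputs() {
  prefix="$1"   # e.g. data/fig01_fully_NP_randproj
  n_files="$2"  # expected number of output files
  i=1
  while [ "$i" -le "$n_files" ]; do
    [ -f "${prefix}_${i}.csv" ] || echo "$i"
    i=$((i + 1))
  done
}

# Possible call for the Figure 1 example:
# missing_outputs data/fig01_fully_NP_randproj 30
```

An empty result means every expected file exists. Any printed index can be rerun on its own by substituting it for $SLURM_ARRAY_TASK_ID.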

Examine the code for a given simulation?

The R scripts in sim_code are labeled by the figure for which they simulate data. Open all files corresponding to a given figure.

Reproduce a figure without rerunning the simulations?

The R scripts in plot_code are labeled by their corresponding plots. They read in the necessary simulated data from the data folder and output the figures to the plots folder.


