Finding Label and Model Errors in Perception Data With Learned Observation Assertions

Last update: Oct 14, 2022

Overview

This is the project page for Finding Label and Model Errors in Perception Data With Learned Observation Assertions (LOA).

Please read the paper for full technical details.

Installation

In the root directory, run

pip install -e .

Examples

We provide an example on the Lyft Level 5 perception dataset. Model predictions are included for convenience, but you will need to download the dataset itself separately.

All of the scripts are in examples/lyft_level5. To run them:

1. Set the data directories in constants.py.
2. Learn the priors with learn_priors.py.
3. Run LOA with prior_lyft.py.
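
The steps above can be sketched as a small Python driver. This is only an illustration: it assumes the scripts take no command-line arguments and are meant to be run from inside examples/lyft_level5 after constants.py has been edited by hand, neither of which is stated here. The helper names build_commands and run_pipeline are hypothetical and not part of LOA.

```python
import subprocess
import sys

# Script order taken from the steps above. constants.py is not run;
# it must be edited by hand before calling run_pipeline().
PIPELINE = ["learn_priors.py", "prior_lyft.py"]


def build_commands(python=sys.executable):
    """Return one command per pipeline script, in the order it should run."""
    # Assumption: the scripts need no command-line arguments.
    return [[python, script] for script in PIPELINE]


def run_pipeline(example_dir="examples/lyft_level5", dry_run=False):
    """Run each script from inside example_dir, stopping at the first failure."""
    for cmd in build_commands():
        print("running:", " ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if a script exits non-zero,
            # so prior_lyft.py never runs when learn_priors.py has failed.
            subprocess.run(cmd, cwd=example_dir, check=True)
```

Calling run_pipeline(dry_run=True) prints the commands without executing them, which is a cheap way to check the order before starting a long run.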

You can visualize the results with viz_track.py.

Citation

If you find this project useful, please cite us at

@article{kang2021finding,
  title={Finding Label and Model Errors in Perception Data With Learned Observation Assertions},
  author={Kang, Daniel and Arechiga, Nikos and Pillai, Sudeep and Bailis, Peter and Zaharia, Matei},
}

and contact us if you deploy LOA!

Owner

Stanford Future Data Systems
