BARTScore: Evaluating Generated Text as Text Generation

Last update: Dec 17, 2022

Related tags

Deep Learning BARTScore

Overview

This is the Repo for the paper: BARTScore: Evaluating Generated Text as Text Generation

Updates

2021.06.28 Release online evaluation Demo
2021.06.25 Release online Explainable Leaderboard for Meta-evaluation
2021.06.22 Code will be released soon

Background

There is a recent trend that leverages neural models for automated evaluation in different ways, as shown in Fig.1.

(a) Evaluation as matching task. Unsupervised matching metrics aim to measure the semantic equivalence between the reference and hypothesis by using a token-level matching functions in distributed representation space (e.g. BERT) or discrete string space (e.g. ROUGE).

(b) Evaluation as regression task. Regression-based metrics (e.g. BLEURT) introduce a parameterized regression layer, which would be learned in a supervised fashion to accurately predict human judgments.

(c) Evaluation as ranking task. Ranking-based metrics (e.g. COMET) aim to learn a scoring function that assigns a higher score to better hypotheses than to worse ones.

(d) Evaluation as generation task. In this work, we formulate evaluating generated text as a text generation task from pre-trained language models.

BARTScore: Evaluating Generated Text as Text Generation

Related tags

Overview

Updates

Background

Owner

NeuLab

Implementation of BI-RADS-BERT & The Advantages of Section Tokenization.

code for paper"A High-precision Semantic Segmentation Method Combining Adversarial Learning and Attention Mechanism"

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

StyleGAN2 Webtoon / Anime Style Toonify

[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

Deep Video Matting via Spatio-Temporal Alignment and Aggregation [CVPR2021]

Block Sparse movement pruning

Cards Against Humanity AI

Small-bets - Ergodic Experiment With Python

Semi-supervised Stance Detection of Tweets Via Distant Network Supervision

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

A clear, concise, simple yet powerful and efficient API for deep learning.

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

Deep-learning X-Ray Micro-CT image enhancement, pore-network modelling and continuum modelling

Automatically creates genre collections for your Plex media

Code for "The Box Size Confidence Bias Harms Your Object Detector"

The authors' official PyTorch SigWGAN implementation

Multi-Object Tracking in Satellite Videos with Graph-Based Multi-Task Modeling

Long Expressive Memory (LEM)