Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

Last update: Dec 26, 2022

Overview

Note: the current releases of this toolbox are beta releases, intended to test working with the Haskell, Python, and R package repositories.

Metrics provides implementations of various supervised machine learning evaluation metrics in the following languages:

Python: easy_install ml_metrics
R: install.packages("Metrics") from the R prompt
Haskell: cabal install Metrics
MATLAB / Octave: clone the repo and run setup from the MATLAB command line

For more detailed installation instructions, see the README for each implementation.

EVALUATION METRICS

Evaluation Metric	Python	R	Haskell	MATLAB / Octave
Absolute Error (AE)	✓	✓	✓	✓
Average Precision at K (APK, AP@K)	✓	✓	✓	✓
Area Under the ROC (AUC)	✓	✓	✓	✓
Classification Error (CE)	✓	✓	✓	✓
F1 Score (F1)		✓
Gini				✓
Levenshtein	✓		✓	✓
Log Loss (LL)	✓	✓	✓	✓
Mean Log Loss (LogLoss)	✓	✓	✓	✓
Mean Absolute Error (MAE)	✓	✓	✓	✓
Mean Average Precision at K (MAPK, MAP@K)	✓	✓	✓	✓
Mean Quadratic Weighted Kappa	✓	✓		✓
Mean Squared Error (MSE)	✓	✓	✓	✓
Mean Squared Log Error (MSLE)	✓	✓	✓	✓
Normalized Gini				✓
Quadratic Weighted Kappa	✓	✓		✓
Relative Absolute Error (RAE)		✓
Root Mean Squared Error (RMSE)	✓	✓	✓	✓
Relative Squared Error (RSE)		✓
Root Relative Squared Error (RRSE)		✓
Root Mean Squared Log Error (RMSLE)	✓	✓	✓	✓
Squared Error (SE)	✓	✓	✓	✓
Squared Log Error (SLE)	✓	✓	✓	✓
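
Several rows in the table are related: the elementwise metrics (SE, SLE) feed the mean versions (MSE), which in turn feed the root versions (RMSE, RMSLE), and MAP@K is the mean of AP@K over queries. The following is a minimal pure-Python sketch of those relationships, not the code shipped in any of the packages. The function names mirror the abbreviations in the table; the log(1 + x) form of the log error, the score of 0 for an empty set of relevant items, and the normalization of AP@K by min(len(actual), k) are assumptions that follow one common convention.

```python
import math


def se(actual, predicted):
    """Squared error (SE) for each (actual, predicted) pair."""
    return [(a - p) ** 2 for a, p in zip(actual, predicted)]


def mse(actual, predicted):
    """Mean squared error (MSE): the average of the elementwise SE."""
    errors = se(actual, predicted)
    return sum(errors) / len(errors)


def rmse(actual, predicted):
    """Root mean squared error (RMSE): the square root of MSE."""
    return math.sqrt(mse(actual, predicted))


def sle(actual, predicted):
    """Squared log error (SLE); log(1 + x) is assumed so zeros are allowed."""
    return [(math.log1p(a) - math.log1p(p)) ** 2
            for a, p in zip(actual, predicted)]


def rmsle(actual, predicted):
    """Root mean squared log error (RMSLE): root of the mean SLE."""
    errors = sle(actual, predicted)
    return math.sqrt(sum(errors) / len(errors))


def apk(actual, predicted, k=10):
    """Average precision at K for a single query.

    actual is a collection of relevant (hashable) items and predicted is a
    ranked list of guesses, best first.
    """
    if not actual:
        return 0.0  # assumption: no relevant items scores 0
    relevant = set(actual)
    seen = set()
    hits = 0
    score = 0.0
    for rank, item in enumerate(predicted[:k], start=1):
        # Count each relevant item once, even if it is predicted twice.
        if item in relevant and item not in seen:
            hits += 1
            score += hits / rank
        seen.add(item)
    # Assumption: normalize by the best achievable number of hits in k slots.
    return score / min(len(relevant), k)


def mapk(actual, predicted, k=10):
    """Mean average precision at K: the mean of apk over all queries."""
    scores = [apk(a, p, k) for a, p in zip(actual, predicted)]
    return sum(scores) / len(scores)
```

For example, rmse([1, 2, 3], [1, 2, 5]) takes the square root of the mean of the squared errors [0, 0, 4], and apk([1, 2, 3], [1, 4, 2], k=3) rewards the hits at ranks 1 and 3 while ignoring the miss at rank 2.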

TO IMPLEMENT

F1 score
Multiclass log loss
Lift
Average Precision for binary classification
Precision / recall break-even point
Cross-entropy
True Pos / False Pos / True Neg / False Neg rates
Precision / recall / sensitivity / specificity
Mutual information
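
As an illustration of what some of the items above involve, here is a minimal sketch of the confusion counts, precision, recall, and F1 score for binary labels. It is not part of the library. The function names, the positive parameter, and the choice to return 0.0 when a denominator is zero are all assumptions made for this example.

```python
def confusion_counts(actual, predicted, positive=1):
    """Return (tp, fp, tn, fn) for binary labels."""
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if p == positive:
            if a == positive:
                tp += 1
            else:
                fp += 1
        else:
            if a == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def precision_recall_f1(actual, predicted, positive=1):
    """Return (precision, recall, f1) for the positive class."""
    tp, fp, tn, fn = confusion_counts(actual, predicted, positive)
    # Assumption: an undefined ratio (zero denominator) is reported as 0.0.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0  # also called sensitivity
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

Specificity follows from the same counts as tn / (tn + fp).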

HIGHER LEVEL TRANSFORMATIONS TO HANDLE

GroupBy / Reduce
Weight individual samples or groups
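
One way these two transformations could look is sketched below: a weighted mean over per-sample errors, and a wrapper that computes a metric per group and then reduces the per-group scores. This is a hypothetical design, not something the library provides; grouped_metric, weighted_mean, and the default reduction (an unweighted mean over groups) are names and choices made up for this example.

```python
from collections import defaultdict


def mae(actual, predicted):
    """Mean absolute error, used here only as an example metric."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def weighted_mean(values, weights):
    """Average per-sample values, giving each sample its own weight."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total


def grouped_metric(metric, groups, actual, predicted, reduce=None):
    """Apply metric within each group, then reduce the per-group scores.

    reduce takes the list of per-group scores; by default (an assumption)
    it is the unweighted mean, so every group counts equally.
    """
    buckets = defaultdict(lambda: ([], []))
    for g, a, p in zip(groups, actual, predicted):
        buckets[g][0].append(a)
        buckets[g][1].append(p)
    scores = [metric(a, p) for a, p in buckets.values()]
    if reduce is None:
        return sum(scores) / len(scores)
    return reduce(scores)
```

With groups ["a", "a", "b"], grouped_metric(mae, ...) averages the MAE of group "a" and the MAE of group "b", so a small group is not drowned out by a large one; passing reduce=max would instead report the worst group.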

PROPERTIES METRICS CAN HAVE

(Non-exhaustive; more properties will be added in the future)

Min or Max (optimize through minimization or maximization)
Binary Classification
- Scores predicted class labels
- Scores predicted ranking (most likely to least likely for being in one class)
- Scores predicted probabilities
Multiclass Classification
- Scores predicted class labels
- Scores predicted probabilities
Regression
Discrete Rater Comparison (confusion matrix)
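
The properties above could be recorded as structured data so that code can query them, for instance to decide whether a new score is an improvement. The sketch below is one hypothetical encoding, not something the library defines: the class names, enum members, and the is_improvement helper are all invented for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Direction(Enum):
    MINIMIZE = "min"
    MAXIMIZE = "max"


class Task(Enum):
    BINARY = "binary classification"
    MULTICLASS = "multiclass classification"
    REGRESSION = "regression"
    RATER_COMPARISON = "discrete rater comparison"


class Scores(Enum):
    LABELS = "predicted class labels"
    RANKING = "predicted ranking"
    PROBABILITIES = "predicted probabilities"


@dataclass(frozen=True)
class MetricProperties:
    name: str
    direction: Direction
    task: Task
    scores: Optional[Scores] = None  # only meaningful for classification


# Example entries (hypothetical registry values for three table rows).
AUC = MetricProperties("AUC", Direction.MAXIMIZE, Task.BINARY, Scores.RANKING)
MSE = MetricProperties("MSE", Direction.MINIMIZE, Task.REGRESSION)
LOG_LOSS = MetricProperties(
    "LogLoss", Direction.MINIMIZE, Task.BINARY, Scores.PROBABILITIES
)


def is_improvement(props, new_score, old_score):
    """Return True if new_score beats old_score for this metric."""
    if props.direction is Direction.MAXIMIZE:
        return new_score > old_score
    return new_score < old_score
```

Keeping the direction next to the metric avoids the common mistake of maximizing an error metric or minimizing AUC during model selection.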

Owner

Ben Hamner

Co-founder and CTO of Kaggle