A fast implementation of bss_eval metrics for blind source separation

Last update: Dec 13, 2022

Overview

fast_bss_eval

Do you have a zillion BSS audio files to process, and is it taking days? Is your simulation never-ending?

Fear no more! fast_bss_eval is here to help you!

fast_bss_eval is a fast implementation of the bss_eval metrics for the evaluation of blind source separation. It has the following advantages over existing implementations:

seamlessly works with both numpy arrays and pytorch tensors
very fast
can be even faster by using an iterative solver (add use_cg_iter=10 option to the function call)
differentiable via pytorch
can run on GPU via pytorch

Author

Robin Scheibler

Quick Start

Install

# from pypi
pip install fast-bss-eval

# or from source
git clone https://github.com/fakufaku/fast_bss_eval
cd fast_bss_eval
pip install -e .

Use

Assuming you have multichannel signals for the estimated and reference sources stored in wav files named my_estimate_file.wav and my_reference_file.wav, respectively, you can quickly evaluate the bss_eval metrics as follows.

from scipy.io import wavfile
import fast_bss_eval

# open the files, we assume the sampling rate is known
# to be the same
fs, ref = wavfile.read("my_reference_file.wav")
_, est = wavfile.read("my_estimate_file.wav")

# compute the metrics
sdr, sir, sar, perm = fast_bss_eval.bss_eval_sources(ref.T, est.T)

Benchmark

This package is significantly faster than other packages that can also compute bss_eval metrics, such as mir_eval or sigsep/bsseval. We ran a benchmark using numpy/torch, single/double precision floating point arithmetic (fp32/fp64), and either Gaussian elimination or conjugate gradient descent as the solver (solve/CGD10).
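The solve/CGD10 labels contrast a direct solve with a fixed number of conjugate gradient iterations. As a rough illustration of the iterative idea only, the sketch below runs conjugate gradient on a toy symmetric positive-definite system in plain Python. It is not the package's implementation: the helper names (matvec, dot, conjugate_gradient), the tolerance, and the 3x3 matrix are all assumptions made for this example.

```python
# Illustrative sketch only: NOT the fast_bss_eval implementation.
# All names and the toy system below are hypothetical.


def matvec(A, v):
    """Multiply a matrix (list of rows) by a vector (list)."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def dot(u, v):
    """Inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


def conjugate_gradient(A, b, n_iter=10, tol=1e-24):
    """Approximately solve A x = b for symmetric positive-definite A.

    Runs at most n_iter iterations and stops early once the squared
    residual norm drops below tol.
    """
    x = [0.0] * len(b)
    r = list(b)  # residual b - A x, with x = 0 initially
    p = list(r)  # first search direction
    rs_old = dot(r, r)
    for _ in range(n_iter):
        if rs_old < tol:
            break
        Ap = matvec(A, p)
        alpha = rs_old / dot(p, Ap)  # step size along p
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        # next direction, conjugate to the previous ones
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


def max_residual(A, b, x):
    """Largest absolute entry of b - A x."""
    return max(abs(bi - yi) for bi, yi in zip(b, matvec(A, x)))


# Toy symmetric positive-definite system (hypothetical data).
A = [[4.0, 1.0, 0.0], [1.0, 4.0, 1.0], [0.0, 1.0, 4.0]]
b = [1.0, 2.0, 3.0]

for n_iter in (1, 10):
    x = conjugate_gradient(A, b, n_iter=n_iter)
    print(n_iter, "iteration(s), max residual:", max_residual(A, b, x))
```

More iterations give a smaller residual at a higher cost, which is the trade-off behind truncating the solver at a fixed iteration count. In fast_bss_eval itself, the iterative solver is selected with the use_cg_iter option mentioned in the overview.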

Citation

If you use this package in your own research, please cite our paper describing it.

@misc{scheibler_sdr_2021,
  title={SDR --- Medium Rare with Fast Computations},
  author={Robin Scheibler},
  year={2021},
  eprint={2110.06440},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}

License

2021 (c) Robin Scheibler, LINE Corporation

This code is released under the MIT License.

