a dnn ai project to classify which food people are eating on audio recordings

Last update: Oct 24, 2021

Related tags

Overview

Deep Learning - EAT Challenge

About

This project is part of an AI challenge of the DeepLearning course 2021 at the University of Augsburg. The objective to be learned is a classification task telling which food people are eating on audio recordings.

Students

This project was created by:

Benjamin Möckl
Julian Göser
Marco Tröster

EAT Dataset Setup

For your convenience, the download of all external project assets (dataset and evaluation metrics) has been automated by a shell script. After executing the script you should be ready to run / develop the project code.

# download and unpack the dataset and metric files
./init_dataset_and_metrics.sh <dataset zip password>

How to Run

First, cache the input dataset as TFRecord files for a training session (e.g. naive training). This should massively improve your training performance (especially with low CPU / GPU resources).

# cache the preprocessed audio dataset as TFRecord file
python src/main.py preprocess_dataset naive

Now, you can launch a training session (e.g. naive training).

# process a training session
python src/main.py run_training naive

After that you can sample all inputs of the unknown test dataset using a trained model and export the prediction results for EAT challenge submission.

# evaluate the results for submission
python src/main.py eval_results naive

Valid training configurations are:

naive
noisy
autoenc
amplitude

Remark: Use a GPU empowered machine for amplitude training (although it won't be too rewarding anyways). Tested on Ubuntu 20.04. For running on Windows, the keras ModelCheckpoint Callback has to be switched to our SaveBestAccuracyCallback.

Training Results

Training	Approach Description	Test Acc.	Real Acc.
Naive	Train on audio melspectrograms using Conv2D	0.41	0.36
Noisy	Train on audio melspectrograms using custom noisy Conv2D	0.44	0.39
Amplitude	Train on audio amplitude using Conv1D	0.23	?.??
AutoEnc	Train on audio melspectrograms using an Auto Encoder	0.25	?.??

a dnn ai project to classify which food people are eating on audio recordings

Related tags

Overview

Deep Learning - EAT Challenge

About

Students

EAT Dataset Setup

How to Run

Training Results

Owner

Marco Tröster

Customer-Transaction-Analysis - This analysis is based on a synthesised transaction dataset containing 3 months worth of transactions for 100 hypothetical customers.

Python wrapper of LSODA (solving ODEs) which can be called from within numba functions.

Code for A Volumetric Transformer for Accurate 3D Tumor Segmentation

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training of neural networks"

Implementation of the Paper: "Parameterized Hypercomplex Graph Neural Networks for Graph Classification" by Tuan Le, Marco Bertolini, Frank Noé and Djork-Arné Clevert

Real-time pose estimation accelerated with NVIDIA TensorRT

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Training Cifar-10 Classifier Using VGG16

An open source Jetson Nano baseboard and tools to design your own.

Locally cache assets that are normally streamed in POPULATION: ONE

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Deep Learning GPU Training System

This is a model to classify Vietnamese sign language using Motion history image (MHI) algorithm and CNN.

This is an implementation of PIFuhd based on Pytorch

The project covers common metrics for super-resolution performance evaluation.

Offical code for the paper: "Growing 3D Artefacts and Functional Machines with Neural Cellular Automata" https://arxiv.org/abs/2103.08737

PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote Sensing Images

The Official PyTorch Implementation of "VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models" (ICLR 2021 spotlight paper)