A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Last update: Jan 17, 2022

Related tags

Overview

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

This is the repository for our Paper/Contribution to the WI2022 in Nürnberg.

Abstract

In recent years, large pre-trained deep neural networks (DNNs) have revolutionized the field of computer vision (CV). Although these DNNs have been shown to be very well suited for general image recognition tasks, application in industry is often precluded for three reasons:

large pre-trained DNNs are built on hundreds of millions of parameters, making deployment on many devices impossible,
the underlying dataset for pre-training consists of general objects, while industrial cases often consist of very specific objects, such as structures on solar wafers,
potentially biased pre-trained DNNs raise legal issues for companies.

As a remedy, we study neural networks for CV that we train from scratch. For this purpose, we use a real-world case from a solar wafer manufacturer. We find that our neural networks achieve similar performances as pre-trained DNNs, even though they consist of far fewer parameters and do not rely on third-party datasets.

Structure of this repository

+-- ImageClassification            | Runner Notebook + Scripts for experiments
+-- ReadMe.md			   | ReadMe
+-- Results.xlsx                   | Results that were reported in the paper
+-- RunResults                     | Detailed logging of our experiments results that were reported in the paper (IDs correspond to old IDs in the .xlsx file due to procedure)

You might also like...

Computer vision - fun segmentation experience using classic and deep tools :)

Computer_Vision_Segmentation_Fun Segmentation of Images and Video. Tools: pytorch Models: Classic model - GrabCut Deep model - Deeplabv3_resnet101 Flo

1 Dec 18, 2021

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision Project | Arxiv | Abstract It is very challenging for various visual tasks such as image

377 Jan 7, 2023

Unofficial PyTorch implementation of MobileViT based on paper "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

MobileViT RegNet Unofficial PyTorch implementation of MobileViT based on paper MOBILEVIT: LIGHT-WEIGHT, GENERAL-PURPOSE, AND MOBILE-FRIENDLY VISION TR

91 Dec 2, 2022

Best Practices on Recommendation Systems

Recommenders What's New (February 4, 2021) We have a new relase Recommenders 2021.2! It comes with lots of bug fixes, optimizations and 3 new algorith

14.8k Jan 3, 2023

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets This is the official implementation of "Towards Good Pract

52 Nov 22, 2022

A DeepStack custom model for detecting common objects in dark/night images and videos.

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Related tags

Overview

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Abstract

Structure of this repository

You might also like...

Computer vision - fun segmentation experience using classic and deep tools :)

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

Unofficial PyTorch implementation of MobileViT based on paper "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

Best Practices on Recommendation Systems

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

A DeepStack custom model for detecting common objects in dark/night images and videos.

An unofficial styleguide and best practices summary for PyTorch

Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)

Dark Finix: All in one hacking framework with almost 100 tools

Releases(v1.0)

v1.0(Jan 5, 2022)

Owner

Maximilian Harl

Code for "Reconstructing 3D Human Pose by Watching Humans in the Mirror", CVPR 2021 oral

Block-wisely Supervised Neural Architecture Search with Knowledge Distillation (CVPR 2020)

Tf alloc - Simplication of GPU allocation for Tensorflow2

(3DV 2021 Oral) Filtering by Cluster Consistency for Large-Scale Multi-Image Matching

Unified tracking framework with a single appearance model

Code release for "Transferable Semantic Augmentation for Domain Adaptation" (CVPR 2021)

A repo with study material, exercises, examples, etc for Devnet SPAUTO

A strongly-typed genetic programming framework for Python

CNNs for Sentence Classification in PyTorch

DeLiGAN - This project is an implementation of the Generative Adversarial Network

Fuzzing tool (TFuzz): a fuzzing tool based on program transformation

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

Line-level Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms applied on Continuous Control Tasks

WatermarkRemoval-WDNet-WACV2021

AI-generated-characters for Learning and Wellbeing

Robust and Accurate Object Detection via Self-Knowledge Distillation

PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".

LaBERT - A length-controllable and non-autoregressive image captioning model.

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning