Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Images" from CVPR 2021.

Last update: Dec 26, 2022

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Deploying deep convolutional neural networks on ultra-low power systems is challenging, because the systems put a hard limit on the size of on-chip memory. To overcome this drawback, we propose a novel Raster-Scanning Network, named RaScaNet, inspired by raster-scanning in image sensors.

RaScaNet reads only a few rows of pixels at a time using a convolutional neural network and then sequentially learns the representation of the whole image using a recurrent neural network. The proposed method requires 15.9-24.3x smaller peak memory and 5.3-12.9x smaller weight memory than the state-of-the-art tiny models. On the VWW dataset, the total memory usage of RaScaNet does not exceed 60 KB while accuracy remains competitive.
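The row-by-row processing described above can be illustrated with a toy sketch. This is not the RaScaNet architecture: `strip_feature` is a hypothetical stand-in for the convolutional network (it just averages pixels), `update_state` is a hypothetical stand-in for the recurrent cell (a simple blend), and the strip height of 2 is an arbitrary choice. The only point it shows is that a single strip of rows plus a small state needs to be held in memory at any time.

```python
from typing import Iterator, List, Sequence

Row = Sequence[float]


def row_strips(rows: Iterator[Row], strip_height: int) -> Iterator[List[Row]]:
    """Yield consecutive strips of rows; only one strip is buffered at a time."""
    strip: List[Row] = []
    for row in rows:
        strip.append(row)
        if len(strip) == strip_height:
            yield strip
            strip = []
    if strip:  # trailing rows that do not fill a whole strip
        yield strip


def strip_feature(strip: List[Row]) -> float:
    """Toy stand-in for the per-strip CNN (assumption): mean pixel value."""
    total = sum(sum(row) for row in strip)
    count = sum(len(row) for row in strip)
    return total / count


def update_state(state: float, feature: float, alpha: float = 0.5) -> float:
    """Toy stand-in for the RNN cell (assumption): blend old state and new feature."""
    return (1.0 - alpha) * state + alpha * feature


def raster_scan(rows: Iterator[Row], strip_height: int = 2) -> float:
    """Consume an image strip by strip, carrying only a small running state."""
    state = 0.0
    for strip in row_strips(rows, strip_height):
        state = update_state(state, strip_feature(strip))
    return state


# A 4x3 "image" delivered row by row, as a sensor would read it out.
image = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0],
    [1.0, 1.0, 1.0],
    [1.0, 1.0, 1.0],
]
final_state = raster_scan(iter(image), strip_height=2)
print(final_state)
```

In a real model the state would be a feature vector and both stand-ins would be learned layers; the buffering pattern is the part that carries over.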

Conference: CVPR 2021
Paper | Video | Citation

Requirements

python 3.6
torch 1.7.0
torchvision 0.8.1
pycocotools 2.0.1
numpy 1.19.0
VWW dataset

Usage

To run the model (only the VWW dataset is supported):

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar

With early termination:

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar --early_terminate=1
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar --early_terminate=1

Currently, we do not provide the code for training.

Result

Model	Weight Memory	Peak Memory	OPs Cnt.	Accuracy
rascanet(210x240)	47.03 KB	7.92 KB	56.34 M	91.835%
rascanet(105x120)	31.77 KB	3.60 KB	9.71 M	88.100%
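As a quick check of the 60 KB figure quoted in the overview, the table values can be added up. The sketch below assumes that "total memory usage" means weight memory plus peak memory; the README does not define the term, so treat that as an assumption. The names `results`, `BUDGET_KB`, and `total_memory_kb` are illustrative only.

```python
# Figures copied from the result table, in KB.
results = {
    "rascanet(210x240)": {"weight_kb": 47.03, "peak_kb": 7.92},
    "rascanet(105x120)": {"weight_kb": 31.77, "peak_kb": 3.60},
}

# Budget quoted in the overview.
BUDGET_KB = 60.0


def total_memory_kb(entry: dict) -> float:
    """Assumed definition: total = weight memory + peak memory."""
    return entry["weight_kb"] + entry["peak_kb"]


for name, entry in results.items():
    total = total_memory_kb(entry)
    print(f"{name}: {total:.2f} KB, within budget: {total < BUDGET_KB}")
```

Under that assumption the totals are 54.95 KB and 35.37 KB, both below 60 KB.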

Citation

@InProceedings{Yoo_2021_CVPR,
    author    = {Yoo, Jaehyoung and Lee, Dongwook and Son, Changyong and Jung, Sangil and Yoo, ByungIn and Choi, Changkyu and Han, Jae-Joon and Han, Bohyung},
    title     = {RaScaNet: Learning Tiny Models by Raster-Scanning Images},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {13673-13682}
}

License

Copyright (C) 2021 Samsung Electronics Co. LTD

This software is a property of Samsung Electronics.
No part of this software, either material or conceptual may be copied or distributed, transmitted,
transcribed, stored in a retrieval system or translated into any human or computer language in any form by any means,
electronic, mechanical, manual or otherwise, or disclosed
to third parties without the express written permission of Samsung Electronics.
(Use of the Software is restricted to non-commercial, personal or academic, research purpose only)

Owner

SAIT (Samsung Advanced Institute of Technology)
