EfficientDet (Scalable and Efficient Object Detection) implementation in Keras and TensorFlow

Last update: Dec 19, 2022

Overview

EfficientDet

This is an implementation of EfficientDet for object detection in Keras and TensorFlow. The project is based on the official implementation google/automl, fizyr/keras-retinanet and qubvel/efficientnet.

About pretrained weights

The pretrained EfficientNet weights on ImageNet are downloaded from Callidior/keras-applications/releases.
The pretrained EfficientDet weights on COCO are converted from the official release google/automl.

Thanks for their hard work. This project is released under the Apache License. Please take their licenses into consideration too when using this project.

Updates

[03/21/2020] Synchronized with the official implementation, google/automl.
[03/05/2020] Anchor-free version. The accuracy is a little lower, but it is faster and smaller. For details, please refer to xuannianz/SAPD.
[02/20/2020] Support quadrangle detection. For details, please refer to README_quad.

Train

build dataset

Pascal VOC
- Download VOC2007 and VOC2012, then copy all image files from VOC2007 into VOC2012.
- Append VOC2007 train.txt to VOC2012 trainval.txt.
- Overwrite VOC2012 val.txt with VOC2007 val.txt.
MSCOCO 2017
- Download the images and annotations of COCO 2017.
- Copy all images into datasets/coco/images and all annotations into datasets/coco/annotations.
For other dataset types, please refer to fizyr/keras-retinanet.
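
The three Pascal VOC steps above can be sketched in Python as follows. This is only an illustration: the function name is hypothetical, and the JPEGImages and ImageSets/Main folder names assume the standard VOCdevkit layout, which this README does not spell out. The sketch does only what the steps list and nothing more.

```python
import shutil
from pathlib import Path


def merge_voc2007_into_voc2012(voc2007: Path, voc2012: Path) -> None:
    """Hypothetical helper: merge VOC2007 into VOC2012 per the steps above."""
    # Step 1: copy every VOC2007 image into the VOC2012 image folder.
    # (Folder name JPEGImages is an assumption about the dataset layout.)
    dst_images = voc2012 / "JPEGImages"
    dst_images.mkdir(parents=True, exist_ok=True)
    for image in sorted((voc2007 / "JPEGImages").glob("*.jpg")):
        shutil.copy2(image, dst_images / image.name)

    src_sets = voc2007 / "ImageSets" / "Main"
    dst_sets = voc2012 / "ImageSets" / "Main"

    # Step 2: append the VOC2007 train.txt ids to the VOC2012 trainval.txt.
    # Rewriting the file avoids gluing two ids together when the existing
    # file does not end with a newline.
    extra_ids = (src_sets / "train.txt").read_text().split()
    trainval = dst_sets / "trainval.txt"
    existing_ids = trainval.read_text().split()
    trainval.write_text("\n".join(existing_ids + extra_ids) + "\n")

    # Step 3: overwrite the VOC2012 val.txt with the VOC2007 val.txt.
    shutil.copyfile(src_sets / "val.txt", dst_sets / "val.txt")
```

A call such as merge_voc2007_into_voc2012(Path("datasets/VOC2007"), Path("datasets/VOC2012")) would then leave datasets/VOC2012 ready for the training commands; the datasets/VOC2007 location is an assumption, since only datasets/VOC2012 appears in this README.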

train

STEP1: Run the following command to start training. The initial learning rate is 1e-3.
    python3 train.py --snapshot imagenet --phi {0, 1, 2, 3, 4, 5, 6} --gpu 0 --random-transform --compute-val-loss --freeze-backbone --batch-size 32 --steps 1000 pascal|coco datasets/VOC2012|datasets/coco
STEP2: Once val mAP stops improving during STEP1, continue training from the saved snapshot (xxx.h5) with the following command. The initial learning rate is 1e-4 and decays to 1e-5 when val mAP keeps dropping.
    python3 train.py --snapshot xxx.h5 --phi {0, 1, 2, 3, 4, 5, 6} --gpu 0 --random-transform --compute-val-loss --freeze-bn --batch-size 4 --steps 10000 pascal|coco datasets/VOC2012|datasets/coco
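
To make the difference between the two steps explicit, here is a minimal sketch that assembles the two command lines above as argument lists. The helper name build_train_command is hypothetical and not part of this project; every flag and value is copied from the commands above, and nothing else about train.py is assumed.

```python
import shlex
from typing import List


def build_train_command(step: int, phi: int, dataset: str, data_dir: str,
                        snapshot: str = "imagenet", gpu: int = 0) -> List[str]:
    """Hypothetical helper: build the STEP1 or STEP2 command from this README."""
    if phi not in range(7):
        raise ValueError("phi must be one of 0..6")
    if dataset not in ("pascal", "coco"):
        raise ValueError("dataset must be 'pascal' or 'coco'")
    if step == 1:
        # STEP1: frozen backbone, larger batch, fewer steps.
        stage_flags = ["--freeze-backbone", "--batch-size", "32", "--steps", "1000"]
    elif step == 2:
        # STEP2: frozen batch norm only, smaller batch, more steps.
        stage_flags = ["--freeze-bn", "--batch-size", "4", "--steps", "10000"]
    else:
        raise ValueError("step must be 1 or 2")
    return ["python3", "train.py", "--snapshot", snapshot,
            "--phi", str(phi), "--gpu", str(gpu),
            "--random-transform", "--compute-val-loss",
            *stage_flags, dataset, data_dir]


if __name__ == "__main__":
    # Print both stages for phi=0 on Pascal VOC; xxx.h5 is the README's
    # placeholder for the snapshot saved during STEP1.
    print(shlex.join(build_train_command(1, 0, "pascal", "datasets/VOC2012")))
    print(shlex.join(build_train_command(2, 0, "pascal", "datasets/VOC2012",
                                         snapshot="xxx.h5")))
```

The returned list can be passed to subprocess.run(cmd, check=True) from the repository root if launching from Python is preferred over the shell.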

Evaluate

PASCAL VOC
- Run python3 eval/common.py to evaluate a Pascal VOC model; set the model path inside that script.
- The best evaluation results (score_threshold=0.01, mAP₅₀) on VOC2007 test are:

phi	0	1
w/o weighted	-	0.8029
w/ weighted	0.7892	-

MSCOCO
- Run python3 eval/coco.py to evaluate a COCO model; set the model path inside that script.

phi	mAP
0	0.334 (weights, results)
1	0.393 (weights, results)
2	0.424 (weights, results)
3	0.454 (weights, results)
4	0.483 (weights, results)
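
For readers unfamiliar with the settings quoted for the Pascal VOC results, mAP₅₀ counts a detection as correct when its box overlaps a ground-truth box with an intersection over union (IoU) of at least 0.5, and score_threshold=0.01 discards detections scored below that value before matching. The sketch below illustrates only that matching criterion. It is not the code in eval/common.py; the function names, the (x1, y1, x2, y2) box format, and the strict score comparison are all assumptions.

```python
from typing import Sequence

Box = Sequence[float]  # assumed format: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle, if any.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(det_box: Box, det_score: float, gt_box: Box,
                     score_threshold: float = 0.01,
                     iou_threshold: float = 0.5) -> bool:
    """Illustrative check: detection passes the score filter and overlaps enough."""
    # Strict '>' on the score is an assumption of this sketch.
    return det_score > score_threshold and iou(det_box, gt_box) >= iou_threshold
```

A full mAP computation also ranks detections by score, matches each ground-truth box at most once, and averages precision over recall levels and classes; those steps are omitted here.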

Test

Run python3 inference.py to test on your own image; set the image path and model path inside that script.
