CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

Last update: Dec 29, 2022

Related tags

Overview

CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

性能情况

训练数据集	权值文件名称	测试数据集	输入图片大小	mAP 0.5:0.95	mAP 0.5
VOC07+12	centernet_resnet50_voc.pth	VOC-Test07	512x512	-	77.1
COCO-Train2017	centernet_hourglass_coco.pth	COCO-Val2017	512x512	38.4	56.8

所需环境

torch==1.2.0

注意事项

代码中的centernet_resnet50_voc.pth是使用voc数据集训练的。
代码中的centernet_hourglass_coco.pth是使用coco数据集训练的。
注意不要使用中文标签，文件夹中不要有空格！
在训练前需要务必在model_data下新建一个txt文档，文档中输入需要分的类，在train.py中将classes_path指向该文件。

文件下载

训练所需的centernet_resnet50_voc.pth、centernet_hourglass_coco.pth可在百度网盘中下载。
链接: https://pan.baidu.com/s/1QBBgRb_TH8kJdSCQGgcXmQ 提取码: phnc

centernet_resnet50_voc.pth是voc数据集的权重。
centernet_hourglass_coco.pth是coco数据集的权重。

预测步骤

a、使用预训练权重

下载完库后解压，在百度网盘下载centernet_resnet50_voc.pth或者centernet_hourglass_coco.pth，放入model_data，运行predict.py，输入

img/street.jpg

利用video.py可进行摄像头检测。

b、使用自己训练的权重

按照训练步骤训练。
在yolo.py文件里面，在如下部分修改model_path和classes_path使其对应训练好的文件；model_path对应logs文件夹下面的权值文件，classes_path是model_path对应分的类。

_defaults = {
    "model_path"        : 'model_data/centernet_resnet50_voc.pth',
    "classes_path"      : 'model_data/voc_classes.txt',
    # "model_path"        : 'model_data/centernet_hourglass_coco.h5',
    # "classes_path"      : 'model_data/coco_classes.txt',
    "backbone"          : "resnet50",
    "image_size"        : [512,512,3],
    "confidence"        : 0.3,
    # backbone为resnet50时建议设置为True
    # backbone为hourglass时建议设置为False
    # 也可以根据检测效果自行选择
    "nms"               : True,
    "nms_threhold"      : 0.3,
    "cuda"              : True
}

运行predict.py，输入

img/street.jpg

利用video.py可进行摄像头检测。

训练步骤

本文使用VOC格式进行训练。
训练前将标签文件放在VOCdevkit文件夹下的VOC2007文件夹下的Annotation中。
训练前将图片文件放在VOCdevkit文件夹下的VOC2007文件夹下的JPEGImages中。
在训练前利用voc2centernet.py文件生成对应的txt。
再运行根目录下的voc_annotation.py，运行前需要将classes改成你自己的classes。注意不要使用中文标签，文件夹中不要有空格！

classes = ["aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat", "chair", "cow", "diningtable", "dog", "horse", "motorbike", "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor"]

此时会生成对应的2007_train.txt，每一行对应其图片位置及其真实框的位置。
在训练前需要务必在model_data下新建一个txt文档，文档中输入需要分的类，在train.py中将classes_path指向该文件，示例如下：

classes_path = 'model_data/new_classes.txt'

model_data/new_classes.txt文件内容为：

cat
dog
...

运行train.py即可开始训练。

mAP目标检测精度计算更新

更新了get_gt_txt.py、get_dr_txt.py和get_map.py文件。
get_map文件克隆自https://github.com/Cartucho/mAP
具体mAP计算过程可参考：https://www.bilibili.com/video/BV1zE411u7Vw

Reference

https://github.com/xuannianz/keras-CenterNet
https://github.com/see--/keras-centernet
https://github.com/xingyizhou/CenterNet

Comments

map指标

B导，我在使用get_map.py的时候，您的初始设置confidence为0.02，我正常得到map结果，但是我像其他网络一样把confidence修改成为0.001以后就得不到map结果了，这是为什么呢？还有就是想问一下，在计算voc的map时，confidence都应该设置为很低，所以是不是0.02和0.001的效果相似？谢谢b导

opened by ChristmasLee 2
训练没有归一化，预测却有归一化，是不是有问题？

训练时候加载数据是dataloader.py 222行，是没有对图片做mean和std归一化的，但预测时predict.py -> centernet.py -> util/util.py -> preprocess_input里却对图片做了mean、std归一化，这应该有问题吧？

opened by seven-linglx 2
显示no mudule named 'past'

Traceback (most recent call last): File "train.py", line 15, in from utils.callbacks import LossHistory File "/root/centernet/centernet-pytorch-main/utils/callbacks.py", line 9, in from torch.utils.tensorboard import SummaryWriter File "/root/.local/lib/python3.7/site-packages/torch/utils/tensorboard/init.py", line 6, in from .writer import FileWriter, SummaryWriter # noqa F401 File "/root/.local/lib/python3.7/site-packages/torch/utils/tensorboard/writer.py", line 18, in from ._convert_np import make_np File "/root/.local/lib/python3.7/site-packages/torch/utils/tensorboard/_convert_np.py", line 12, in from caffe2.python import workspace File "/root/.local/lib/python3.7/site-packages/caffe2/python/workspace.py", line 15, in from past.builtins import basestring

opened by buloseshi 1
请问我改mobilenetv3的时候运行到第7批次就自动停止了是怎么回事呢

Finish Validation 0%| | 0/119 [00:00<?, ?it/s]Get map. 0%| | 0/119 [00:00<?, ?it/s] Traceback (most recent call last): File "/home/linux/data2/sun/centernet-pytorch-main/train.py", line 491, in epoch_step, epoch_step_val, gen, gen_val, UnFreeze_Epoch, Cuda, fp16, scaler, backbone, save_period, save_dir, local_rank) File "/home/linux/data2/sun/centernet-pytorch-main/utils/utils_fit.py", line 161, in fit_one_epoch eval_callback.on_epoch_end(epoch + 1, model_train) File "/home/linux/data2/sun/centernet-pytorch-main/utils/callbacks.py", line 211, in on_epoch_end self.get_map_txt(image_id, image, self.class_names, self.map_out_path) File "/home/linux/data2/sun/centernet-pytorch-main/utils/callbacks.py", line 145, in get_map_txt outputs = decode_bbox(outputs[0], outputs[1], outputs[2], self.confidence, self.cuda) IndexError: list index out of range

opened by sunsn1997 2
第一次尝试的新手提问

按照readme文档中的步骤 1 已解压VOC数据集至项目根目录，pth文件至model_data目录 2 已修改voc_annotation.py 中的annotation_mode为2 3 运行train.py

环境 pytorch1.2 + cuda10.0 +python3.6 ，Ubuntu 刚开始是使用的高版本torch和python，然后也尝试了python3.6+ torch1.2的环境,出现一样的问题

opened by Xie-Muxi 1

Releases(v3.0)

v3.0(Apr 22, 2022)
重要更新

支持step、cos学习率下降法。

支持adam、sgd优化器选择。

支持学习率根据batch_size自适应调整。

支持不同预测模式的选择，单张图片预测、文件夹预测、视频预测、图片裁剪、heatmap、各个种类目标数量计算。

更新summary.py文件，用于观看网络结构。

增加了多GPU训练。

Source code(tar.gz)
Source code(zip)
v2.0(Mar 4, 2022)
重要更新

更新train.py文件，增加了大量的注释，增加多个可调整参数。

更新predict.py文件，增加了大量的注释，增加fps、视频预测、批量预测等功能。

更新centernet.py文件，增加了大量的注释，增加先验框选择、置信度、非极大抑制等参数。

合并get_dr_txt.py、get_gt_txt.py和get_map.py文件，通过一个文件来实现数据集的评估。

更新voc_annotation.py文件，增加多个可调整参数。

更新summary.py文件，用于观看网络结构。

Source code(tar.gz)
Source code(zip)
v1.0(Dec 17, 2020)

Source code(tar.gz)
Source code(zip)
centernet_hourglass_coco.pth(730.32 MB)
centernet_resnet50_voc.pth(124.87 MB)

Owner

Bubbliiiing

GitHub Repository

PyTorch implementation of neural style transfer algorithm

neural-style-pt This is a PyTorch implementation of the paper A Neural Algorithm of Artistic Style by Leon A. Gatys, Alexander S. Ecker, and Matthias

770 Jan 02, 2023

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

HAIS Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021) by Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang*. (*) Corresp

145 Jan 05, 2023

PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability

PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability PCACE is a new algorithm for ranking neurons in a CNN architecture in order

4 Jan 04, 2022

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods Datasets Because of copyright issues, both the MalwareBazaar dataset

8 Oct 21, 2022

African language Speech Recognition - Speech-to-Text

Swahili-Speech-To-Text Table of Contents Swahili-Speech-To-Text Overview Scenario Approach Project Structure data: models: notebooks: scripts tests: l

2 Jan 05, 2023

Cross-platform CLI tool to generate your Github profile's stats and summary.

ghs Cross-platform CLI tool to generate your Github profile's stats and summary. Preview Hop on to examples for other usecases. Jump to: Installation

134 Dec 20, 2022

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

ALPHA 2.5: Frostbite Revival (Released 12/23/21) Changelog: [ UI ] Chained design. All steps link to one another! Use the master override toggles to s

87 Nov 16, 2022

Isaac Gym Reinforcement Learning Environments

714 Jan 08, 2023

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation: Work In Progress, Results can't be replicated yet with the m

196 Aug 30, 2022

Graph Transformer Architecture. Source code for

Graph Transformer Architecture Source code for the paper "A Generalization of Transformer Networks to Graphs" by Vijay Prakash Dwivedi and Xavier Bres

561 Jan 08, 2023

FinEAS: Financial Embedding Analysis of Sentiment 📈

FinEAS: Financial Embedding Analysis of Sentiment 📈 (SentenceBERT for Financial News Sentiment Regression) This repository contains the code for gene

31 Dec 13, 2022

Data & Code for ACCENTOR Adding Chit-Chat to Enhance Task-Oriented Dialogues

ACCENTOR: Adding Chit-Chat to Enhance Task-Oriented Dialogues Overview ACCENTOR consists of the human-annotated chit-chat additions to the 23.8K dialo

69 Dec 29, 2022

Pointer networks Tensorflow2

Pointer networks Tensorflow2 原文：https://arxiv.org/abs/1506.03134 仅供参考与学习，内含代码备注环境 tensorflow==2.6.0 tqdm matplotlib numpy 《pointer networks》阅读笔记应用场景

7 Oct 27, 2022

BLEURT is a metric for Natural Language Generation based on transfer learning.

BLEURT: a Transfer Learning-Based Metric for Natural Language Generation BLEURT is an evaluation metric for Natural Language Generation. It takes a pa

492 Jan 05, 2023

Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

MetaAdaptRank This repository provides the implementation of meta-learning to reweight synthetic weak supervision data described in the paper Few-Shot

5 Jun 16, 2022

Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition (AGRA, ACM 2020, Oral)

Cross Domain Facial Expression Recognition Benchmark Implementation of papers: Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchm

89 Dec 09, 2022

Hybrid Neural Fusion for Full-frame Video Stabilization

FuSta: Hybrid Neural Fusion for Full-frame Video Stabilization Project Page | Video | Paper | Google Colab Setup Setup environment for [Yu and Ramamoo

430 Jan 04, 2023

Selective Wavelet Attention Learning for Single Image Deraining

SWAL Code for Paper "Selective Wavelet Attention Learning for Single Image Deraining" Prerequisites Python 3 PyTorch Models We provide the models trai

9 Jun 17, 2022

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

BMW-Anonymization-Api Data privacy and individuals’ anonymity are and always have been a major concern for data-driven companies. Therefore, we design

148 Dec 21, 2022

A booklet on machine learning systems design with exercises

Machine Learning Systems Design Read this booklet here. This booklet covers four main steps of designing a machine learning system: Project setup Data

7.6k Jan 08, 2023

CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

Related tags

Overview

CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

目录

性能情况

所需环境

注意事项

文件下载

预测步骤

a、使用预训练权重

b、使用自己训练的权重

训练步骤

mAP目标检测精度计算更新

Reference

Comments

map指标

训练没有归一化，预测却有归一化，是不是有问题？

显示no mudule named 'past'

请问我改mobilenetv3的时候运行到第7批次就自动停止了是怎么回事呢

第一次尝试的新手提问

Releases(v3.0)

v3.0(Apr 22, 2022)

重要更新

v2.0(Mar 4, 2022)

重要更新

v1.0(Dec 17, 2020)

Owner

Bubbliiiing

PyTorch implementation of neural style transfer algorithm

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods

African language Speech Recognition - Speech-to-Text

Cross-platform CLI tool to generate your Github profile's stats and summary.

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

Isaac Gym Reinforcement Learning Environments

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Graph Transformer Architecture. Source code for

FinEAS: Financial Embedding Analysis of Sentiment 📈

Data & Code for ACCENTOR Adding Chit-Chat to Enhance Task-Oriented Dialogues

Pointer networks Tensorflow2

BLEURT is a metric for Natural Language Generation based on transfer learning.

Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition (AGRA, ACM 2020, Oral)

Hybrid Neural Fusion for Full-frame Video Stabilization

Selective Wavelet Attention Learning for Single Image Deraining

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

A booklet on machine learning systems design with exercises