YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Last update: Jan 01, 2023

Overview

YOLTv5

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks.
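The text above does not say how an image far larger than the detector's input size is handled. A minimal sketch, assuming the usual sliding-window approach of cutting the scene into overlapping tiles that each fit the detector: the function name `tile_windows`, the 640-pixel tile, and the 128-pixel overlap are illustrative assumptions, not values taken from this repository.

```python
from typing import Iterator, Tuple

Window = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixel coordinates


def tile_windows(width: int, height: int,
                 tile: int = 640, overlap_px: int = 128) -> Iterator[Window]:
    """Yield overlapping windows that together cover a width x height image.

    Hypothetical helper for illustration; tile and overlap_px are assumed values.
    """
    if not 0 <= overlap_px < tile:
        raise ValueError("overlap_px must be in [0, tile)")
    stride = tile - overlap_px

    def starts(size: int) -> list:
        # An image smaller than one tile needs a single window.
        if size <= tile:
            return [0]
        offsets = list(range(0, size - tile, stride))
        # Add a final window flush with the far edge so nothing is missed.
        offsets.append(size - tile)
        return offsets

    for y0 in starts(height):
        for x0 in starts(width):
            yield x0, y0, min(x0 + tile, width), min(y0 + tile, height)


if __name__ == "__main__":
    for window in tile_windows(1000, 700):
        print(window)
```

The overlap is there so that an object cut by one tile boundary appears whole in a neighbouring tile; the price is that the same object can be detected twice and must be de-duplicated afterwards.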

YOLTv5 builds upon YOLT and SIMRDWN, updating these frameworks to use the YOLOv5 version of the YOLO object detection family. This repository performs about as well as the Darknet-based YOLTv4 repository; we provide YOLTv5 for users who prefer a PyTorch backend.

Below, we provide examples of how to use this repository with the open-source SpaceNet dataset.

Running YOLTv5

0. Installation (Preliminary)

YOLTv5 is built to execute on a GPU-enabled machine.

cd yoltv5/yolov5
pip install -r requirements.txt

# update with geo packages
conda install -c conda-forge gdal
conda install -c conda-forge osmnx=0.12
conda install -c conda-forge scikit-image
conda install -c conda-forge statsmodels
pip install torchsummary
pip install utm
pip install numba
pip install jinja2==2.10
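After running the commands above, it can help to confirm that the packages are importable before starting a long training run. The following is a minimal sketch using only the standard library; the helper name `missing_modules` is hypothetical, and the mapping reflects the usual import names (for example, gdal is imported as `osgeo` and scikit-image as `skimage`).

```python
import importlib.util

# Package name as installed -> module name as imported.
REQUIRED_MODULES = {
    "torch": "torch",
    "gdal": "osgeo",
    "osmnx": "osmnx",
    "scikit-image": "skimage",
    "statsmodels": "statsmodels",
    "torchsummary": "torchsummary",
    "utm": "utm",
    "numba": "numba",
    "jinja2": "jinja2",
}


def missing_modules(modules=None):
    """Return the sorted package names whose import module cannot be found."""
    modules = REQUIRED_MODULES if modules is None else modules
    return sorted(
        package
        for package, module in modules.items()
        if importlib.util.find_spec(module) is None
    )


if __name__ == "__main__":
    missing = missing_modules()
    print("Missing packages:", ", ".join(missing) if missing else "none")
    if "torch" not in missing:
        import torch
        # YOLTv5 is described as built for a GPU-enabled machine.
        print("CUDA available:", torch.cuda.is_available())
```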

1. Train

Prepare the training data with prep_train.py. Then, to train a model, run:

cd /yoltv5
python yolov5/train.py --img 640 --batch 16 --epochs 100 --data yoltv5_train_vehicles_8cat.yaml --weights yolov5l.pt
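The `--data` argument names a YOLOv5 dataset file. As a rough sketch of what such a file contains, the snippet below renders the classic YOLOv5 layout (`train`, `val`, `nc`, `names`). The paths and class names are placeholders, not the contents of yoltv5_train_vehicles_8cat.yaml, and the helper `render_dataset_yaml` is hypothetical.

```python
from typing import Sequence


def render_dataset_yaml(train_path: str, val_path: str,
                        class_names: Sequence[str]) -> str:
    """Render a dataset description in the classic YOLOv5 YAML layout."""
    names = ", ".join(f"'{name}'" for name in class_names)
    return (
        f"train: {train_path}\n"         # training images (directory or list file)
        f"val: {val_path}\n"             # validation images
        f"nc: {len(class_names)}\n"      # number of classes
        f"names: [{names}]\n"            # class names, indexed by label id
    )


if __name__ == "__main__":
    # Placeholder values only: eight generic class names and made-up paths.
    placeholder_classes = [f"class_{i}" for i in range(8)]
    print(render_dataset_yaml("/path/to/train/images",
                              "/path/to/val/images",
                              placeholder_classes))
```

The `nc` value must match the length of `names`, which is why the sketch derives it from the list instead of taking it as a separate argument.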

2. Test

Edit yoltv5_test_vehicles_8cat.yaml (in the configs directory) so that it points to the appropriate locations, then run the test.sh script:

cd yoltv5
./test.sh ../configs/yoltv5_test_vehicles_8cat.yaml
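When a large image is processed in overlapping tiles, per-tile detections have to be shifted back into full-image coordinates and duplicates from the overlap regions removed. The sketch below illustrates that idea with a simple class-agnostic non-maximum suppression. It is an assumption about the general technique, not a description of what test.sh does internally, and the names `to_global`, `iou`, and `nms` are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float, float]  # (x0, y0, x1, y1, score)


def to_global(boxes: List[Box], x_off: float, y_off: float) -> List[Box]:
    """Shift tile-local boxes by the tile's offset in the full image."""
    return [(x0 + x_off, y0 + y_off, x1 + x_off, y1 + y_off, s)
            for x0, y0, x1, y1, s in boxes]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes (scores are ignored)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], iou_thresh: float = 0.5) -> List[Box]:
    """Keep the highest-scoring box among any group that overlaps heavily."""
    kept: List[Box] = []
    for box in sorted(boxes, key=lambda b: b[4], reverse=True):
        if all(iou(box, k) <= iou_thresh for k in kept):
            kept.append(box)
    return kept


if __name__ == "__main__":
    # The same object seen in two overlapping tiles, plus one distinct object.
    tile_a = to_global([(500, 100, 600, 200, 0.9)], 0, 0)
    tile_b = to_global([(140, 100, 240, 200, 0.8), (10, 10, 50, 50, 0.7)], 360, 0)
    for box in nms(tile_a + tile_b):
        print(box)
```

A production pipeline would typically run suppression per class and use a vectorized implementation; the quadratic loop here is kept for readability.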

Outputs will look something like the figure below:

Owner

Adam Van Etten
