🛰️ Awesome Satellite Imagery Datasets

Overview

Awesome Satellite Imagery Datasets Awesome

List of aerial and satellite imagery datasets with annotations for computer vision and deep learning. Newest datasets at the top of each category (Instance segmentation, object detection, semantic segmentation, scene classification, other).

Recent additions and ongoing competitions

1. Instance Segmentation

  • PASTIS: Panoptic Agricultural Satellite TIme Series (IGN, July 2021)
    124,422 Agricultural parcels, 2,433 Sentinel-2 image chip timeseries, France, panoptic labels (instance index + semantic label for each pixel). Paper: Garnot & Landrieu 2021

  • SpaceNet 7: Multi-Temporal Urban Development Challenge (CosmiQ Works, Planet, Aug 2020)
    Monthly building footprints and Planet imagery (4m. res) timeseries for 2 years, 100 locations around the globe, for building footprint evolution & address propagation.

  • RarePlanes: Synthetic Data Takes Flight (CosmiQ Works, A.I.Reverie, June 2020)
    Synthetic (630k planes, 50k images) and real (14.7k planes, 253 Worldview-3 images (0.3m res.), 122 locations, 22 countries) plane annotations & properties and satellite images. Tools. Paper: Shermeyer et al. 2020

  • SpaceNet: Multi-Sensor All-Weather Mapping (CosmiQ Works, Capella Space, Maxar, AWS, Intel, Feb 2020)
    48k building footprints (enhanced 3DBAG dataset, building height attributes), Capella Space SAR data (0.5m res., four polarizations) & Worldview-3 imagery (0.3m res.), Rotterdam, Netherlands.

  • Agriculture-Vision Database & CVPR 2020 challenge (UIUC, Intelinair, CVPR, Jan 2020)
    Agricultural Pattern Analysis, 21k aerial farmland images (RGB-NIR, USA, 2019 season, 512x512px chips), label masks for 6 field anomaly patterns (Cloud shadow, Double plant, Planter skip, Standing Water, Waterway and Weed cluster). Paper: Chiu et al. 2020

  • iSAID: Large-scale Dataset for Object Detection in Aerial Images (IIAI & Wuhan University, Dec 2019)
    15 categories from plane to bridge, 188k instances, object instances and segmentation masks (MS COCO format), Google Earth & JL-1 image chips, Faster-RCNN baseline model (MXNet), devkit, Academic use only, replaces DOTA dataset, Paper: Zamir et al. 2019

  • xView 2 Building Damage Asessment Challenge (DIUx, Nov 2019) .
    550k building footprints & 4 damage scale categories, 20 global locations and 7 disaster types (wildfire, landslides, dam collapses, volcanic eruptions, earthquakes/tsunamis, wind, flooding), Worldview-3 imagery (0.3m res.), pre-trained baseline model. Paper: Gupta et al. 2019

  • Microsoft BuildingFootprints Canada & USA & Uganda/Tanzania & Australia (Microsoft, Mar 2019)
    12.6mil (Canada) & 125.2mil (USA) & 17.9mil (Uganda/Tanzania) & 11.3mil (Australia) building footprints, GeoJSON format, delineation based on Bing imagery using ResNet34 architecture.

  • SpaceNet 4: Off-Nadir Buildings (CosmiQ Works, DigitalGlobe, Radiant Solutions, AWS, Dec 2018)
    126k building footprints (Atlanta), 27 WorldView 2 images (0.3m res.) from 7-54 degrees off-nadir angle. Bi-cubicly resampled to same number of pixels in each image to counter courser native resolution with higher off-nadir angles, Paper: Weir et al. 2019

  • Airbus Ship Detection Challenge (Airbus, Nov 2018)
    131k ships, 104k train / 88k test image chips, satellite imagery (1.5m res.), raster mask labels in in run-length encoding format, Kaggle kernels.

  • Open AI Challenge: Tanzania (WeRobotics & Wordlbank, Nov 2018)
    Building footprints & 3 building conditions, RGB UAV imagery - Link to data

  • LPIS agricultural field boundaries Denmark - Netherlands - France
    Annual datasets. Denmark: 293 crop/vegetation catgeories, 600k parcels. Netherlands: 294 crop/vegetation catgeories, 780k parcels

  • CrowdAI Mapping Challenge (Humanity & Inclusion NGO, May 2018)
    Buildings footprints, RGB satellite imagery, COCO data format

  • SpaceNet 2: Building Detection v2 (CosmiQ Works, Radiant Solutions, NVIDIA, May 2017)
    685k building footprints, 3/8band Worldview-3 imagery (0.3m res.), 5 cities, SpaceNet Challenge Asset Library

  • SpaceNet 1: Building Detection v1 (CosmiQ Works, Radiant Solutions, NVIDIA, Jan 2017)
    Building footprints (Rio de Janeiro), 3/8band Worldview-3 imagery (0.5m res.), SpaceNet Challenge Asset Library

2. Object Detection

3. Semantic Segmentation

  • LoveDA (Wuhan University, Oct 2021)
    5987 image chips (Google Earth), 7 landcover categories, 166768 labels, 3 cities in China. Paper: Wang et al., 2021

  • FloodNet Challenge (UMBC, Microsoft, Texas A&M, Dewberry, May 2021)
    2343 UAV images from after Hurricane Harvey, landcover labels (10 categories, e.g. building flooded, building non-flooded, road-flooded, ..), 2 competition tracks (Binary & semantic flood classification; Object counting & condition recognition)

  • Dynamic EarthNet Challenge (Planet, DLR, TUM, April 2021)
    Weekly Planetscope time-series (3m res.) over 2 years, 75 aois, landcover labels (7 categories), 2 competition tracks (Binary land cover classification & multi-class change detection)

  • Sentinel-2 Cloud Mask Catalogue (Francis, A., et al., Nov 2020) 513 cropped subscenes (1022x1022 pixels) taken randomly from entire 2018 Sentinel-2 archive. All bands resampled to 20m, stored as numpy arrays. Includes clear, cloud and cloud-shadow classes. Also comes with binary classification tags for each subscene, describing what surface types, cloud types, etc. are present.

  • LandCoverNet: A Global Land Cover Classification Training Dataset (Alemohammad S.H., et al., Jul 2020) Version 1.0 of the dataset that contains data across Africa, (20% of the global dataset). 1980 image chips of 256 x 256 pixels in V1.0 spanning 66 tiles of Sentinel-2. Classes: water, natural bare ground, artificial bare ground, woody vegetation, cultivated vegetation, (semi) natural vegetation, and permanent snow/ice. Citation: Alemohammad S.H., et al., 2020 and blog post

  • LandCover.ai: Dataset for Automatic Mapping of Buildings, Woodlands and Water from Aerial Imagery (Boguszewski, A., et al., May 2020) 41 orthophotos (9000x9000 px) over Poland, Aerial Imagery (25cm & 50cm res.), manual segmentations masks for Buildings, Woodland and Water, Paper: Boguszewski et al., 2020

  • 95-Cloud: A Cloud Segmentation Dataset (S. Mohajerani et. all, Jan 2020)
    34701 manually segmented 384x384 patches with cloud masks, Landsat 8 imagery (R,G,B,NIR; 30 m res.), Paper: Mohajerani et al. 2021

  • Open Cities AI Challenge (GFDRR, Mar 2020) .
    790k building footprints from Openstreetmap (2 label quality categories), aerial imagery (0.03-0.2m resolution, RGB, 11k 1024x1024 chips, COG format), 10 cities in Africa.

  • DroneDeploy Segmentation Dataset (DroneDeploy, Dec 2019)
    Drone imagery (0.1m res., RGB), labels (7 land cover catageories: building, clutter, vegetation, water, ground, car) & elevation data, baseline model implementation.

  • SkyScapes: Urban infrastructure & lane markings (DLR, Nov 2019)
    Highly accurate street lane markings (12 categories e.g. dash line, long line, zebra zone) & urban infrastructure (19 categories e.g. buildings, roads, vegetation). Aerial imagery (0.13 m res.) for 5.7 km2 of Munich, Germany. Paper: Azimi et al. 2019

  • Open AI Challenge: Caribbean (MathWorks, WeRobotics, Wordlbank, DrivenData, Dec 2019)
    Predict building roof type (5 categories, e.g. concrete, metal etc.) of provided building footprints (22,553), RGB UAV imagery (4cm res., 7 areas in 3 Carribbean countries)

  • SpaceNet 5: Automated Road Network Extraction & Route Travel Time Estimation (CosmiQ Works, Maxar, Intel, AWS, Sep 2019)
    2300 image chips, street geometries with location, shape and estimated travel time, 3/8band Worldview-3 imagery (0.3m res.), 4 global cities, 1 holdout city for leaderboard evaluation, APLS metric, baseline model

  • SEN12MS (TUM, Jun 2019)
    180,748 corresponding image triplets containing Sentinel-1 (VV&VH), Sentinel-2 (all bands, cloud-free), and MODIS-derived land cover maps (IGBP, LCCS, 17 classes, 500m res.). All data upsampled to 10m res., georeferenced, covering all continents and meterological seasons, Paper: Schmitt et al. 2018

  • Slovenia Land Cover Classification (Sinergise, Feb 2019)
    10 land cover classes, temporal stack of hyperspectral Sentinel-2 imagery (R,G,B,NIR,SWIR1,SWIR2; 10 m res.) for year 2017 with cloud masks, Official Slovenian land use land cover layer as ground truth.

  • ALCD Reference Cloud Masks (CNES, Oct 2018)
    8 classes (inc. cloud and cloud shadow) for 38 Sentinel-2 scenes (10 m res.). Manual labeling & active learning, Paper: Baetens et al. 2019

  • Agricultural Crop Cover Classification Challenge (CrowdANALYTIX, Jul 2018)
    2 main categories corn and soybeans, Landsat 8 imagery (30m res.), USDA Cropland Data Layer as ground truth.

  • SpaceNet 3: Road Network Detection (CosmiQ Works, Radiant Solutions, Feb 2018)
    8000 km of roads in 5 city aois, 3/8band Worldview-3 imagery (0.3m res.), SpaceNet Challenge Asset Library, Paper: Van Etten et al. 2018

  • Urban 3D Challenge (USSOCOM, Dec 2017)
    157k building footprint masks, RGB orthophotos (0.5m res.), DSM/DTM, 3 cities, SpaceNet Challenge Asset Library

  • DSTL Satellite Imagery Feature Detection Challenge (Dstl, Feb 2017)
    10 land cover categories from crops to vehicle small, 57 1x1km images, 3/16-band Worldview 3 imagery (0.3m-7.5m res.), Kaggle kernels

  • SPARCS: S2 Cloud Validation data (USGS, 2016)
    7 categories (cloud, cloud shadows, cloud shadows over water, water etc.), 80 1kx1k px. subset Landsat 8 scenes (30m res.), Paper: Hughes, J.M. & Hayes D.J. 2014

  • Biome: L8 Cloud Cover Validation data (USGS, 2016)
    4 cloud categories (cloud, thin cloud, cloud shadows, clear), 96 Landsat 8 scenes (30m res.), 12 biomes with 8 scenes each, Paper: Foga et al. 2017

  • Inria Aerial Image Labeling (inria.fr)
    Building footprint masks, RGB aerial imagery (0.3m res.), 5 cities

  • ISPRS Potsdam 2D Semantic Labeling Contest (ISPRS)
    6 urban land cover classes, raster mask labels, 4-band RGB-IR aerial imagery (0.05m res.) & DSM, 38 image patches

4. Scene classification

  • Airbus Wind Turbine Patches (Airbus, Mar 2021)
    155k 128x128px image chips with wind turbines (SPOT, 1.5m res.).

  • BigEarthNet: Large-Scale Sentinel-2 Benchmark (TU Berlin, Jan 2019)
    Multiple landcover labels per chip based on CORINE Land Cover (CLC) 2018, 590,326 chips from Sentinel-2 L2A scenes (125 Sentinel-2 tiles from 10 European countries, 2017/2018), 66 GB archive, Paper: Sumbul et al. 2019

  • WiDS Datathon 2019 : Detection of Oil Palm Plantations (Global WiDS Team & West Big Data Innovation Hub, Jan 2019) Prediction of presence of oil palm plantations, Planet satellite imagery (3m res.)., ca. 20k 256 x 256 pixel chips, 2 categories oil-palm and other, annotator confidence score.

  • So2Sat LCZ42 (TUM Munich & DLR, Aug 2018)
    Local climate zone classification, 17 categories (10 urban e.g. compact high-rise, 7 rural e.g. scattered trees), 400k 32x32 pixel chips covering 42 cities (LCZ42 dataset), Sentinel 1 & Sentinel 2 (both 10m res.), 51 GB

  • Cactus Aerial Photos (CONACYT Mexico, Jun 2018)
    17k aerial photos, 13k cactus, 4k non-actus, Kaggle kernels, Paper: López-Jiménez et al. 2019

  • Statoil/C-CORE Iceberg Classifier Challenge (Statoil/C-CORE, Jan 2018)
    2 categories ship and iceberg, 2-band HH/HV polarization SAR imagery, Kaggle kernels

  • Functional Map of the World Challenge (IARPA, Dec 2017)
    63 categories from solar farms to shopping malls, 1 million chips, 4/8 band satellite imagery (0.3m res.), COCO data format, baseline models, Paper: Christie et al. 2017

  • EuroSAT (DFK, Aug 2017)
    10 land cover categories from industrial to permanent crop, 27k 64x64 pixel chips, 3/16 band Sentinel-2 satellite imagery (10m res.), covering cities in 30 countries, Paper: Helber et al. 2017

  • Planet: Understanding the Amazon from Space (Planet, Jul 2017)
    13 land cover categories + 4 cloud condition categories, 4-band (RGB-NIR) satelitte imagery (5m res.), Amazonian rainforest, Kaggle kernels

  • AID: Aerial Scene Classification (Xia et al., 2017)
    10000 aerial images within 30 categories (airport, bare land, baseball field, beach, bridge, ...) collected from Google Earth imagery. Paper: Xia et al. 2017

  • RESISC45 (Northwestern Polytechnical University NWPU, Mar 2017)
    45 scene categories from airplane to wetland, 31,500 images (700 per category, 256x256 px), image chips taken from Google Earth (rich image variations in resolution, angle, geography all over the world), Download Link, Paper: Cheng et al. 2017

  • Deepsat: SAT-4/SAT-6 airborne datasets (Louisiana State University, 2015)
    6 land cover categories, 400k 28x28 pixel chips, 4-band RGBNIR aerial imagery (1m res.) extracted from the 2009 National Agriculture Imagery Program (NAIP), Paper: Basu et al. 2015

  • UC Merced Land Use Dataset (UC Merced, Oct 2010)
    21 land cover categories from agricultural to parkinglot, 100 chips per class, aerial imagery (0.30m res.), Paper: Yang & Newsam 2010

5. Other Focus / Multiple Tasks

More Resources

Owner
Christoph Rieke
Geospatial Engineer
Christoph Rieke
Model-free Vehicle Tracking and State Estimation in Point Cloud Sequences

Model-free Vehicle Tracking and State Estimation in Point Cloud Sequences 1. Introduction This project is for paper Model-free Vehicle Tracking and St

TuSimple 92 Jan 03, 2023
KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch

KoRean based ELECTRA (KR-ELECTRA) This is a release of a Korean-specific ELECTRA model with comparable or better performances developed by the Computa

12 Jun 03, 2022
Baseline powergrid model for NY

Baseline-powergrid-model-for-NY Table of Contents About The Project Built With Usage License Contact Acknowledgements About The Project As the urgency

Anderson Energy Lab at Cornell 6 Nov 24, 2022
Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

The Boombox: Visual Reconstruction from Acoustic Vibrations Boyuan Chen, Mia Chiquier, Hod Lipson, Carl Vondrick Columbia University Project Website |

Boyuan Chen 12 Nov 30, 2022
TensorFlow implementation of Deep Reinforcement Learning papers

Deep Reinforcement Learning in TensorFlow TensorFlow implementation of Deep Reinforcement Learning papers. This implementation contains: [1] Playing A

Taehoon Kim 1.6k Jan 03, 2023
dyld_shared_cache processing / Single-Image loading for BinaryNinja

Dyld Shared Cache Parser Author: cynder (kat) Dyld Shared Cache Support for BinaryNinja Without any of the fuss of requiring manually loading several

cynder 76 Dec 28, 2022
Patch2Pix: Epipolar-Guided Pixel-Level Correspondences [CVPR2021]

Patch2Pix for Accurate Image Correspondence Estimation This repository contains the Pytorch implementation of our paper accepted at CVPR2021: Patch2Pi

Qunjie Zhou 199 Nov 29, 2022
Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning

Automated Side Channel Analysis of Media Software with Manifold Learning Official implementation of USENIX Security 2022 paper: Automated Side Channel

Yuanyuan Yuan 175 Jan 07, 2023
🐸STT integration examples

🐸 STT 0.9.x Examples These are various examples on how to use or integrate 🐸 STT using our packages. It is a good way to just try out 🐸 STT before

coqui 92 Dec 19, 2022
The Video-based Accident Detection System built in Python

Accident-detection-system About the Project This Repository contains the Video-based Accident Detection System built in Python. Contributors Yukta Gop

SURYAVANSHI SNEHAL BALKRISHNA 50 Dec 07, 2022
Redash reset for python

redash-reset This will use a default REDASH_SECRET_KEY key of c292a0a3aa32397cdb050e233733900f this allows you to reset the password of the user ID bu

Robert Wiggins 5 Nov 14, 2022
PyTorch implementation of residual gated graph ConvNets, ICLR’18

Residual Gated Graph ConvNets April 24, 2018 Xavier Bresson http://www.ntu.edu.sg/home/xbresson https://github.com/xbresson https://twitter.com/xbress

Xavier Bresson 112 Aug 10, 2022
keyframes-CNN-RNN(action recognition)

keyframes-CNN-RNN(action recognition) Environment: python=3.7 pytorch=1.2 Datasets: Following the format of UCF101 action recognition. Run steps: Mo

4 Feb 09, 2022
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

Conceptual 12M We introduce the Conceptual 12M (CC12M), a dataset with ~12 million image-text pairs meant to be used for vision-and-language pre-train

Google Research Datasets 226 Dec 07, 2022
使用深度学习框架提取视频硬字幕;docker容器免安装深度学习库,使用本地api接口使得界面和后端识别分离;

extract-video-subtittle 使用深度学习框架提取视频硬字幕; 本地识别无需联网; CPU识别速度可观; 容器提供API接口; 运行环境 本项目运行环境非常好搭建,我做好了docker容器免安装各种深度学习包; 提供windows界面操作; 容器为CPU版本; 视频演示 https

歌者 16 Aug 06, 2022
Use deep learning, genetic programming and other methods to predict stock and market movements

StockPredictions Use classic tricks, neural networks, deep learning, genetic programming and other methods to predict stock and market movements. Both

Linda MacPhee-Cobb 386 Jan 03, 2023
No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency

This repository contains the implementation for the paper: No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consiste

Alireza Golestaneh 75 Dec 30, 2022
Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms

Open-L2O This repository establishes the first comprehensive benchmark efforts of existing learning to optimize (L2O) approaches on a number of proble

VITA 161 Jan 02, 2023
3D ResNet Video Classification accelerated by TensorRT

Activity Recognition TensorRT Perform video classification using 3D ResNets trained on Kinetics-400 dataset and accelerated with TensorRT P.S Click on

Akash James 39 Nov 21, 2022
Tutorials, assignments, and competitions for MIT Deep Learning related courses.

MIT Deep Learning This repository is a collection of tutorials for MIT Deep Learning courses. More added as courses progress. Tutorial: Deep Learning

Lex Fridman 9.5k Jan 07, 2023