Flight Delay Prediction

Our objective is to predict arrival delays of commercial flights. According to the US Department of Transportation, about 21% of commercial flights scheduled between June 2003 and October 2021 have experienced some form of delay. It is critical for airlines to estimate flight delays as accurately as possible in order to improve customer satisfaction and optimize the income of airline agencies. This project will be evaluated on the basis of arrival delay prediction accuracy for flights

Contributors

Jordan Silke
Jonas Bacareza

Understanding the problem

In an effort to understand some common causes of commercial flight delays, a number of sources were consulted including government agencies and flight-focused blog posts. A brief overview of findings can be found in the Research directory. These common causes will inform feature selection and engineering decisions.

Data description

Data was sourced from a LHL PostgreSQL database and descriptions were provided for each table. We used a custom script to extract the feature names from these description files and the raw data can be found here. The rationale behind missing value processing can be reviewed and reproduced by reading and executing the data_overview notebook. The data from the flights table included in this repository is a randomly sampled subset of the source table.

Recommended exploration

Task	Status
Test the hypothesis that the arrival delay is from Normal distribution and that mean of the delay is 0. Be careful about the outliers.	✅
Is average/median monthly delay different during the year? If so, which months have the biggest delays and what could be the reason?	✅
Does the weather affect the delay?	🧰
How are taxi times changing during the day? Does higher traffic lead to longer taxi times?	✅
What is the average percentage of delays that exist prior to departure (i.e. are arrival delays caused by departure delays)? Are airlines able to lower the delay during the flights?	✅
How many states cover 50% of US air traffic?	✅
Test the hypothesis that planes fly faster when there is a departure delay.	✅
When (which hour) do most 'LONG', 'SHORT', 'MEDIUM' haul flights take off?	🔳
Find the top 10 the bussiest airports. Does the greatest number of flights mean that the majority of passengers went through a given airport? How much traffic do these 10 airports cover?	🔳
Do bigger delays lead to bigger fuel consumption per passenger?	🔳

🔳 - To do.
✅ - Core task 'complete' (at least a first pass).
🧰 - Work in progress.

Exploration task results can be found here

Predicting the duration of arrival delays for commercial flights.

Related tags

Overview

Flight Delay Prediction

Contributors

Understanding the problem

Data description

Recommended exploration

Owner

Jordan Silke

[Arxiv preprint] Causality-inspired Single-source Domain Generalization for Medical Image Segmentation (code&data-processing pipeline)

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

GitHub repository for "Improving Video Generation for Multi-functional Applications"

[ICLR 2022] Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics

TensorFlow 2 AI/ML library wrapper for openFrameworks

Hepsiburada - Hepsiburada Urun Bilgisi Cekme

Tracking code for the winner of track 1 in the MMP-Tracking Challenge at ICCV 2021 Workshop.

Explainable Zero-Shot Topic Extraction

make ASCII Art by Deep Learning

Semiconductor Machine learning project

This is a code repository for paper OODformer: Out-Of-Distribution Detection Transformer

Linear image-to-image translation

Implementation of " SESS: Self-Ensembling Semi-Supervised 3D Object Detection" (CVPR2020 Oral)

Semantic Segmentation Architectures Implemented in PyTorch

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

Optimize Trading Strategies Using Freqtrade

J.A.R.V.I.S is an AI virtual assistant made in python.

This was initially the repo for the project of [email protected] of Asaf Mazar, Millad Kassaie and Georgios Chochlakis named "Powered by the Will? Exploring Lay Theories of Behavior Change through Social Media"

tensorrt int8 量化yolov5 4.0 onnx模型

Implementation of MeMOT - Multi-Object Tracking with Memory - in Pytorch