⚾🤖⚾ Automatic baseball pitching overlay in realtime

Last update: Dec 05, 2022

Overview

⚾ Automatically overlaying pitch motion and trajectory with machine learning!

This project takes your baseball pitching clips and automatically generates the overlay. The input pitching clip could be directly from your phone or camera. The release point will be automatically detected by the program. This system will trace the trajectory and align all the videos to generate the overlay.

A fine-tuned YOLOv4 model is used to get the location of the ball. Then, I implemented SORT tracking algorithm to keep track of each individual ball. Lastly, I have applied some image registration techniques to deal with slight camera shift on each clip.

I'm still trying to improve it! Feel free to follow this project, also check out the Todo list.

The idea came from this incredible overlay.

💻 Getting Started

These instructions will get you a copy of the project, and generates your own pitching overlay clip!

Get a copy

Get a copy of this project by simply running the git clone command.

git clone https://github.com/chonyy/ML-auto-baseball-pitching-overlay.git

Prerequisites

Before running the project, we have to install all the dependencies from requirements.txt

pip install -r requirements.txt

Overlay!

Last, run the project with your own clips!

Try a sample

python pitching_overlay.py

Try with yout own clips

Place your pitching videos in a folder, then specify the path in the CLI.

python pitching_overlay.py --videos_folder "./videos/videos"

🔨 Project Structure

🎬 More Demo

☑️ Todo

Implement image registration to deal with camera shift
Build a demo web app for people to use it in realtime on web
Enable custom parameter tuning
Improve the visual effect
Write a Medium post to explain the technical workflow
Draw a structure diagram

⚾🤖⚾ Automatic baseball pitching overlay in realtime

Related tags

Overview

💻 Getting Started

Get a copy

Prerequisites

Overlay!

Try a sample

Try with yout own clips

🔨 Project Structure

🎬 More Demo

☑️ Todo

Owner

Tony Chou

Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

A Python implementation of the Locality Preserving Matching (LPM) method for pruning outliers in image matching.

The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation

Speedy Implementation of Instance-based Learning (IBL) agents in Python

Trying to understand alias-free-gan.

DCSAU-Net: A Deeper and More Compact Split-Attention U-Net for Medical Image Segmentation

A Multi-modal Perception Tracker (MPT) for speaker tracking using both audio and visual modalities

SNIPS: Solving Noisy Inverse Problems Stochastically

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

Implementation for the paper SMPLicit: Topology-aware Generative Model for Clothed People (CVPR 2021)

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning.

Lane assist for ETS2, built with the ultra-fast-lane-detection model.

Really awesome semantic segmentation

EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling

Official Implementation of SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"

(NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"

Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance