Sample code for Harry's Airflow online training course

Last update: Dec 30, 2022

Related tags

Data Analysis airflow_course

Overview

Sample code for Harry's Airflow online training course

You can find the videos on YouTube or Bilibili.

I am working on adding the following:

the slide PDF files (done)
another video about creating custom operators
a docker-compose setup with the CeleryExecutor

Active Learning demo using two small datasets

ActiveLearningDemo: How to run. Step one: put the dataset folder and use the command below to split the dataset into the required structure: run utils.py. For ea

3 Nov 10, 2021

PATC: Introduction to Big Data Analytics. Practical Data Analytics for Solving Real World Problems

1 Feb 07, 2022

Statistical Analysis 📈 focused on statistical analysis and exploration used on various data sets for personal and professional projects.

Statistical Analysis 📈 This repository focuses on statistical analysis and the exploration used on various data sets for personal and professional pr

1 Sep 03, 2022

Learn machine learning the fun way, with Oracle and RedBull Racing

Red Bull Racing Analytics Hands-On Labs Introduction Are you interested in learning machine learning (ML)? How about doing this in the context of the

55 Oct 24, 2022

Creates an OHLC time series from CSV files downloaded from the NSE India website and builds a SQLite database for your research.

NSE-timeseries-form-CSV-file-creator-and-SQL-appender- This creates an OHLC time series from CSV files downloaded from National Stock Exchange India (NS

1 Oct 02, 2022

Bearsql lets you query pandas DataFrames with SQL syntax.

Bearsql adds SQL syntax to pandas DataFrames. It uses DuckDB to speed up pandas processing and as the SQL engine.

14 Jun 22, 2022

Clean and reusable data-sciency notebooks.

KPACUBO: KPACUBO is a set of Jupyter notebooks focused on best practices in both software development and data science, namely, code reuse, explicit d

1 Jan 28, 2022

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

2 Dec 01, 2021

Includes all files needed to satisfy hw02 requirements

HW 02 Data Sets Mean Scale Score for Asian and Hispanic Students, Grades 3 - 8 This dataset provides insights into the New York City education system

7 Oct 28, 2021

An experimental project I'm undertaking for the sole purpose of increasing my Python knowledge

5ePy is an experimental project I'm undertaking for the sole purpose of increasing my Python knowledge. #Goals Goal: Create a working, albeit lightwei

1 Nov 24, 2021

Streamz helps you build pipelines to manage continuous streams of data

Streamz helps you build pipelines to manage continuous streams of data. It is simple to use in simple cases, but also supports complex pipelines that involve branching, joining, flow control, feedbac

1.1k Dec 28, 2022

Data-sets from the survey and analysis

bachelor-thesis "Umfragewerte.xlsx" contains the original survey results. "umfrage_alle.csv" contains the survey results but one participant is cancele

1 Jan 26, 2022

📊 Python Flask game that consolidates data from Nasdaq, allowing the user to practice buying and selling stocks.

Web Trader Web Trader is a trading website that consolidates data from Nasdaq, allowing the user to search up the ticker symbol and price of any stock

21 Aug 30, 2022

A COVID data pipeline built with PySpark and MySQL that collects a data stream from an API, processes it, and stores it in a MySQL database.

2 Nov 20, 2021

An Aspiring Drop-In Replacement for NumPy at Scale

A real data analysis and modeling project - restaurant inspections

Utilize data analytics skills to solve real-world business problems using Humana’s big data

t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology.

Converts Tagtog annotations into a CSV dataset

Average time per match by division