Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Last update: Dec 01, 2021

Related tags

Overview

opendata

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format.

import asyncio
from opendata.sources.bikeshare.bay_wheels import trips as bay_wheels

trips_df, _ = asyncio.run(bay_wheels.async_load(trip_sample_rate=1000))

len(trips_df.index)
# 8731

trips_df.columns
# Index(['started_at', 'ended_at', 'start_station_id', 'end_station_id',
#        'start_station_name', 'end_station_name', 'rideable_type', 'ride_id',
#        'start_lat', 'start_lng', 'end_lat', 'end_lng', 'gender', 'user_type',
#        'bike_id', 'birth_year'],
#       dtype='object')

An example analysis can be found here: https://observablehq.com/@brady/bikeshare

Supports sampling and local file caching to improve performance.

Markets supported

import opendata.sources.bikeshare.bay_wheels
import opendata.sources.bikeshare.bixi
import opendata.sources.bikeshare.divvy
import opendata.sources.bikeshare.capital_bikeshare
import opendata.sources.bikeshare.citi_bike
import opendata.sources.bikeshare.cogo
import opendata.sources.bikeshare.niceride
import opendata.sources.bikeshare.bluebikes
import opendata.sources.bikeshare.metro_bike_share
import opendata.sources.bikeshare.indego

Bootstrap

Set up your environment

brew install chromedriver
brew install python3
python3 -m pip install pre-commit

pre-commit install --install-hooks
python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Entering virtualenv

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Usage

Try the test export to CSV:

python3 test.py

Updating pip requirements

pip-compile

Pre-commit setup

pre-commit install --install-hooks

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Related tags

Overview

opendata

Markets supported

Bootstrap

Entering virtualenv

Usage

Updating pip requirements

Pre-commit setup

Bikeshare markets to add

USA

World

Owner

Brady Law

Python Implementation of Scalable In-Memory Updatable Bitmap Indexing

EOD Historical Data Python Library (Unofficial)

sportsdataverse python package

An easy-to-use feature store

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

2019 Data Science Bowl

High Dimensional Portfolio Selection with Cardinality Constraints

Exploring the Top ML and DL GitHub Repositories

Cleaning and analysing aggregated UK political polling data.

Deep universal probabilistic programming with Python and PyTorch

Weather analysis with Python, SQLite, SQLAlchemy, and Flask

Import, connect and transform data into Excel

Data Intelligence Applications - Online Product Advertising and Pricing with Context Generation

NFCDS Workshop Beginners Guide Bioinformatics Data Analysis

Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames

Catalogue data - A Python Scripts to prepare catalogue data

ICLR 2022 Paper submission trend analysis

Provide a market analysis (R)

Statistical Rethinking course winter 2022

Vectorizers for a range of different data types