Code in support of this post: A Tale of Twenty-Two Million Citi Bikes: Analyzing the NYC Bike Share System
This repo provides scripts to download, process, and analyze data for NYC's Citi Bike system data. The data is stored in a PostgreSQL database, uses PostGIS for spatial calculations, and R for data analysis.
Pretty much a copy of the taxi/Uber data repo, at some point the Citi Bike, taxi, and Uber datasets could probably be combined into a single unified NYC transit database...
1. Install PostgreSQL and PostGIS
Both are available via Homebrew on Mac OS X
./download_raw_data.sh
./initialize_database.sh
./import_trips.sh
Additional Postgres and R scripts for analysis are in the analysis/
folder
These are bundled with the repository, so no need to download separately, but:
- Shapefile for NYC census tracts and neighborhood tabulation areas comes from Bytes of the Big Apple
- Central Park weather data comes from the National Climatic Data Center
todd@toddwschneider.com, or open a GitHub issue