JiaXu1024/nyc-citibike-data

 
 

NYC Citi Bike Data

Code in support of this post: A Tale of Twenty-Two Million Citi Bikes: Analyzing the NYC Bike Share System

This repo provides scripts to download, process, and analyze data from NYC's Citi Bike system. The data is stored in a PostgreSQL database, with PostGIS used for spatial calculations and R for data analysis.

This is pretty much a copy of the taxi/Uber data repo. At some point the Citi Bike, taxi, and Uber datasets could probably be combined into a single unified NYC transit database.

Instructions

1. Install PostgreSQL and PostGIS

Both are available via Homebrew on Mac OS X
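Before moving on, it can help to confirm that the required command-line tools are actually on the PATH. The snippet below is a minimal sketch: `check_prereqs` is a hypothetical helper, not part of this repository, and the tool names passed to it (`psql`, `Rscript`) are assumptions about what the later scripts call.

```shell
#!/bin/sh
# On macOS with Homebrew already installed, the usual install command is:
#   brew install postgresql postgis

# check_prereqs (hypothetical helper): report any named tool that is not on
# the PATH. Returns 0 if every tool is found, 1 otherwise.
check_prereqs() {
  missing=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}
```

For example, `check_prereqs psql Rscript` prints one line per missing tool and returns a non-zero status if any are absent.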

2. Download raw Citi Bike data

./download_raw_data.sh

3. Initialize database and set up schema

./initialize_database.sh

4. Import Citi Bike trip data into database and map to census tracts

./import_trips.sh
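Steps 2 through 4 must run in order, and each depends on the previous one succeeding. One way to chain them is sketched below. `run_pipeline` is a hypothetical wrapper, not something the repository provides, and it assumes the three scripts sit at the top level of the checkout and are executable.

```shell
#!/bin/sh
# run_pipeline (hypothetical helper): run the three setup scripts in order
# from the given repository directory, stopping at the first failure.
run_pipeline() {
  repo_dir="${1:-.}"
  for script in download_raw_data.sh initialize_database.sh import_trips.sh; do
    echo "running $script"
    # Run in a subshell so the caller's working directory is unchanged.
    (cd "$repo_dir" && ./"$script") || {
      echo "failed: $script" >&2
      return 1
    }
  done
}
```

Calling `run_pipeline /path/to/nyc-citibike-data` (path is a placeholder) runs all three steps. If the download or schema setup fails, the import is never attempted.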

5. Analysis

Additional Postgres and R scripts for analysis are in the analysis/ folder

Other data sources

These are bundled with the repository, so no need to download separately, but:

Questions/issues/contact

todd@toddwschneider.com, or open a GitHub issue
