We prepared a dataset from GH Archive that contains every event in every public GitHub repository since 2011, in a structured format. The dataset is loaded into ClickHouse, where it holds 3.1 billion records. We redistribute it for research purposes, and it can be downloaded at this direct link. It can help answer almost any question about GitHub that you can imagine.
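To give a feel for the kind of question event data can answer, here is a minimal Python sketch that counts stars per repository. It assumes the raw GH Archive JSON shape, where each event has a `type` field and a nested `repo.name` field, and where starring a repository is recorded as a `WatchEvent`. The sample records and the `count_stars` helper are made up for illustration; the table and column names used in ClickHouse may differ.

```python
import json
from collections import Counter

# Illustrative, made-up records in the assumed GH Archive JSON shape
# (one JSON event per line). These are not real data.
SAMPLE_LINES = [
    '{"type": "WatchEvent", "repo": {"name": "example/alpha"}}',
    '{"type": "PushEvent", "repo": {"name": "example/alpha"}}',
    '{"type": "WatchEvent", "repo": {"name": "example/beta"}}',
    '{"type": "WatchEvent", "repo": {"name": "example/alpha"}}',
]


def count_stars(lines):
    """Count WatchEvent records (stars) per repository name.

    Hypothetical helper: assumes every record has "type" and "repo.name".
    """
    stars = Counter()
    for line in lines:
        event = json.loads(line)
        if event.get("type") == "WatchEvent":
            stars[event["repo"]["name"]] += 1
    return stars


if __name__ == "__main__":
    # Print repositories ordered by star count, highest first.
    for repo, n in count_stars(SAMPLE_LINES).most_common():
        print(repo, n)
```

The same aggregation over billions of records is what a database such as ClickHouse is meant for; the loop above only shows the logic on a handful of rows.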

Read the article