Analyzing node.js on GitHub with BigQuery
· 6 min read
As someone who works on developer tooling - GitHub is the holy grail of data sets. There's just so much code out there, written by so many people, for so many reasons. I've often wished I could just clone all of the data on GitHub, and then write scripts to process the data for various reasons:
- What are the top 1k npm modules used with Node.js apps? We want to know this so we can test them with App Engine.