With the recent expansion of Tableau Public to accommodate data sources up to #AMillionRows, we kicked off "Data Month" here at Tableau. Earlier this week, Tableau Public data analyst Jewel Loree gave some sound words of advice about working with these larger data sets, and now I'd like to provide some resources for those of you looking to find data to play with.
The problem isn't finding sources of data, it's narrowing down what's out there.
With the Open Data movement, governments at various levels around the world have created portals where interested parties can access data about that specific part of the world. The United States government provides a list of 292 Open Data sites around the world, and we created a viz that allows you to find one that you're interested in:
If what you're looking for are data sets greater than the previous data limit of 100,000 rows but less than the current limit of 1,000,000 rows, here are a few that we've found for you:
2012 UK Road Safety data (145K accidents) - data | website
2011 U.S. Medicare inpatient charge data (170K rows) - data | website
2011 NYPD “Stop-and-Frisk” data (685K incidents) - data | website
Recently launched Quandl aims to be the Google of quantitative information. We tried it out, and it's pretty impressive. Enter something like "Price of Gold" in the search box and you'll see a line chart and raw data of the price of gold going all the way back to the late 1960's, with options to download the raw data in a number of formats.
“Like monks must have done when printing presses began producing books for the masses, many priests of business intelligence will stand aside, arms folded in the aspe chapel. But I predict that before long even they will appreciate a wider, deeper pool of analytical talent ripening for training and employment (from Tableau Public).”