Comments

Data, data, everywhere

With the recent expansion of Tableau Public to accommodate data sources up to #AMillionRows, we kicked off "Data Month" here at Tableau. Earlier this week, Tableau Public data analyst Jewel Loree gave some sound words of advice about working with these larger data sets, and now I'd like to provide some resources for those of you looking to find data to play with.

The problem isn't finding sources of data, it's narrowing down what's out there.

With the Open Data movement, governments at various levels around the world have created portals where interested parties can access data about that specific part of the world. The United States government provides a list of 292 Open Data sites around the world, and we created a viz that allows you to find one that you're interested in:

We also have a Data Catalog on our site where we've provided links to a number of data sites, including non-government sites that provide data about other subjects like sports and economics:

If what you're looking for are data sets greater than the previous data limit of 100,000 rows but less than the current limit of 1,000,000 rows, here are a few that we've found for you:

  • 2012 UK Road Safety data (145K accidents) - data | website
  • 2011 U.S. Medicare inpatient charge data (170K rows) - data | website
  • 2011 NYPD “Stop-and-Frisk” data (685K incidents) - data | website

Recently launched Quandl aims to be the Google of quantitative information. We tried it out, and it's pretty impressive. Enter something like "Price of Gold" in the search box and you'll see a line chart and raw data of the price of gold going all the way back to the late 1960's, with options to download the raw data in a number of formats.

Quora has a great discussion thread under "Where can I find large datasets open to the public" that's worth scanning down, and there's also a reddit site at /r/datasets that can be helpful.

Lastly, we're keeping an eye on the second coming of Data.gov over at Next.Data.Gov. It will be interesting to see whether this new site delivers on the promise of the first.

If you know of any other great sources of freely available online data, please feel free to leave a comment so we all can benefit.

Happy hunting!
Ben

Comments

Nice article. I have been looking for it and it help alot. taxi service sudbury ma

Thanks for taking the time to post about the Texas Comptroller's office! I look forward to diving into the data available there.

The Texas Comptroller's Office has had an open data center for several years and we have just upgraded it.

http://www.texastransparency.org/opendata/index.php

We also have a site where we collect and analyze academic, financial and demographic data from all the public schools and districts in the state. Those files are available for download in the Open Data Center and here:

http://www.fastexas.org/results/downloads.php

non-humans click here