We've seen the map of where everyone lives. Now here's the reverse of that by Nik Freeman: where nobody lives in the United States.
Sebastian Raschka offers a step-by-step tutorial for a principal component analysis in Python. The main purposes of a principal component analysis are the analysis of data to identify patterns and finding patterns to reduce the dimensions of the dataset with minimal loss of information.
As a lesson on conditional probability for himself, Walt Hickey watched 403 episodes of "The Joy of Painting" with Bob Ross, tagged them with keywords on what Ross painted, and examined Ross's tendencies.
This chart-map-looking thing from Nightly News is making the rounds, and it's not good. I'm opening the comments below for critique so that you can release your angst.
Earthquakes are in the news a lot lately. A quick search shows a 7.6 off the coast of the Solomon Islands, a 6.6 in Nicaragua, and a 7.1 off the southwest coast of Papua New Guinea, and this was just last week.
This year's polar vortex churned up some global warming skeptics, but as we know, it's more useful to look at trends over significant spans of time than isolated events.
Pornhub continues their analysis of porn viewing demographics in their latest comparison of pageviews per capita between red and blue states (SFW for most, I think).
Looking for a job in data science, visualization, or statistics? There are openings on the board. Digital Designer, Editorial Content for Bauer Media in Central London.
The American Community Survey, an ongoing survey that the Census administers to millions per year, provides detailed information about how Americans live now and decades ago.
Stephen Pettigrew, for Regressing, compared 11 million brackets on ESPN.com against those of pundits.
Open data consultancy Conveyal released Disser, a command-line tool to disaggregate geographic data to show more details.
As part of the You Are Here project from the MIT Media Lab, an exploration of independent coffee shops in San Francisco: Independent coffee shops are positive markers of a living community.
Tabula, by Manuel Aristarán, came out months ago, but I've been poking at government data recently and came back to this useful piece of free software to get the data tables out of countless free-floating PDF files.
FloatingSheep pointed their Twitter geography towards beer (and wine). From Sam Adams in New England to Yuengling in Pennsylvania to Grain Belt and Schlitz in the upper Midwest, these beers are quite clearly associated with particular places.
Because Fox News. [Thanks, Meron]
Tim Harford for Financial Times on big data and how the same problems for small data still apply: The multiple-comparisons problem arises when a researcher looks at many possible patterns.
The New York Public Library announced open access to 20,000 maps, making them free to download and use.
From Cakecrumbs, a product that helps you learn while you eat: planetary layer cakes. The graduate student slash baker hobbyist's sister asked if she could make one, and at first she thought it couldn't be done.
Citi Bike, also known as NYC Bike Share, is releasing monthly data dumps for station check-outs and check-ins, which gives you a sense of where and when people move about the city.
Remember the Million Dollar Homepage from 2005? It sold ad space to anyone who was interested for one dollar per pixel, and there were one million pixels available.