Cicadian Rhythm

Cicadas are loud. And while some are around every year, there is at least one species that lives for up to seventeen years. They mate every seventeen years. In 2013 we are witnessing the emergence of Brood II, one of the numerous clusters that are synchronised to each other. But when and where have other broods been spotted over time? The New York Times looked at this question last week via an interactive graphic.

Brood II Cicadas
Brood II Cicadas

Credit for the piece goes to Jonathan Corum.

Comparing Medical Cost Comparisons

Yesterday both the New York Times and the Washington Post published fascinating pieces looking at the difference in the cost of medical procedures. But each took a different approach.

I want to start with the New York Times, which focused at the hospital level because the data is available at that level of granularity. They created a geo-tagged map where hospitals were colour-coded by whether their bills were below, slightly above, or significantly above the US average.

Hospitals across the United States
Hospitals across the United States

The ability to search for a specific town allows people to search for their hometown, state, country and then compare that to everyone else. My hometown of West Chester, Pennsylvania is fortunate—or perhaps not—to have several hospitals in the area that charge at different rates. That makes for an interesting story. But I am from the densely populated East Coast and someone from say rural Montana might not have the same sort of interesting view.

Hospitals near West Chester, Pennsylvania
Hospitals near West Chester, Pennsylvania

Regardless of the potential for uninteresting small-area comparisons, once you find your hospital, you can click it to bring up detailed statistics for procedures, costs, and comparisons to the average.

Brandywine Hospital's data
Brandywine Hospital's data

All of this makes for a very granular and very detailed breakdown of hospital versus hospital coverage. But what if you want something broader? What good is comparing Brandywine Hospital to some medical centre in Chicago? Neither is reflective of the healthcare industry in the Philadelphia area or the Chicago area, let alone Pennsylvania or Illinois. The Washington Post tackles this broader comparison.

The Post leads off with a hospital-level example from Miami. Two hospitals on one street have vastly different prices. If we knew about this in Miami we could surely find that in the New York Times map. Instead, the Post guides us to that kind of example.

Comparing two hospitals in Miami
Comparing two hospitals in Miami

But the broader view is the centre of the piece. Using dot plots and filters, the user can compare the state averages for 10 different medical procedures. Fixed to the plot are the minimum and maximum averages along with the national average. And given the Post’s smaller circulation area—the New York Times is national, the Post is less so—there are quick links to states of particular interest: DC, Maryland, and Virginia.

Pennsylvan's averages
Pennsylvan's averages

The ability to pick different states from the drop down menu allows the user to quickly see differences between states. What is lacking is perhaps a quick view of where all the states are visible so that the user does not have to click through each individual state.

California's averages
California's averages

Both pieces are very successful at their narrowly-focused aims. Neither tries to do everything all at once, but nor would their designs allow for it. Plotting and filtering all the hospitals could be done in the Post’s style, but it would be messy. The state averages could all be made to colour state shape files, but you would lose the inter-procedure differences, the minimums, maximums, and the averages. In short the two pieces from the two teams complement each other very well, but a weird and hybrid-y cross of the two would be large, cumbersome, and potentially difficult to use without spending a lot of time to design and develop the solution. (Which I imagine they did not have.)

Credit for the piece from the New York Times goes to Matthew Bloch, Amanda Cox, Jo Craven McGinty, and Matthew Ericson.

Credit for the piece from the Washington Post goes to Wilson Andrews, Darla Cameron, and Dan Keating.

The Gap in University Admissions

The New York Times has recently done good work with interactive infographics that weave a narrative through their chosen form of data visualisation. I covered one such work back in February that looked at girls in science. Today, a similarly structured piece looks at university admissions and graduation rates for ethnic minorities.

Admissions Gap at Universities
Admissions Gap at Universities

Navigation in the top-right guides the user through the story with key schools highlighted. Of course at any time the user can dive into the data and find specific schools that interest them. Overall the piece is less about data exploration, however, and instead merely uses the wealth of data to paint a context for the broader narrative.

Credit for the piece goes to Josh Keller.

Nate Silver Predicts the Presidential Election

Of 2048. Well, kind of. Lately the country has been talking a lot about immigration and its impacts because of this bipartisan desire to achieve some kind of result on an immigration bill working its way through the Senate. One of the common thoughts is that if we legalise a whole bunch of illegals or document most of the undocumented (I’ll leave the language for you to decide), the new American citizens will overwhelmingly vote Democratic and there goes the Republic(an Party).

Nate Silver—yes, that Nate Silver who accurately predicted the presidential results and a whole bunch of other stuff too—looked at a more complex and more nuanced set of demographic variables and found that the aforementioned argument greatly oversimplifies the results. The problem is not entirely the entry of newly documented or illegal workers. Instead, there are systemic demographic issues.

So here comes the New York Times with an excellent data explorer and forecast modeller. You can set the year to examine and then set the results of the immigration debate with how many immigrants are made legal/documented and then how many of them vote. After that you can begin to adjust population growth, voting patterns, &c. to see how those affect the elections. (The obvious caveats of acts of god, party platforms, candidates, &c. all hold.)

2048 Results
2048 Results

The fascinating bit is that if you keep the demographic patterns as they are currently, adjusting the immigration factors at the outset have very little impact on the results. The country is moving towards the current Democratic platform. Even if 0% of the undocumented/illegal immigrants become documented/legal, and if 0% of 0% vote, the result is still a landslide for the Democrats. The real changes begin to happen if you adjust the population growth rates of the legal/documented citizens and voters. But those patterns/behaviours are a lot more difficult to adjust since you can’t legislate people to have more babies.

All in all a fascinating piece from the New York Times. The controls are fairly intuitive, drag sliders to adjust percentages. The sliders have clear labels. And the results on the map are instantaneous. Perhaps the only quirk is that the ranges of the colours are not detailed. But that might be a function of forecasting the data so far into the future and having growing ranges of certainty.

Credit for the piece goes to Matthew Bloch, Josh Keller, and Nate Silver.

Analysing Your (Facebook) Social Networks

Earlier this week, Wolfram Alpha released some findings from its analytics project on Facebook. While the results offer quite a bit to digest, the use of some data visualisation makes it a little bit easier. And a lot more interesting.

The results offer quite a bit of detail on interests, relationship statuses, geographic locations, and ages. Below is just one of the small multiple sets, this one looks at the number of friends of different ages for people of different ages. Basically, how many young or old people are friends of young people? Friends of old people?

Friends of Friends for the Ages
Friends of Friends for the Ages

But I was most interested in the analysis of social networks. The mosaic below is indicative of the sheer size of the survey, but also begins to hint at the variance in the social structures of the data donors.

Just Some of the Networks
Just Some of the Networks

While these views are all neat, where it begins to get really interesting is Wolfram Alpha’s work on classifying the different types of social networks. By aggregating and averaging out clusters, simple forms begin to emerge. And after those forms emerged, they were quantified and the results are a simple bar chart showing the distribution of the different types of networks.

Simplified Cluster Distribution
Simplified Cluster Distribution

Overall, some very interesting work. But one might naturally wonder how their own networks are structured. Or just be curious to look at the data visualisation of their own Facebook profile. Or maybe only some of us would. Fortunately, you still can link your account to a Wolfram Alpha account (you have to pay for advanced features, however) and get a report. Below is the result of my network, for those who know me semi-well I have labelled the different clusters to show just how the clustering works.

My Social Networks
My Social Networks

Credit for the piece goes to Wolfram Alpha.

Recapping the Boston Marathon Bombing

After the capture this weekend of the second suspect in the Boston Marathon bombing, the Washington Post published an interactive piece that looked at the entirety of the story. It captured the bombing, looked at the investigation, then the manhunt, and finally the capture of one of the suspects.

The piece incorporated static diagrams along with video and interactive navigation to tell the story in day-sized chunks on the screen. When taken together as one whole piece, it is quite impressive.

The capture of one of the suspects
The capture of one of the suspects

Credit for the piece goes to what may well be the entire graphics staff of the Washington Post: Wilson Andrews, Darla Cameron, Emily Chow, Alberto Cuadra, Kat Downs, Laris Karklis, Todd Lindeman, Katie Park, Gene Thorp, and Karen Yourish.

The Universe Keeps Getting Bigger

The Kepler observatory is responsible for finding Earth-like planets in distant solar systems. It was launched only in 2009, but has been incredibly successful. Earlier this week scientists announced the discovery of Kepler 62, a star system that has five planets. Two of those planets exist within the Goldilocks zone, where conditions are just right for habitable planets (for Earth-like organisms) to form and exist. Of course, not all planets in such zones are habitable, look at Venus and Mars for examples. But still, the news is quite significant.

Over at the New York Times, Jonathan Corum plotted all the data on all the systems so far discovered by Kepler, including that new information on Kepler 62. The result is a mesmerising view of star systems beyond our own. The stars are planets are enlarged for visibility and the orbits are made a bit more circular, but the overview is still fantastic.

The chart shows the relative sizes of the stars and their temperatures and allows you to compare the orbits of the planets so far known. You can also sort the chart either by size or time of discovery. It also shows the relative times of the planets’ orbits. That is, they move…

The Kepler star systems
The Kepler star systems

Fans of this will remember that in 2011, the New York Times used a similar, albeit static, method to explain the discovery of planets at Kepler 20, whose planets all orbit closer to their star than Mercury to ours.

Credit for the piece goes to Jonathan Corum.

The Boston Marathon Bombing in More Detail

Today’s post comes from the New York Times and offers more detail on the twin terror bombings Monday. While the topic is surely gruesome, the interactive graphic is clean without the inclusion of photos or videos of the violence. It focuses on the facts without the fanfare or sensationalism to which we are accustomed from a media that too often needs it to draw and sustain viewership. A sober and informative piece from the New York Times.

Details of the Boston Marathon Bombing
Details of the Boston Marathon Bombing

Credit for the piece goes to the graphics department of the Times.

Cartograms

Continuing this week’s map theme, we have an example of a cartogram from the New York Times. This piece supplements an article about how some manufacturing companies are starting to look away from China as a place for their facilities. There are two maps, the first (not shown here) looks at economic output overall. The second (below) takes that output and accounts for population.

GDP per capita
GDP per capita

Hexagons are used instead of the more familiar squares to represent 500,000 people and the colour is the GDP per capita. The text accompanying the graphic explains how this is a measure of economic potential being (or not being) realised. But what the hexagons allow the map to do is better represent the shapes of the countries. Squares, more common in cartograms, create awkward box-like outlines of countries. That would be fine if countries were often shaped like squares, but they are not.

I am not often a fan of cartograms, but I find this one well executed and the annotations and explanatory text make what might otherwise be confusing far simpler to understand. All in all, a solid piece.

Credit for the piece goes to Mike Bostock and Keith Bradsher.

Women in the US Senate

Women are half the population, but only twenty percent of the upper chamber of the United States Congress. As this great interactive timeline from the New York Times shows, at least that inequity has been narrowing over the last several elections.

The infographic comes in two main views. The first highlights women in the Senate and assigns them chronologically and then colours them by party. Important or notable senators are annotated appropriately. This view also shows the breakdown of women in the Senate at any one time in the small chart in the upper-left. Mousing over senators then provides a little bit of information about the woman in question.

Women in the US Senate
Women in the US Senate

But to put it in perspective, by selecting All Senators, the user can see the whole elected history of the US Senate—remember, prior to the 1920s you did not directly elect senators. That makes for a lot of grey bars, i.e. a lot of men.

Men and Women in the US Senate
Men and Women in the US Senate

Credit for the piece goes to Hannah Fairfield, Alan McLean and Derek Willis.