Is Bryce Harper Another Ryan Howard?

No. Definitely not. But, the position of this article by FiveThirtyEight is that the Phillies, the Philadelphia baseball team that just made the largest guaranteed contract in North American sports, may have purchased the rights to somebody who is a few years past his prime.

The author tracked the performance of similar baseball players over history and found that they peaked earlier and tailed off earlier.

Ken Griffey Jr.'s swing though…
Ken Griffey Jr.’s swing though…

Now, the obvious thing about this graphic that I dislike is the spaghetti-fication of the lines. What does help alleviate it, however, are the greying and lighter weight of the non-identified lines in the background. Interestingly, they are even lighter than the axes’ value lines. There is also a thin outline to the lines that helps them standout against each other.

I also wonder if a few more added benchmark lines would be useful. Elite seasons are defined as those with 8+ wins above replacement (WAR), an advanced measurement statistic. Could that level not be indicated with a line on the y-axis? What about the age of 26, before which the players would have had to produce one and only one 8+ WAR season to be eligible for the data set?

Of course, as I said at the beginning, the answer to this post’s title is no. Harper will make the Phillies a better team and the length of his contract will not be the albatross that was Ryan Howard’s. However, the Phillies may be paying for 13 years of subprime Harper.

Credit for the piece goes to the FiveThirtyEight graphics department.

The Stunted Growth of North Korea

This piece from the BBC is a few years old, but it provides some interesting nuggets about North Korea. Unsurprisingly it appeared on my radar because of the coverage of the Trump–Kim summit in Vietnam. The article says it is nine charts that tell you all you need to know about North Korea. Now, I do not think that is quite true, but it does contain the following graphic—I hesitate to call it a chart—that illustrates one of my favourite details.

It's just a matter of inches
It’s just a matter of inches

The two figures illustrate the average height of a person from North Korea and then South Korea. What do you see? That the North Korean is shorter. This is despite the fact that the populations were the same just a few decades ago. The impact of years of malnutrition, undernourishment, and general lack of well-being have manifested themselves in the physical reduction of size of human beings compared to their nearly identical population to the south.

Thankfully the rest of the piece contains data on things like GDP, birth rates, and life expectancy. So there are some things in there that one should know about North Korea. As much as I find the story of height interesting, I struggle to think it is one of the nine things you should really know about the state.

Credit for the piece goes to Mark Bryson, Gerry Fletcher, and Prina Shah.

Where’s All the Oil Going?

Hint: not China.

Today’s piece is a nice little graphic from the Economist about the oil and natural gas industry in the United States. We have a bar chart that does a great job showing just how precipitous the decline in Chinese purchases of oil and liquid natural gas has been. Why the drop off? That would be the trade war.

Will they take it? For all the tea in China?
Will they take it? For all the tea in China?

The second graphic, on the right, is far more interesting. The data comes from BP, so the proverbial grain of salt, but it compares expected GDP and demand for energy by source from a baseline model of pre-Trumpian trade war policies to a future of “less globalisation”. Shockingly (sarcasm), the world is worse off when global trade is hindered.

You all know where I stand on stacked bar charts. They are better than pie charts, but still not my favourite. If I really want to dig in and look at the change to, say, coal demand, I cannot. I have to mentally remove that yellow-y bit from the bottom of the bar and reposition to the 0 baseline. Or, I could simply have coal as a separate bar next to the other energy sources.

Credit for the piece goes to the Economist Data Team.

The Bill Barr Bifurcation

So today’s piece is not a revolutionary piece of information design, but it is fascinating. For two or so years now, we have all heard about the Robert Mueller investigation into potential contacts between the Trump campaign, early administration, and the administration of Russian president Vladimir Putin.

To be clear, thus far, this has been an incredibly productive special counsel.

34: the number of indictments

6: guilty pleas from associates of the Trump campaign

But what happens when the whole thing is done, especially since prevailing Justice Department rules state sitting presidents cannot be indicted? Well to answer that, we have this piece from the Washington Post.

I'm hoping the report isn't…Barred…
I’m hoping the report isn’t…Barred…

Ultimately it is nothing more than a flow chart broken into pieces, separated by a textual narrative explaining the process. Now, I’m not certain how critical to the design each headshot is—especially Barr’s that looks especially frowny faced. However, the context in the above screenshot is crucial. The public does not necessarily have the right to the findings of the report if individuals in the report are not charged.

This means that design wise, we are looking at snippets of a larger chart interspersed with text. I would be interested to see the entire thing stitched together, but the textual breaks make a lot of sense. Overall, much like the sports pieces we looked at recently, this does a nice job of weaving textual story together with information design or data-driven content.

Credit for the piece goes to Dan Keating and Aaron Steckelberg.

Individualistic Immigrants

As many of you know, genealogy and family history is a topic that interests me greatly. This past weekend I spent quite a bit of time trying to sort through a puzzle—though I am not yet finished. It centred on identifying the correct lineages of a family living in a remote part of western Pennsylvania. The problem is the surname was prevalent if not common—something to be expected if just one family unit has 13 kids—and that the first names given to the children were often the same across family units. Combine that with some less than extensive records, at least those available online, and you are left with a mess. The biggest hiccup was the commonality of the names, however. It’s easier to track a Quinton Smith than a John Smith.

Taking a break from that for a bit yesterday, I was reminded of this piece from the Economist about two weeks ago. It looked at the individualism of the United States and how that might track with names. The article is a fascinating read on how the commonness or lack thereof for Danish names can be used as a proxy to measure the individualism of migrants to the United States in the 19th century. It then compares that to those who remained behind and the commonness of their names.

But where are the Brendans?
But where are the Brendans?

The scatter plot above is what the piece uses to introduce the reader to the narrative. And it is what it is, a solid scatter plot with a line of best fit for a select group of rich countries. But further on in the piece, the designers opted for some interesting dot plots and bar charts to showcase the dataset.

Now I do have some issues with the methodology. Would this hold up for Irish, English, German, or Italian immigrants in the 19th century? What about non-European immigrants? Nonetheless it is a fascinating idea.

Credit for the piece goes to the Economist Data Team.

Are Baseball’s Big Contracts Worth It?

On Tuesday the San Diego Padres signed Manny Machado to a guaranteed contract worth $300 million over the next ten years—though he can opt out after five years. Machado was one of two big free agents on the market, the other being Bryce Harper. One question out there is whether or not these big contracts will be worth it for the signing teams. This piece yesterday from the New York Times tries to look at those contracts and how the players performed during them.

Oh, David Price…
Oh, David Price…

Like the piece we looked at Tuesday, this takes a narrative approach instead of a data exploratory approach—the screenshot above is halfway through the read. Unlike the Post piece, this one does not allow users to explore the data. Unlabelled dots do not reveal the player and there is no way to know who they are.

Overall it is a very strong piece that shows how large and long contracts are risky for baseball teams. The next big question is where, for how long, and how much will Bryce Harper sign?

Credit for the piece goes to Joe Ward and Jeremy Bowers.

Trump Keeps Attacking the Special Counsel

Yesterday the New York Times published a fascinating piece looking at the data on how often President Trump has gone after the Special Counsel’s investigation. (Spoiler: over 1100 times.) It makes use of a number of curvy line charts showing the peaks of mentions of topics and people, e.g. Jeff Sessions. But my favourite element was this timeline.

All the dots. So many dots.
All the dots. So many dots.

It’s nothing crazy or fancy, but simple small multiples of a calendar format. The date and the month are not particular important, but rather the frequency of the appearances of the red dots. And often they appear, especially last summer.

Credit for the piece goes to Larry Buchanan and Karen Yourish.

Be Like Mike?

Back in 2012 the New York Times ran what is a classic data visualisation piece on Mariano Rivera. It tracked the number of saves the legendary Yankees closer had over his career and showed just how ridiculous that number was—and how quickly he had attained it. Last week, the Washington Post ran a piece that did something very similar about LeBron James, a future basketball legend, and Michael Jordan, definitely a basketball legend.

They might have game.
They might have game.

The key part of the piece is the line chart tracking points scored, screenshot above. It takes the same approach as the Rivera piece, but instead tracks scored points. Unlike the Rivera piece, which was more “dashboard” like in its appearance and function, allowing users to explore a dataset, this is more narratively constructed. The user scrolls through and reads the story the authors want you to read. Thankfully, for those who might be more interested in exploring the dataset, the interactivity remains intact as the user scrolls down the article.

While the main thrust of the piece is the line chart, it does offer a few other bar and line charts to put James’ career into perspective relative to the changing nature of NBA games. The line chart breaking down the composition of James’ scoring on a yearly basis is particularly fascinating.

But, don’t ask me about how he fits into the history of basketball or how he truly compares to Michael Jordan. Basketball isn’t my sport. But this is a great piece overall.

Credit for the piece goes to Armand Emamdjomeh and Ben Golliver.

A Macedonia by any Other Name

As someone who loves geography and maps, I have plenty of printed atlases and map books. One year, as a gift, my family gave me an early 20th century atlas. That one in particular is remarkable because of how much the world changed between 1921 and 2019—what was French West Africa is now several independent countries.

But our maps may be changing again as Greece has now formally recognised the Former Yugoslav Republic of Macedonia as North Macedonia, what most of the world simply calls Macedonia. But Greeks do not want you to confuse that Macedonia with the Macedonia (or Macedon) of Ancient Greece and Alexander the Great. The squabble over the name has prevented what will be North Macedonia from joining the European Union and NATO because of Greek objections.

As the Economist recently showed, however, it might take a little while before the name Macedonia catches on with the public at large. (Note, I intended to type North Macedonia but instead went with Macedonia. I opted to leave it incorrect just to show how difficult it will be.)

What's in a name?
What’s in a name?

The plot uses my favourite small multiples to look at six countries whose names have changed. Some of you may be unfamiliar with the originals. Bechuanaland may be the most obscure, but Burma and Ceylon may be far more familiar. Of course the historian in me then wonders why the mentions of countries spiked in books. But small multiples are usually not the place to do detailed annotations to humour an audience of one.

In terms of its design, we have an effective use of colour and line. I may have dropped the thin red line for the max 100 value as it makes the piece a bit busy overall, but that might just be house style.

Of course for this graphic in particular, we will have to wait several years before we can add Macedonia/North Macedonia to the plot.

Credit for the piece goes to the Economist Data Team.

Regionalisms for a Drink

We made it to the end of the week, everybody. And so now we get a look an xkcd take on one of my favourite little pieces: regionalisms. We all use terms that are specific to the areas where we live or grew up. For example, here in Philadelphia we call a cured meat, cheese, lettuce, and tomato on a long roll a hoagie. In other places, they are called a sub.

xkcd’s piece looks at regionalisms for a carbonated beverage.

I drink Brad's elixir
I drink Brad’s elixir

Credit for the piece goes to Randall Munroe.