Family Migration Patterns

On Saturday I attended an all-day seminar by the New England Historic Genealogical Society (NEHGS) at the new Museum of the American Revolution here in Philadelphia. Just fantastic. One of the lectures included some maps that looked at the distribution of families over a year span—turns out families did not stand still. Instead, they tended to “fill in” the sparser areas near their settlements and as land became sparser, the younger sons with less to inherit began to move further west, primarily, but also sometimes north into the rest of the nascent United States.

1671, 1721, and 1771
1671, 1721, and 1771
By 1821, the families are spread quite a bit further
By 1821, the families are spread quite a bit further

In terms of design, these work in a black and white book, so we do not get any fancy colours. Consequently, the location markers are well chosen as distinct shapes. I also liked the limitation of state outlines to only those states where descendants were present to limit the amount of distracting black lines on the graphic.

Perhaps not surprisingly I then decided to take my own stab at something similar late last night and this morning. I looked at only one line of ancestors, the Millers, and their descendants with the caveat that there are very much indeed holes in lines of the cousins and second cousins that I have not followed yet. But those I have included show, to a lesser degree, that patterns of movement west and north. The key difference is I extended mine to 1900, because the pattern becomes a little bit clearer over time in my family. I also stopped writing out names of individuals and just started writing out families, because it gets out of control pretty quickly.

1650
1650
1700
1700
1750
1750
1800
1800
1850
1850
1900
1900

Credit for the originals goes to Lois Kimball Matthews.

Credit for the coloured one is mine and my Miller family ancestors and their descendants.

For Whom to Root

The World Cup continues. Well for a few teams. Some have already been eliminated from the Round of 16. But for those Americans rooting for Team America, well, if you have not yet figured it out, you got knocked out well before the World Cup even started by…Panama. And so you are stuck in the question of who’s next? Thankfully FiveThirtyEight, in addition to their fantastic live probabilities that we looked at the other day, put together a little quiz to help you find your new team.

You answer seven questions and you are told your new allegiance. Questions like this:

How would you answer?
How would you answer?

Naturally I took the quiz and discovered that in addition to England, I am cheering for…

Goal? Make that skål!
Goal? Make that skål!

Yep. Fantastic since I was just there in December and happened to love Stockholm. But what I love about this piece is how it uses data to create the newfound bond I have with Sweden. Often times you take a quiz and are given an answer without any sense of why the answer was correct. Here, FiveThirtyEight plots the seven different variables used to create your newfound personality and then shows you how you scored.

Right in the middle there
Right in the middle there

It’s Friday, it’s the World Cup. Have a great weekend. And in addition to England on Sunday, I’ll now be cheering for Sweden against Germany on Saturday.

Credit for the piece goes to Michael Caley, Rachael Dottle, Geoff Foster, Gus Wezerek, Daniel Levitt, Emily Scherer, and Jorge Lawerta.

World Cup Match Probabilities

The World Cup has had some impressive matches and some stunners. (And the two are not mutually exclusive.) But if you are like me and have to work during most of the broadcasts, how can you follow along? Well thankfully FiveThirtyEight put together a nice statistical model that provides the probability of a team winning—or drawing—in real time.

Looking pretty good for Portugal this morning…
Looking pretty good for Portugal this morning…

The design is fairly simple: a small table with the score and probability followed by a chart drawn as the match goes on. (Clearly I took this image at the half.)

I included a snippet of the table below to show the other work the FiveThirtyEight team put out there. You can explore the standings, the screenshot above, as well as the matches and then the brackets later in the competition.

The table makes nice use of the heat map approach to show is likely to make easy of the different stages of the competition. Like I said the other day, they are high on Brazil, because Brazil. But a little lower on Germany. But never count Germany out.

Shouldn't Iran be in the top slot?
Shouldn’t Iran be in the top slot?

The only unclear thing to me in the table? The sorting mechanism. In Group B, at least whilst the Portugal match is ongoing, should probably have Iran at the top. After all, as of writing, it is the only team in the group to have won a match. The only thing I can guess is that it has to do with an overall likelihood to advance to the next round. I highly doubt that Iran will defeat either Spain or Portugal. But as with many knockout-style championships, anything can happen in a single match sample size.

Credit for the piece goes to Jay Boice, Rachael Dottle,Andrei Scheinkman, Gus Wezerek, and Julia Wolfe.

When the Whole Is Less Than the Sum of its Parts

Last week we talked a lot about trade—and we will get back to it. But the World Cup is now in full swing and I want to take a look at a couple of things this week. But to begin, the Economist published an article about the difficulty of predicting the outcome of World Cups. It looks at the quirks of random events alongside more quantitative things like ranking systems and their differences.

But one graphic in particular caught my attention. It explore the difference between the ranking in individual players versus the teams as a whole. In short, some teams are valued more highly than their constituent players and others vice versa. The graphic is fairly straightforward in that it plots the team value on the y-axis and the players’ on the x.

When sums are greater or less than the whole…
When sums are greater or less than the whole…

Personally? I would never bet against Germany. Or Brazil.

But if your author is lucky, he’s going to enjoy the England–Tunisia match this afternoon for lunch—rooting for England, of course. Though thanks to some online tools that’s not the only team I’m rooting for this year. But more on that later this week.

Credit for the piece goes to the Economist graphics department.

It’s Finally Sunny in Philadelphia (on the Weekend)

Here in Philadelphia, the weekend is forecast to be not rainy, which is a departure from the last several weeks. So this piece from xkcd’s Randall Munroe seemed appropriate.

That gap is also something like 150 million kilometres away
That gap is also something like 150 million kilometres away

Credit for the piece goes to Randall Munroe.

The World Cup Begins

If you live under a rock or in America, the World Cup starts today. (Go England.) So what else to have but a chart-driven piece from the BBC from last week about the World Cup. It features seven charts encapsulating the competition. But the one I want to focus on? It’s all about the host nations, in this case Russia.

To host, or not to host, that is the question of how much can you pay FIFA officials under the table…
To host, or not to host, that is the question of how much can you pay FIFA officials under the table…

On its design, I could go without the football icons to represent points on the dot plot, but I get it. (Though to be fair, they work well as icons depicting the particular World Cup event in another set of graphics elsewhere in the article.) In particular, I really like the decision to include the average difference between a host nation’s points in non-hosting matches vs. hosting matches.

It does look like the host nation scores more points per match than when they are not hosting. And that—shameless plug—reminds me of some work I did a few years back now looking at the Olympics and the host nation advantage in that global competition.

Credit for the piece goes to the BBC Data Team.

Trade with Canada

Yesterday we looked at trade with China. Today, we look at Canada, allegedly ripping off America. But what does the data say? Thankfully the Washington Post put together a piece looking at just that topic. And it uses a few interesting graphics to explore the idea.

The easiest and least controversial graphic is that below, which breaks down constituent parts of our bilateral trade.

The article also points out that very small dairy section, which is one focus of the administration's complaints. But look how tiny it is…
The article also points out that very small dairy section, which is one focus of the administration’s complaints. But look how tiny it is…

Note that the graphic does not just show the traditional goods part of the equation, but also breaks out services. And as soon as you consider that part of the economy the US trade deficit with Canada turns from deficit into surplus.

But the graphic also uses a pair of maps to look at that same goods vs. goods and services split.

The centre of it all…
The centre of it all…

Parts of the design of the map like the colours, meh. But the designers did a great job by breaking the standard convention of placing the Prime Meridian at the centre of the map. Instead, because the United States is the story here, the map places North America at the map’s centre. It does lead to a weird fracturing of the Asian continent, but so long as China is largely intact, that is all that matters to the trade story.

This all just goes to show that it is important to begin a conversation about policy with facts and understand the actual starting point rather than the perceived starting point.

Credit for the piece goes to Philip Bump.

Tariffs and Trade with China

Following up on yesterday’s post about the facts on tariffs, today we look at an article from Politico that polled voters on their feelings about trade and trade policy. Now the poll dates from the beginning of June and unfortunately a lot of things have changed since then. But, the data overwhelmingly supports the conclusion that voters, at that time at least, do not support placing tariffs on goods coming into the US.

Let’s take a look at another component of the article, however, a chart exploring the infamous trade deficit. First of all, trade deficits do not work like how the president says they do—but we will come back to that in another post. In short, trade deficits are neither good nor bad. They are just one way of describing one facet of a trade relationship between two countries.

This piece looks at the trade balance between the United States and China.

We will get into why this isn't all bad in another post
We will get into why this isn’t all bad in another post

Now, from the topical standpoint, it does a really nice job of showcasing how our imports have surged above our experts. From a topical standpoint, however, we do not know if this is a total trade deficit or just in goods, like the president prefers to talk about, or in goods and services, the latter of which accounts for way more than half of the US economy.

From a design perspective, I have a few thoughts and the first is labelling. The chart does label the endpoints of the data set, 1985 and 2017. But aside from a grey bar representing the Financial Crisis, there are few other markers to indicate the year. In smaller charts, I often do this myself, because space. But here there is enough space for at least a few intervening years to be labelled.

Secondly, the white outline of the red line. I have talked before of a trend to showcase a line over other lines with that thin stroke. But this is the first time I can recall the effect being used over an area filled with colour. Is it necessary? Because the area is light and the line dark and bright, probably not.

Then the outline appears on the text in the graphic, in particular the labels of imports, exports, and the trade deficit label. The labels for the imports and exports likely are necessary because of that light grey used for the text. But, as with the line for the trade deficit, its label likely provides sufficient contrast the thin white outline isn’t necessary.

Credit for the piece goes to Jeremy C.F. Lin.

The Facts on Tariffs

Unless you avoid the news, we all heard a lot about tariffs this weekend. So this morning, instead of going with some other things I found, I decided I wanted to look and see just what the data is on tariffs. Turns out Trump is wrong on the data about tariffs. In short, in 2016 the US had a slightly higher average tariff for all products at 1.61%. The EU was at 1.6%. And the Canadians? They charged an outrageous 0.8%.

Apologies for the length on this one
Apologies for the length on this one

The data comes from the World Bank.

And over breakfast, I did not really have the time to clean this graphic up, so it shows the whole world. Though it goes to show you, the western countries against which Trump raged this weekend generally have low tariffs, some lower than what the US.

Credit for this one is mine.

Philly Rules

Yo. C’mon, bro. This jawn is getting tired. Just stop already.

If you did not catch it this week, the most important news was Donald Trump disinviting the Super Bowl champions Eagles to the White House to celebrate their victory over the Patriots. He then lied about Eagles players kneeling during the US anthem—no player did during the 2017 season. He then claimed that the Eagles abandoned their fans. Yeah, good luck convincing the city of that.

So naturally we have a Friday graphic for youse.

That's 25,304.
That’s 25,304.

Full disclosure: I root for the Patriots. But I mean, seriously, can’t youse guys do the math?