My Irish Heritage

This week began with Saint Patrick’s Day, a day that here in the States celebrates Ireland and Irish heritage. And I have an abundance of that. As we saw in a post earlier this year about some new genetic ancestry results, Ireland accounts for approximately 2/3 of my ancestry. But as many of my readers know, actual records-based genealogy is one of my big hobbies and so for this Saint Patrick’s Day, I decided to create a few graphics to capture all my current research on my family’s Irish heritage.

In the current political climate wherein we hyperfixate on immigration, I started with my ancestors’ immigration to North America.

My graphic features a timeline marking when certain ancestors arrived, with the massive caveat I do not know when all my Irish ancestors arrived. I separate the ancestors into paternal and maternal lines. My maternal lines are only half Irish, and unfortunately most of them offer little in terms of early records or origins and so the bulk of the graphic lands on my paternal lines.

I did sort out that two–four lines began in Canada and included them with orange dots. (The one couple married in Ireland shortly before setting sail for Canada. The other two lines married in Canada.) I also added a grey bar representing the length of the Great Famine. I suspect a number of my ancestors arrived during the famine based on the fact they begin to appear in the records around 1850, but sadly none of those records state when they arrived specifically instead they just appear in the United States.

I also used filled vs. open dots to indicate whether or not I had primary source documents for arrivals. I.e., a passenger manifest, naturalisation papers, &c. that specifically details immigration information weighs more heavily as evidence than, say, a census record wherein a respondent can say he or she arrived in such a year. (Spoiler, census records are not infallible.)

The overall takeaway, most of my Irish immigrants, for whom I have information, arrived in the middle of the 19th century within a decade of the Great Famine.

The second graphic features even more difficult data to find. Whence did my ancestors come?

For those unfamiliar with Irish genealogy, finding the town or parish from which your ancestors hailed can be nigh impossible. To start, you need some kind of American-based record that gives you a clue as to where in Ireland to look—a county or city. From my experience, most records simply state places of birth as “Ireland”—not very helpful.

Then if you can get back to Ireland, the typical resource you might use in the United States, United Kingdom, and other countries is the census. And Ireland did record a census every ten years, beginning in 1821. Unfortunately 1861 and 1871 were destroyed shortly after the data was recorded. Then during World War I, the 1881 and 1891 censuses were pulped due to a paper shortage. Then in 1921, there was no census because of the whole Irish Civil War thing. Finally in 1922, during the Battle of Dublin in the whole Irish Civil War thing, the Public Records Office at the Four Courts, which held government records dating back hundreds of years as well as guns and ammunition, was blown up. And with the ammunition, so too was blown up the census records for 1821, 1831, 1841, and 1851. In short, genealogists only have access to census records for 1901 and 1911. (The 1926 census organised post-Civil War, does not become public until 2027.)

Then you have the whole unavailability of Catholic Church records, which is another long discussion about the conflict between Protestants and Catholics in Ireland. (Just a minor thing in Irish history.)

There are some civil public records available and they begin in the mid-19th century, which in many cases is just a bit too late for genealogical purposes.

Suffice it to say, Irish genealogy can be tricky and in 15 years of researching it myself, I have only been able to find the origins of 10 Irish immigrant ancestors. For context, to the best of my knowledge I have 18 Irish immigrant ancestors. Thus that map is very empty.

The second map of the United States and United Kingdom is more complete because more complete records. It maps the residences of my Irish and Irish-American ancestors. Initially I attempted to link all the towns and cities with arrows to show the migration patterns, alas it quickly became a mess at such a small scale. That remains a project for another day.

My Irish heritage is a thing of which I am proud, and I am glad to say my genealogy hobby has allowed me to explore it much more deeply and richly than a green-dyed pint would allow.

Credit for the pieces is mine.

113 Years Later and We’re Still Talking About Watertight Compartments

Earlier this week, a Portuguese-flagged cargo container ship collided with an American-flagged tanker just off the Humber estuary in Yorkshire, England. The American-flagged ship, the Stena Immaculate, carries aviation fuel for the US Air Force. The Solong, the Portuguese-flagged tanker, carries alcohol, which is far better than the toxic chemicals initially feared.

We still know very little about the circumstances of the collision other than the Solong, travelling at 16 knots, slammed into the port side of the Stena Immaculate, which was anchored offshore.

I decided to write a little post because I enjoyed this graphic from the BBC, which details why the Stena Immaculate has not yet sunk—and at the time of my writing is not believed to be in danger of—despite the large hold amidships.

The graphic uses a simple line illustration of a bulk carrier in both 3/4 and a frontal view. The first shows how vessels like the Stena Immaculate separate their cargo into distinct holds, often watertight, so that, should a collision occur, the damage will not flood the entire ship or affect the load of the cargo. For the latter, sloshing liquids, as one example, can alter the centre of gravity and negatively impact ship stability.

The second line drawing illustrates the value of a double-hulled vessel wherein the outer hull shields the inner hull from puncture and prevents massive flooding of interior spaces.

Of course on 11 March, we are a little over a month away from the anniversary of the sinking of RMS Titanic. (In)famously in that case the critical issue was the same idea of watertight compartments. She had enough of them, but crucially they did not rise to the top of the ship as they would have necessarily impacted the luxury of first and second class accommodations. Titanic also did not have a double hull—her bottom was, but this did not run up the ship’s sides to the level where the iceberg impacted the ship.

Overall, I really like this graphic. It needs no elaborate and detailed illustration. Nor does it need sophisticated animations. All it uses is simple line illustrations.

Credit for the piece goes to the BBC graphics department.

A Refreshed Look at My Ethnic Heritage

Late last week I received an update on my ethnic breakdown from My Heritage, a competitor of Ancestry.com and other genealogy/family history/genetic ancestry companies. For many years, the genealogical community had been waiting for this long-promised update. And it has finally arrived.

For my money, My Heritage’s older analysis, v0.95, did not align with my historical record research—something I have done for almost 15 years now. That DNA analysis painted me with an 85% heritage of Irish, Scottish, and Welsh. Because I have spent a decade and a half researching my ancestors, I know all of my second-great-grandparents, 16 total. 85% means 13–14 of them would be Irish, Scottish, or Welsh. However, four of them are Carpatho-Rusyns from present day eastern Slovakia. And nowhere in my research have I found any connection to the Baltic states or Finland.

Compare that to the update.

Here we have a drastically reduced Irish component that, importantly, has been split from Scottish and Welsh, which now exists as its own genetic group. The East European group appears too low, but perhaps My Heritage identified some of my Slavic ancestry as Balkan—there is a sizeable Carpatho-Rusyn community in Vojvodina, an autonomous oblast in Serbia. Maybe Germanic too? That would start to push it near to 20%.

I do have English ancestry—my Angophilia must come from somewhere—though it is relatively small and I can trace it to the Medieval period. That includes more of the Norman elite than the Anglo-Saxon plebs and so seeing Breton register could be indicative of that Norman/Anglo-Saxon population mixture.

But how does My Heritage results compare to those provided by Ancestry and FamilyTreeDNA, two competitors whose services I have also used. And how does it compare to my actual historical document research?

My Heritage’s newest analysis certainly hits a lot better and is nearer to Ancestry, which aligns best with my research. I do have two questions for my second-great-grandparents. One surrounds Nathaniel Miller, one of whose grandparents (Eliza Garrotson) may not be English but rather Dutch from the Dutch colonisation of the Hudson River Valley in New York south of Albany.

The other question revolves around William Doyle. His mother is identified in the records variously as English and Irish. A family story on that side of the family also suggests one ancestor of English descent. And finally, a recently discovered marriage record for his parents details how his mother (Martha Atkins) was baptised and converted to Catholicism as an adult prior to her marriage. Not all Irish are Catholic, but the vast majority are and that would also suggest Martha was not Irish.

Taking those two questions into account, I have a small range of expected values for my English ancestry and a slightly larger one for my Irish and you can see those in the graphic.

When you compare that to the My Heritage results alongside the Ancestry and FamilyTreeDNA results you can see Ancestry aligns best with my research whereas FamilyTreeDNA aligns the least. My Heritage now falls squarely between the two. And so I consider their update a success. I think the company still has some work to do, but progress is progress.

Credit for the pieces is mine.

Racing to the Final Finish Line

Thoroughbred racing is big business. And Philadelphia’s Parx Casino owns a racing track that, in a recent article in the Philadelphia Inquirer, has seen a number of horse deaths. The article includes a single graphic worth noting, a bar chart showing the thoroughbred death rate. The graphic contrasts rising deaths at Parx with a national trend of declining deaths.

Traditionally rate statistics are shown using dots or line. The idea is that a bar represents counting stats, i.e. how many total horses died. I understand the coloured bars present a more visually compelling graphic on the page, and so I can buy that reason if you are selling it.

Labelling each datapoint, however, with a grey text label above the bar remains unnecessary. They create sparkling, distracting grey baubles above the important blue bars. If you need the specificity to the hundredths degree, use a table. This graphic is also interactive. The mouseover state is where a specific number can be provided, adding an additional layer or level of depth in a progressive disclosure of information.

Credit for the piece goes to Dylan Purcell.

Titan’s Final Words

Last week wrapped up the Coast Guard’s two-week inquiry into the sinking of the submersible Titan, which imploded on a dive to the wreck of Titanic. The BBC summarised the findings in an article at the weekend. It included a number of fascinating annotated photographs identifying parts of the wreckage. But it also included the following graphic, which captures the text messages sent by the Titan and the depths at which the messages were sent.

This is significantly better than a number of pieces I have seen lately, to be fair, most of those focus on the dive depths of various objects and creatures. Mostly that is because the graphics—this one included—do not scale the objects to the depths. I understand the why; many would be too small to see. But I think that difference in scale really hits home just how deep Titanic rests on the seabed.

Because this graphic does not focus on the dive depths of objects, but rather the texts Titan sent at what depth, the scale issue is less relevant. Though, the weird bit is how Titanic sits below 3800 m. She rests at 3840 and that little dip on the sea floor looks closer to 400 m.

Overall, though, a solid piece.

Credit for the piece goes to the BBC’s graphics department.

Tired of These Motherf*cking Sox on This Motherf*cking Plane

At least, that’s what I imagine South Siders saying in Chicago as they watch the White Sox team charter plane land at Midway. For those not following America’s Major League Baseball season, the Chicago White Sox are one of two clubs claiming Chicago as their home. (The other being the Cubs.) And the White Sox—not to be confused with your author’s favourite club, the Red Sox—are on track to be one of the worst clubs in the modern (post-1900) history of the sport. They have already tied the New York Mets’ record of 120 losses and there are still six left to play.

Earlier this month the Athletic detailed what has gone wrong for the Pale Hose. One of the things that stood out to me the most in the reporting was the complaints about the club’s charter aircraft, an Airbus A320, as the article points out a 1980s aircraft. The article in particular mentioned how other cheapskate teams—including the Boston Red Sox—opt for nicer aircraft with more first-class accommodations for players and staff. Then they cited a graphic shared on Twitter last year by Jay Cuda and when I saw that, I knew I had to cover it.

One thing I find fascinating is how the White Sox use United Airlines for their charter. United Airlines operates the charter—as it does for the Cubs and other airlines. That it does so for the two Chicago teams makes all the sense in the world as the company is headquartered in the Loop in downtown Chicago. It is also one of the largest airlines and thus makes sense in that dimension too.

But as those frequent air travellers among you will know, Chicago has two airports: O’Hare and Midway. O’Hare in northwest of downtown and closer to the Cubs and is the city’s primary airport. But the White Sox typically fly out of Midway, which is just a couple miles from (New) Comiskey. (I presume the team bus hops on the Dan Ryan/I-90 to the Stevenson/I-55 then exits on Cicero.)

Weird because United does not service Midway. And so United, which operates out of O’Hare, must fly aircraft to Midway to then transport the White Sox. I suppose the White Sox would not want to charter a Southwest aircraft, though…. In my own lifetime I think I have flown in and out of Midway only twice. And I lived in Chicago for eight years. (And the White Sox were terrible for probably six of them.)

Some non-White Sox things notable from the graphic. One, iAero no longer exists, so I would be curious whom the Texas Rangers and Oakland Athletics used this year. The Rangers probably used a reputable airline. The Athletics probably made their players and staff charter their own transport.

I also did not realise that even last year the McDonnell Douglas MD-80 still carried passengers in the United States. I assume that by 2024, the Detroit Tigers have fully transitioned to that Boeing 737. I find it fascinating that only the Tigers own their own aircraft. I would be curious to know why more teams do not, though of course it has to be money.

With whom else would the Blue Jays fly but Air Canada?

Finally, I am surprised that my Boston Red Sox use Delta, because that’s a normal, non-budget airline. And anyone who follows the Red Sox know the Red Sox are no longer in the habit of spending money. I thought they would use jetBlue, which is the sponsor for Fenway South, formally jetBluePark, in Fort Myers, Florida, where the Red Sox have their spring training and development league complex.

Anyways, happy Friday, all. At least you don’t play or work for the Chicago White Sox. (Though I suppose it is possible you do, because I do have a large number of readers from Chicago. But I doubt it.)

Credit for the piece goes to Jay Cuda.

I Need My Sharpie. Where’s My Sharpie?

Because who does not recall the great Sharpie forecast track by the National Hurricane Center (NHC)?

Earlier this summer, in the middle of the hurricane season, the National Oceanic and Atmospheric Administration’s (NOAA’s) NHC released a new, experimental warning cone map. For those unfamiliar, these are the maps that have a white and white-shaded forecast for where the centre of the storm will track. Importantly, it is not a forecast of where the storm will impact. If you have ever been through a hurricane—would not recommend—you know you need not be near the centre to feel the storm’s impact.

I have been waiting for a significant storm to threaten the United States before taking a look at these. (It is also important to note, these new maps apply only to the United States.) But this is the current map for Hurricane Helene as of Wednesday morning.

For those of you who, like me, are familiar with these, you will see the red lines along the coast that indicate hurricane warnings. Blue lines indicate current tropical storm warnings. Not on this map are pink lines for hurricane watches and yellow lines for tropical storm watches. But all these lines only represent watches and warnings along the coast. Little dots indicate the storm’s forecast position at certain times and through letter indicators its strength. The full white areas are the forecast track for the centre of the storm through the first three days. The shaded area is for days 4–5.

Contrast that with the new, experimental version.

The background of the map remains the same. In my perfect world, I would probably drop the grey and blue back a little bit, but that is not the end of the world. Instead, the biggest change is that the tropical storm and hurricane watches and warnings, which have always been declared for full counties inland, are now shown on the map.

You can see the red hurricane warnings are now forecast to move through the eastern Florida panhandle and southern Georgia with tropical storm watches forecast for the inland counties north and east of those. And then the three- and five-day forecasts have blended into a single white cone track. Subtly, the stroke or outline for that has changed from black to solid white. That helps reduce the distracting visuals on the map and emphasise the forecast track and watches and warnings.

Overall, I think is a really strong and important and potentially life-saving improvement to the graphics. Could things be improved more? Absolutely. But sometimes the only way to make improvements is through slow and steady incremental changes. This update does that in spades.

Credit for the piece goes to the NHC graphics team.

For Whom the Teamsters Poll Tolls

The Teamsters Union decided to officially endorse neither candidate in the 2024 US presidential election. Prior to their non-announcement announcement, however, the union surveyed its members and then released the polling data ahead of the announcement.

Of course, the teamsters represent but a single union in a large and diverse country. More importantly, the survey results reported only the share of responses for either candidate—and “Other”—so we have no idea how many of what number opted for whom. But hey, it’s another talking point in the final six weeks of the campaign.

Naturally, I decided to visualise the data.

The trend is pretty, pretty clear. The union’s rank-and-file clearly support Trump for president, with the exception of the teamsters in the District of Columbia. (Note, no survey was taken in Wyoming.) In fact, in only eight states plus DC did Harris’ support top 40%.

Credit for the piece is mine.

Fear the Floodwaters

This past weekend saw some flooding along the East Coast due to the Moon pulling on Earth’s water. In Boston that meant downtown flooding, including Long Wharf. The Boston Globe’s article about the flooding dwelt with more impact, causes, and long-term forecasts—none of which really warranted data visualisation or information graphics. Nonetheless, the article included a long time series examining the change in Boston’s sea level relative to the mean.

For me, the graphic works really well. The data strips out the seasonal fluctuations and presents the reader with a clear view of rising sea levels in Boston. If the noisiness of the red line distracts the reader—one wonders if an annual average could have been used—the blue trend line makes it clear.

And that blue trend line has a nice graphic trick to help itself. Note the designer added a thin white stroke on the outside of the line, providing visual separation from the red line below.

My only real critique with the graphic is the baseline and the axis lines. The chart uses solid black lines for the axes, with grey lines running horizontally depicting the deviation from the mean sea level. But the black lines draw the attention of the eye and thus diminish the importance of the 0 inch line, which actually serves as the baseline of the chart.

If I quickly edit the screenshot in Photoshop, you can see how shifting the emphasis subtly changes the chart’s message.

Overall, however, the graphic works really well.

Credit for the piece goes to John Hancock.

Labelling Line Charts

Today I have a little post about something I noticed over the weekend: labelling line charts.

It begins with a BBC article I read about the ongoing return to office mandates some companies have been rolling out over the last few years. When I look for work these days, one important factor is the office work situation and so seeing an article about the tension in that issue, I had to read it.

The article includes this graphic of Office of National Statistics (ONS) data and BBC analysis.

Overall, the chart does a few things I like, most notably including the demarcation for the methodology change. The red–green here also works. Additionally the thesis expressed by the title, “Hybrid has overtaken WFH”, clearly evidences itself by the green line crossing the blue. (I would quibble and perhaps change the hybrid line to red as it is visually more impactful.)

I also like on the y-axis how we do not have a line connecting all the intervals. Such lines are often unnecessary and can often add visual clutter, see yesterday’s post for something similar. I quibble here with dropping the % symbol for the zero-line. Since the rest of the graphic uses it, I would have put the baseline as 0%. And that baseline is indeed represented by a darker, black line instead of the grey used for the other intervals.

Then we get to the labels on the right of the graphic. Firstly, I do not subscribe to the view charts and graphs need to label individual datapoints. If the designer created the chart correctly, the graph should be legible. Furthermore, charts show relationships, if one needs a specific value, I would opt for a table or a factette instead. These are not the most egregious labels, mind you, but here they label the datapoint, but not the line. Instead, for the line the reader needs to go back to the chart’s data definition and retrieve the information associated with the colour.

Now compare that to a chart representing Major League Baseball’s playoff odds from Fangraphs.

Here too we have mostly good things going on, but I want to highlight the labelling at the right. This chart also includes the precise value, which is fine, but here we also have the actual label for the lines. The user does not need to leave the experience of the chart to find the relevant information, although a secondary/redundant display or legend can be found at the bottom of the chart.

If you can take the time to label the end value, you may as well label the series.

Credit for the BBC graphic goes to the BBC’s graphics department.

Credit for the Fangraphs piece goes to Fangraphs’ design team.