Correcting CBS News Charts

One of the long-running critiques of Fox News Channel’s on air graphics is that they often distort the truth. They choose questionable if not flat-out misleading baselines, scales, and adjust other elements to create differences where they don’t exist or smooth out problematic issues.

But yesterday a friend sent me a graphic that shows Fox News isn’t alone. This graphic came from CBS News and looked at the California recall election vote totals.

If you just look at the numbers, 66% and 34%, well we can see that 34 is almost half of 66. So why does the top bar look more like 2/3 of the length of the bottom? I don’t actually know the animus of the designer who created the graphic, but I hope it’s more ignorance or sloppiness than malice. I wonder if the designer simply said, 66%, well that means the top bar should be, like, two-thirds the length of the bottom.

The effect, however, makes the election seem far closer than it really was. For every yes vote, there were almost two no votes. And the above graphic does not capture that fact. And so my friend asked if I could make a graphic with the correct scale. And so I did.

One really doesn’t need a chart to compare the two numbers. And I touch on that with the last point, using two factettes to simply state the results. But let’s assume we need to make it sexy, sizzle, or flashy. Because I think every designer has heard that request.

A simple scale of 0 to 66 could work and we can see how that would differ from the original graphic. Or, if we use a scale of 0 to 100, we can see how the two bars relate to each other and to the scale of the total vote. That approach would also have allowed for a stacked bar chart as I made in the third option. The advantage there is that you can easily see the victor by who crosses the 50% line at the centre of the graphic.

Basically doing anything but what we saw in the original.

Credit for the original goes to the CBS News graphics department.

Credit for the correction is mine.

Fixing Where 9/11 Happened

Today we have a quick piece, but one that I read at the weekend, you know, the 9/11 20th anniversary one. The article served as a quick summary of the day for those who either don’t know or don’t remember. After 20 years, there are a lot of people who have come of age in a post-9/11 world that were either not born or too young to recall those before times.

And so this map helped to identify the location of the three sites impacted by the planes: the World Trade Centre in New York, the Pentagon in Washington, and a field in Shanksville, Pennsylvania.

See anything off?

Except look closely at the graphic.

Little is where it belongs. The World Trade Centre marker is on Sandy Hook, New Jersey. The Pentagon is nearer to Fredericksburg than Washington. And Shanksville is in Maryland.

You can leave the dots for Washington and New York, as they are correctly placed. But why not just use some typography to put the World Trade Centre beneath New York and the Pentagon beneath Washington?

What makes it peculiar is that Shanksville is in Maryland, so it’s dot is just wrong. And so here’s a rough fix for that part of the graphic.

It was just an odd graphic for an article about one of those days that will be long-remembered in history.

Credit for the piece goes to the BBC graphics department.

Covid Update: 13 September

It’s been a little less than a week since our last Covid-19 update for Pennsylvania, New Jersey, Delaware, Virginia, and Illinois. At the time we had just come back from the Labour Day holiday here in the United States and that left us with two big questions. First, what would the data show after we began to process the tests after the extra time off? Second, would the holiday itself cause any increase in the numbers of new cases?

We also need to remember that last week we had seen some positive signs in some states. And we can start with those states today.

New case curves for PA, NJ, DE, VA, & IL

In New Jersey and Illinois we had the clearest evidence of this fourth wave peaking and new cases, whilst still climbing, slowing down with the seven-day average beginning to fall. The good news continues to be that both states continue to show signs their fourth waves have peaked. In fact, Illinois appears to be beginning a downward trajectory. New Jersey has flattened the curve, in other words exhibiting steady numbers of new cases each day.

Delaware appeared to have peaked, but after a brief dip following the holiday, the numbers have begun to shoot back up again. The seven-day average as of yesterday hit 457 new cases per day, exceeding that spike just prior to Labour Day. In other words, it appears that the fear of the holiday increasing rates of new cases, just as they appeared to be peaking came true in Delaware.

What about Virginia and Pennsylvania? Well in the former we had some indications prior to Labour Day that Virginia may have been approaching a peak of new cases. And now you can throw that out the window. Over the three-day holiday weekend, Virginia added just under 11,000 new cases. This past weekend, only two days, Old Dominion added just over 9,200. Not surprisingly the seven-day average spiked upward yesterday to 4,700 new cases per day. If the fourth wave continues at that pace, it will soon surpass the rates we saw last winter.

And in Pennsylvania the data is also not great. We had seen perhaps the beginning of a decline after a peak prior to Labour Day. In the week since? Well, the numbers of new cases have started climbing once again. In fact, yesterday the seven-day average climbed to just under 4,100 new cases per day. That is still below the spring peak and well below winter, but surpasses the numbers we saw just before Labour Day.

In other words, the fear of Labour Day creating new cases appears to have come true.

So then what about deaths? We know that deaths from any increase in cases won’t manifest in the data for a few weeks.

Death curves for PA, NJ, DE, VA, & IL.

Starting with good news, let’s look at Pennsylvania. Two days after Labour Day the Commonwealth’s seven-day average for deaths reached 30.1 deaths per day. In the almost week since that rate has steadily dropped to 24.3 per day. Ideally we would want to see that trend extend beyond five days. Because if the Labour Day surge persists, it wouldn’t be beyond belief to imagine deaths rising again in coming days.

But that’s also about it for good news. True, Delaware went from 0.9 deaths per day to just 1.0. But that’s more of a stable rate than anything. All the other states have seen their death rates continue to climb of late. Although, we would also expect deaths to peak sometime after the peak in new cases, so this trend makes sense.

In New Jersey deaths climbed from 12.4 to 13.1 per day. Not terrible, but again still an increase in deaths. The worst increases were in Illinois and Virginia. In Illinois deaths have continued to climb, rising from 30.7 last time we wrote to 34.7. But Virginia has seen the worst, despite an apparent dip around Labour Day. Instead people are dying at increasing rates, climbing from 16.7 deaths per day to 27.1 as of yesterday.

Unfortunately, until we see new cases truly peak in Virginia those numbers are likely to continue climbing in coming days and possibly weeks.

Mapping the Mullica Hill Tornado

When the remnants of Hurricane Ida rolled through the Northeast two weeks ago, here in the Philadelphia region we saw catastrophic flooding from deluges west of the city and to the east we had a tornado outbreak in South Jersey. At a simplistic level we can attribute the differences in outcomes to the path of the storm. As Ida was no longer a hurricane she developed what we call warm and cold sectors along the frontal boundary. Long story short, we can see different types of weather in these setups with heavy rain in the cold and severe weather in the warm. And that’s what we saw with Ida: heavy, flood-causing rains in the cold sector north and west of Philadelphia and then severe weather, tornadoes, in the warm sector south and east of the city.

But I want to talk about the tornadoes, and one in particular: an EF3 tornado that struck the South Jersey town of Mullica Hill. EF3 refers to the enhanced Fujita scale that describes the severity of tornadoes. EF1s are minor and EF5s are the worst. The Philadelphia region has in the past rarely seen tornadoes, but even moderate strength ones such as an EF3 are almost unheard of in the area.

This tornado caused significant damage to the area, but was also remarkable because it persisted for 12 miles. Most tornadoes dissipate in a fraction of that length. The Mount Holly office of the National Weather Service (NWS) produced a few graphics detailing tornadoes Ida spawned. You can see from the timeline graphic that the Mullica Hill tornado was particularly long lasting in time as well as distance travelled, surviving for twenty minutes.

To briefly touch on the design of this graphic, I think it generally works well. I’m not certain if the drop shadow adds anything to the graphic and I might have used a lighter colour text label for the times as they fight with the graphical components for visual primacy. Secondly, I’m not certain that each tornado needs to be in a different colour. The horizontal rules keep each storm visually separate. Colour could have instead been used to indicate perhaps peak severity for each tornado or perhaps at specific moments in the tornadoes’ lives. But overall, I like this graphic.

NWS Mt Holly also produced a graphic specifically about the tornado detailing its path.

Here we have a graphic incorporating what looks like Google Earth or Google Maps imagery of the area and an orange line denoting the path. At various points faint text labels indicate the strength of the tornado along its path. A table to the left provides the key points.

From a graphical standpoint, I think this could use a bit more work. The orange line looks too similar to the yellow roads on the map and at a quick glance may be too indistinguishable. Compare this approach to that of the Philadelphia Inquirer in its writeup.

Here we have a map with desaturated colours with a bright red line that clearly sets itself apart from the map. I think a similar approach would have benefitted the NWS graphic. Although the NWS graphic does have a stroke weight that varies depending upon the path of the tornado.

I also have a graphic made by a guy I know who lives in the area. He took that maximum tornado width of 400 yards and used screenshots of Google Maps in combination with his own direct evidence and photos and videos from his neighbours and their social media posts to try and plot the tornado’s path more granularly. Each red mark represents storm damage and a width of 400 yards.

I’m not going to critique this graphic because he made it more for himself to try and understand how close he was to the storm. In other words, it wasn’t meant to be published. But I’m thankful he allowed me to share it with you. But even here you can see he chose a colour that contrasted strongly with the background satellite views.

All of this just goes to show you the path and devastation one tornado caused. And that one tornado was just a fraction of the devastation Ida wrought upon the Northeast let alone the rest of the United States.

The Hexagons of Saturn

Well, it’s the end of another week. I’ll save the bigger posts I have planned for next week and instead end with this little astronomy/geometry gem from xkcd. It takes a look at Saturn’s polar storm that takes the shape of a hexagon, not a circle or anything else.

Gooaaaallllllll

Credit for the piece goes to Randall Munroe.

Covid Update: 8 September

Last time we looked at the state of the Covid-19 pandemic in Pennsylvania, New Jersey, Delaware, Virginia, and Illinois we had some encouraging signs. In particular we had evidence of a potential slowdown in New Jersey, Delaware, and Illinois and I wrote that I would not be surprised if we saw a peak in new cases. Virginia was the only state where things were bad and getting worse, though the rate at which they were getting bad had begun to slow. Finally, Pennsylvania had some conflicting data with its new cases and I wanted to see another week’s worth of evidence.

So a week or later, where are we?

First, the obvious caveat that the Labour Day holiday may affect these numbers in both the short term collection of data and the medium term potential for a surge of new cases from gatherings and parties.

New case curves for PA, NJ, DE, VA, & IL.

If we look to last week’s good news states of New Jersey, Delaware, and Illinois, we do indeed see what we’ve long been waiting for since this summer: potential peaks in this fourth wave. New Jersey and Illinois. We can see the trend most clearly in New Jersey and Illinois where the beginnings of a decline from the peak appear as a slight dip from what we hope was the peak of the fourth wave. In Delaware that peak is still apparent, but the decline is less so.

Additionally for Delaware, over the last week the numbers rose above those of the third wave, i.e. in terms of cases the fourth wave is worse than the third wave of the spring. We had already seen Illinois reached that unfortunate milestone and fortunately for New Jersey the Garden State peaked at a point nearly half of the third wave’s peak.

What about last week’s bad news state? Well, Virginia, which does not report data on weekends, had an additional day of zero earlier this week. You can see that with the peak on Tuesday, which exceeded 10,000 new cases in one day. Of course, that’s really over three days and so we are talking about 3,000 per day. Unfortunately that extra holiday affected the seven-day average and caused a sudden fall. If we look closely at the data we can see that the trend probably points to a situation that continues to worsen. Two weeks ago the seven-day average was generally about 2800; last week it was about 3100; and heading into this week—excluding the Labour Day aberration—it looks to be about 3300. The very rough differences of 300 to 200 could point to a slowing rise in new cases, the necessary precursor to a peak, but we’d need to see how the rest of this week plays out before we can make any determination. But I’d probably say we are beginning to see the first signs of arriving at a peak in the coming weeks, maybe not next but perhaps in two or so.

And then we have Pennsylvania, where we had conflicting data and I wanted another week’s worth before making comment. I think the Commonwealth has indeed peaked, but unlike in New Jersey, Delaware, and Illinois, this peak looks more complicated. Note how we do have the recent spike I alluded to last week, but subsequent to that spike the numbers have been lower. I say complicated because in the aftermath of the holiday weekend we are seeing a slight tick upwards in the number of new cases, but it’s still below that spike. Consequently I’m reasonably confident we’ve just begun to peak here in Pennsylvania, but I’ll clearly want another week’s data before saying that with more authority.

What about deaths? How have those progressed over the last week?

Here too I’d be remiss if I failed to reiterate the caveats above that Labour Day can skew with the seven-day average—as we saw with Virginia—though any clustering of a surge of deaths would likely be weeks away given death’s status as a lagging indicator.

Death curves for PA, NJ, DE, VA, & IL.

Here, unfortunately, we have not reached peak deaths for the fourth wave, at least not for all five states.

Starting with the bad news, we have two states where the numbers continue to climb. In Pennsylvania and Illinois, the two largest states in the data set, we have deaths continuing to climb. Both states’ averages exceeded 30 deaths per day yesterday. For Pennsylvania that is the first time since early June. We need not go much further back for Illinois, which had last recorded an average of 30 deaths per day in late May.

In fact, Illinois yesterday reached an average of 30.7 deaths per day. The state’s peak during the third wave was not much higher, 31.7. Given that we are seeing higher numbers of new cases in the fourth wave than we saw in the third, I would expect the deaths to continue climbing and exceed the third wave’s death rate in coming days.

Pennsylvania’s seven-day average peaked at 51 deaths per day earlier this summer and the Commonwealth’s average of 30.1 yesterday is still far below that level. Given that this fourth wave appears to be less severe in terms of new cases than the third wave, I doubt we reach the level of 51 per day, but I wouldn’t put a level in the 40s out of reach.

Next we have three states where we probably have some good news. Delaware is probably the easiest to report. Since the state has so few people with which to begin, we can expect to see fewer deaths. Indeed, this fourth wave may have peaked and did so at only 3.4 deaths per day. But yesterday that number fell to 0.9. However, I wonder if that is due to the Labour Day holiday. I would want to see more data before saying with more authority that Delaware may have peaked in terms of deaths.

Virginia and New Jersey both present cases where the seven-day averages are now down from some high numbers about two weeks ago. In late August, New Jersey reached a level of 14.3 deaths per day and Virginia hit 21.4. Yesterday’s averages had each state at 12.4 and 16.6, respectively. That looks good. But we can also see that in the last two days both states reported their highest number of daily increases since the beginning of the fourth wave. Yesterday New Jersey reported an additional 29 deaths. Virginia reported 30 yesterday, not the highest, but for that we need look only to the day before when it reported 48.

In other words, I want to see if these recent high numbers of new deaths are the result of delays from Labour Day or if we are beginning to see an actual reversal in the trend. Both at this point appear plausible. I would suspect, however, that New Jersey, given its peaked number of new cases, is more likely to be on actual downward trend in deaths. That said, given the recentness of that peak, I would still expect deaths to rise. I want another week’s worth of data to better evaluate the Garden State.

Virginia seems pretty clear to me, the most likely cause in the dip in deaths of late relates to the holiday. With the numbers of new cases continuing to climb and a peak appearing to be at least a few weeks away, Virginia probably can expect deaths to resume climbing for a bit more time. And of course this wave is already worse, in terms of deaths, than the third wave. Unfortunately I think that story line will only get stronger.

Credit for the piece is mine.

Protecting New Jersey’s Back Bay Communities

We’re back and there’s a lot to touch on this week. But first, as a prelude to some of the Hurricane Ida coverage, I wanted to briefly point our attention to an article in the Philadelphia Inquirer from about two or three weeks before Ida struck.

The article focused on the US Army Corps of Engineers proposal to protect the back bay areas of the South Jersey shore, i.e. the areas between the outer barrier islands and the mainland. The article chose a few graphics from the report to draw attention to some of the proposed solutions, e.g. massive gates, new levee systems, and wetland restoration.

I wanted to focus on a different graphic in the report. This functioned more as an illustrated guide to the whole suite of solutions available to mitigating flood and storm surge disasters. Because, in the future, rising sea levels will threaten coastal communities. And as we saw just last week here in the Northeast, warmer seas plus warmer skies increase the potential for storms with crippling deluges.

Image links to the report, not the article

The graphic shows how we can try to deal with surge waters from out beyond the barrier islands through to the back bay to communities inland both by protecting, adapting, and in some cases relocating.

All need to be on the table, because if last week showed us anything—not that many hadn’t been saying this for decades—it’s not just the bayous of New Orleans and Florida’s beaches that are at risk from environments and weather patterns altered by a changing climate, but even those areas more local (to the Northeast).

Credit for the piece goes to the US Army Corps of Engineers.

Covid Update: 31 August

A brief bit of housekeeping, your author will be taking a brief holiday during which I won’t be posting. But I should return to posting next week.

Last week we looked at some relatively good news in terms of the spread of Covid-19 in the states of Pennsylvania, New Jersey, Delaware, Virginia, and Illinois. We had been watching some recent data that suggested some of the five states had begun to see a real and measurable slowdown in the rate of increase of new cases.

Where are we this week? Did those trends continue?

Kind of…

New case curves for PA, NJ, DE, VA, & IL.

Starting with the bad news, we have Virginia. Old Dominion is now in the throes of a Covid wave worse than what it saw back in the Spring, and fast nearing the heights—maybe depths—of the winter wave. That wave peaked just under 6,200 new cases per day. (And fell to as low as 129 per day in mid-June.) Presently Virginia reports just under 3,200 new cases per day, or nearly half that previous peak. Unfortunately, we do not yet see any significant decline in the rate at which those case numbers have increased. The line in the graphic above is no longer curving upwards, instead you can describe it as more a straight line.

Somewhere below Virginia in that it’s not bad news, but it’s also not great news, we have Delaware and Illinois. In both states the unfortunate news remains that new cases continue to increase. But at present we can also see that new cases have begun to slow. In these states the curves have begun to flatten out, though they still tilt positive.

Contrast that to New Jersey, where we have good news. The Garden State looks similar to Delaware and Illinois, but the difference is the curve in New Jersey may have peaked. The line is now tilted negative. Of course, this is a very recent development and we would want to see this trend continue for a week or so before we begin to speak of New Jersey’s wave cresting.

But between New Jersey, Delaware, and Illinois, I would not be surprised if by the end of next week we begin to see new cases peaking and beginning to decline.

But what about Pennsylvania? Initially I would have placed the Commonwealth with Delaware and Illinois as it clearly had not peaked, but it did exhibit evidence that its curve was beginning to flatten. In recent days, however, as one can clearly see in the chart, the average has begun to shoot back up again. But as I cautioned last week, that’s not uncommon. Consequently, I want to see another week’s worth of data before we begin to talk about what direction Pennsylvania is taking.

In all this though, we do have one wildcard. This weekend we in the United States begin our Labour Day holiday. Will Labour Day gatherings and parties lead to increased spread of the virus? Will we have super-spreader events? Unfortunately we will not know for about a week or week and a half after the holiday.

As all this has been happening, we also have the death rate.

Death curves for PA, NJ, DE, VA, & IL.

Last week I noted that we had begun to see rising numbers of deaths. This made sense given that deaths lag behind new cases. Early in the pandemic it often—not always—took a few weeks before people needed hospitalisation. Then a few weeks later is often—not always—when people would die. So a few weeks after the fourth wave began to take hold we continue to see rising numbers of deaths in all five states.

In Virginia and Illinois we see two of the most significant increases. In fact in the third wave, Illinois peaked at just under an average of 32 deaths per day. As of Tuesday the seven-day average was at just over 25. And with the current trend pointing towards increasing death, it’s possible we could see the fourth wave be more lethal in Illinois than the third.

Compare that to Virginia. Old Dominion saw a smaller death rate in the third wave, peaking at 18 deaths per day. However, just yesterday the state reached an average of 21 deaths per day. In other words, Virginia’s fourth wave has become more lethal than its third wave. Unfortunately, like in Illinois we continue to see deaths climbing and there is no evidence yet that deaths are slowing down.

In the tri-state area we see some slightly better news by comparison. In Pennsylvania and New Jersey deaths remain below their third wave peak. For example, in the third wave, Pennsylvania peaked at nearly 50 deaths per day. Yesterday the average was just below 20. Despite both states being below their third wave peaks, however, deaths do continue to climb.

Delaware is the exception here. With such a small population, it reached a third wave peak of about two deaths per day. At present it’s just reached three. But I would not say that three is significantly greater than two.

Overall, however, I expect to see deaths continue to climb over the next week or two until these slowing rates of new cases begin to create slowing death rates. And so I am hopeful that in the coming few weeks we will begin to see the new case rates slow, peak, and begin to decline by about mid-September. That’s more likely in places like New Jersey, Delaware, and Illinois, but if we’re lucky those patterns or similar will soon begin to emerge in Pennsylvania and Virginia.

Credit for the piece is mine.

Spread of the Caldor Wildfire

I’ve been searching to see if I could find a better motion graphic of this, alas not. I saw a post on Instagram from the Washington Post that featured a timelapse video or graphic of the spread of the Caldor Wildfire. The Caldor Wildfire presently rages southwest of Lake Tahoe and has now forced the evacuation of South Lake Tahoe, a city of more than 20,000 people.

If you can’t stand the heat?

The motion graphic just does a nice job of simply capturing the spread, in both direction and speed, of the wildfire. Obviously, the whole area could use the inches of rain that Hurricane Ida is dumping on the eastern half of the country.

Credit for the piece goes to the Washington Post graphics department.

Big Beer

A few weeks back, a good friend of mine sent me this graphic from Statista that detailed the global beer industry. It showed how many of the world’s biggest brands are, in fact, owned by just a few of the biggest companies. This isn’t exactly news to either my friend or me, because we both worked in market research in our past lives, but I wanted to talk about this particular chart.

Not included, your home brew

At first glance we have a tree map, where the area of each “squarified” shape represents, usually, the share of the total. In this case, the share of global beer production in millions of hectolitres. Nothing too crazy there.

Next, colour often will represent another variable, for market share you might often see greens or blues to red that represent the recent historical growth or forecast future growth of that particular brand, company, or market. Here, however, is where the chart begins to breakdown. Colour does not appear to encode any meaningful data. It could have been used to encode data about region of origin for the parent company. Imagine blue represented European companies, red Asian, and yellow American. We would still have a similarly coloured map, sans purple and green,

But we also need to look at the data the chart communicates. We have the production in hectolitres, or the shape of the rectangle. But what about that little rectangle in the lower right corner? Is that supposed to be a different measurement or is it merely a label? Because if it’s a label, we need to compare it to the circles in the upper right. Those are labels, but they change in size whereas the rectangles change only in order to fit the number.

And what about those circles? They represent the share of total beer production. In other words the squares represent the number of hectolitres produced and the circles represent the share of hectolitres produced. Two sides of the same coin. Because we can plot this as a simple scatter plot and see that we’re really just looking at the same data.

Not the most interesting scatter plot I’ve ever seen…

We can see that there’s a pretty apparent connection between the volume of beer produced and the share of volume produced—as one would (hopefully) expect. The chart doesn’t really tell us too much other than that there are really three tiers in the Big Six of Breweries. AB Inbev is in own top tier and Heineken is a second separate tier. But Carlsberg and China Resources Snow Breweries are very competitive and then just behind them are Molson Coors and Tsingtao. But those could all be grouped into a third tier.

Another way to look at this would be to disaggregate the scatter plot into two separate bar charts.

And now to the bars…

You can see the pattern in terms of the shapes of the bars and the resulting three tiers is broadly the same. You can also see how we don’t need colour to differentiate between any of these breweries, nor does the original graphic. We could layer on additional data and information, but the original designers opted not to do that.

But I find that the big glaring miss is that the article makes the point despite the boom in craft beer in recent years, American craft beer is still a very small fraction of global beer production. The text cites a figure that isn’t included in the graphic, probably because they come from two different sources. But if we could do a bit more research we could probably fit American craft breweries into the data set and we’d get a resultant chart like this.

A better bar…

This more clearly makes the point that American craft beer is a fraction of global beer production. But it still isn’t a great chart, because it’s looking at global beer production. Instead, I would want to be able to see the share of craft brewery production in the United States.

How has that changed over the last decade? How dominant are these six big beer companies in the American market? Has that share been falling or rising? Has it been stable?

Well, I went to the original source and pulled down the data table for the Top 40 brewers. I took the Top 15 in beer production, all above 1% share in 2020, and then plotted that against the change in their beer production from 2019 to 2020. I added a benchmark of global beer production—down nearly 5% in the pandemic year—and then coloured the dots by the region of origin. (San Miguel might not seem to fit in Asia by name, but it’s from the Philippines.)

Now I can use a good bar.

What mine does not do, because I couldn’t find a good (and convenient) source is what top brands belong to which parent companies. That’s probably buried in a report somewhere. But whilst market share data and analysis used to be my job, as I alluded to in the opening, it is no longer and I’ve got to get (virtually) to my day job.

Credit to the original goes to Felix Richter.

Credit for my take goes to me.