As I said yesterday, since people are finding these updates helpful on the social media, I am going to repost the previous evening’s graphics I make on the Coronavirus Covid-19 outbreak here on Coffeespoons as well. So while today is Thursday, these are the numbers states provided yesterday, so it’s more of a Wednesday update.
But here I can start with the flatter curves graphic. The New Jersey numbers in particular look good—I mean they’re still bad. Of course we are just a few big breaches of quarantine and lapses in social distancing from reversing that progress.
Maybe some curve flattening?
State-wise, Pennsylvania continues to worsen. However, a close look at the slope of the line in the previous chart indicates that the steepness of the growth may be lessening. Deaths passed 300 and cases are now firmly entrenched on both sides of the state with the rural, less densely populated areas in the Ridge and Valley portion of the state seemingly hit not as hard.
The situation in Pennsylvania
Despite the potential flattening, New Jersey is just in a rough spot. The final bastions of low case numbers in South Jersey are slowly filling up as Cape May County passed the 100-case threshold.
The situation in New Jersey
Delaware continues to accelerate and is now past 1000 cases.
The situation in Delaware
Virginia continues to see cases spreading in the eastern, more populous portions of the state. And at 75 deaths, it’s nearing the 100-death threshold.
The situation in Virginia
Illinois is seeing deaths occur away from Chicago, in the St. Louis suburban counties and in and around Springfield and Champaign and Bloomington areas.
Yesterday I mentioned that there was some data to suggest that at least in New Jersey the curve was flattening. Monday we received additional data and so I wanted to share what that data showed.
I used a set of bar charts to show the new daily cases yesterday for Pennsylvania, New Jersey, Delaware, Virginia, and Illinois. But as I mentioned, a single day can be a blip. Noise. We want to find the pattern or the signal within that data set. Consequently I applied a simple 7-day rolling average to the new daily cases data set.
I chose seven days for two reasons. The first was that the onset of the symptoms is 5–10 days after infection. Picking a mid-point in that range assures us that people who are infected are beginning to appear in the data. Secondly, a cursory check of the data suggests that reported numbers dip lower on weekends. And so making a week-long average covers any possibility of lower values at week’s end.
That preface out of the way, what do we see? Well, there is some evidence that the curve is flattening in New Jersey. The lines below represent that rolling average. And if you look at the very top of the New Jersey curve, you can see it beginning to flatten.
Flattening curves
Unfortunately that does not mean New Jersey is out of the woods. Not by a long shot. Instead, that means tens of thousands of people will still be infected. And hundreds more will die. But, the rate at which those two things happen will be lower. Hopefully hospitals will not be as overwhelmed as they presently are. And that might make for a lower total death count.
The data does not support, however, the notion that the curve is flattening in the other states. Consider that the United States spans a continent and contains over 330 million people. The outbreak will look different in different states. Compare Pennsylvania and Illinois, which have similar case numbers. But in Pennsylvania we have more cases in smaller cities and rural areas and fewer in the largest cities. Plus, of course, we have the different measures taken by different states to contain and mitigate the pandemic within their borders.
But, we do have some data to suggest that at least in New Jersey the curve is flattening. I’ll take good news where I can find it. (Even if it comes from Jersey.)
This past weekend I continued looking at the spread of COVID-19 across the United States. But in addition to my usual maps of Pennsylvania, New Jersey, Delaware, Virginia, and Illinois, I also looked at the number of cases across the United States adjusted for population. I then looked at the five aforementioned states in terms of new cases to see if the curve is flattening. Finally, I looked at the number of hospital beds per 1000 people vs the number of cases per 1000 people.
The latter in particular I wanted to be an examination of hospitalisation rates vs ICU beds, which are a small fraction of total hospital beds. But as I could not find that data, I made do with overall cases and overall beds.
So first let’s look at the cases across the U.S. What you can see is that whilst New York and New Jersey do have some of the worst of the impact, Washington is still not great and Louisiana and Michigan are also suffering.
The situation across the United States
And then when we look at the states by their cases per 1000 people and their hospital beds per 1000 people, we see that the states often claimed to be overwhelmed, New York, New Jersey, and Washington are all well over the blue line, which indicates an equal number of beds and cases per 1000 people, or near it. Because it is important to remember that not all beds are the type needed for COVID-19 victims, who often require the more fully kitted out ICU beds. Additionally, not all cases are severe enough to warrant hospitalisation.
Cases per 1k people vs hospital beds per 1k people
Then from the broader national view, we can look at the states of interest. Here, those of you who have been following my social media posts, you can see fewer dark purples in these maps. That’s because I have adopted a new palette that has sacrificed granularity at the lower end of the scale and added it at the top, a particular need in New Jersey and the Philadelphia and Chicago metro areas. And finally we look at the daily new cases to see if that curve is flattening.
Pennsylvania now has almost every county infected. But unlike Illinois, which has a similar infection rate but more unaffected counties, Pennsylvania has fewer cases in its big city, Philadelphia, and has more cases in the smaller cities and towns.
The situation in Pennsylvania
New Jersey is just a disaster. Deaths are now reported in every county—so I can probably remove those orange outlines. The only potential good news is that new cases for the second day in a row were fewer than the day before. It could be a blip. But it could also be a signal that the peak of infection has or is nearing. That said, hospitalisations and deaths are lagging indicators and could take two weeks to follow the positive test results. So in the best case scenario that this is a peak, New Jersey is far from out of the woods.
The situation in New Jersey
Delaware is the smallest state I look at—and one of the smallest in the union overall—but its cases are worryingly increasing rapidly, although like every state I examine in detail it had fewer new cases Sunday than Saturday.
The situation in Delaware
Virginia is in a better spot overall than the other four states. You can see that in the national map above. And most of Virginia’s cases are concentrated in the DC and Richmond areas as well as the cities along the peninsulas jutting into the Chesapeake.
The situation in Virginia
Illinois is, as noted above, similar to Pennsylvania in terms of infections. In terms of deaths, however, it is doubling Pennsylvania’s numbers. And most of its cases are located in and around Chicago. Big chunks of downstate Illinois are unaffected or lightly affected compared to the Commonwealth.
The situation in Illinois
Finally, as I noted in New Jersey, could these lower numbers Sunday than Saturday be meaningful? Possibly. But in all five states? Highly unlikely. Regardless, we can look at the number of daily new cases and see if that curve of infection is flattening. We should wait several days before beginning to make that assessment. But one can hope.
The case for flattening curves
All of this is to say that things are bad and likely will continue to get worse. But I will keep looking at the data daily and presenting it to the public to keep them informed.
The British election campaign is wrapping up as it heads towards the general election on Thursday. I haven’t covered it much here, but this piece from the BBC has been at the back of my mind. And not so much for the content, but strictly the design.
In terms of content, the article stems from a question asked in a debate about income levels and where they fall relative to the rest of the population. A man rejected a Labour party proposal for an increase in taxes on those earning more than £80,000 per annum, saying that as someone who earned more than that amount he was “not even in the top 5%, not even the top 50”.
The BBC looked at the data and found that actually the man was certainly within the top 50% and likely in the top 5%, as they earn more than £75,300 per annum. Here in the States, many Americans cannot place their incomes within the actual spreads of income. The income gap here is severe and growing. But, I want to look at the charts the BBC made to illustrate its points.
The most important is this line chart, which shows the income level and how it fits among the percentages of the population.
Are things lining up? It’s tough to say.
I am often in favour of minimal axis lines and labelling. Too many labels and explicit data points begin to subtract from the visual representation or comparison of the data. If you need to be able to reference a specific data point for a specific point on the curve, you need a table, not a chart.
However, there is utility in having some guideposts as to what income levels fit into what ranges. And so I am left to wonder, why not add some axis lines. Here I took the original graphic file and drew some grey lines.
Better…
Of course, I prefer the dotted or dashed line approach. The difference in line style provides some additional contrast to the plotted series. And in this case, where the series is a thin but coloured line, the interruptions in the solidity of the axis lines makes it easier to distinguish them from the data.
Better still.
But the article also has another chart, a bar chart, that looks at average weekly incomes across different regions of the United Kingdom. (Not surprisingly, London has the highest average.) Like the line chart, this bar chart does not use any axis labels. But what makes this one even more difficult is that the solid black line that we can use in the line charts above to plot out the maximum for 180,000 is not there. Instead we simply have a string of numbers at the bottom for which we need to guess where they fall.
Here we don’t even a solid line to take us out to 700.
If we assume that the 700 value is at the centre of the text, we can draw some dotted grey lines atop the existing graphic. And now quite clearly we can get a better sense of which regions fall in which ranges of income.
We could have also tried the solid line approach.
But we still have this mess of black digits at the bottom of the graphic. And after 50, the numbers begin to run into each other. It is implied that we are looking at increments of 50, but a little more spacing would have helped. Or, we could simply keep the values at the hundreds and, if necessary, not label the lines at the 50s. Like so.
Much easier to read
The last bit I would redo in the bar chart is the order of the regions. Unless there is some particular reason for ordering these regions as they are—you could partly argue they are from north to south, but then Scotland would be at the top of the list—they appear an arbitrary lot. I would have sorted them maybe from greatest to least or vice versa. But that bit was outside my ability to do this morning.
So in short, while you don’t want to overcrowd a chart with axis lines and labelling, you still need a few to make it easier for the user to make those visual comparisons.
Credit for the original pieces goes to the BBC graphics department.
For my American audience, this week is Thanksgiving. That day when we give thanks for Native Americans giving European settlers their land for small pox ridden blankets. And trinkets. Don’t forget the trinkets. But we largely forget about the history and focus on three things: family, food, and American football. Not necessarily in that order.
But this week I am largely going to want to focus on the food.
Today we can look at a graphic coming from a team of researchers at the University of Illinois who examined the flows of food across the United States, down to the county level. It helped produce this map that shows the linkages between counties.
Oh look at that Mississippi River trail
To be sure, the piece uses some line charts and other maps to showcase the links, but the star is really this map. But aside from its lack of Alaska and Hawaii, I think it suffers from one key design choice: leaving the county borders black.
The black lines, while thin, compete with the faint blue lines that show the numerically small links between counties. Larger trade flows, such as those within California, are clearly depicted with thicker strokes that contrast with the background political boundaries of the counties. But the light blue lines recede into the background beneath the borders.
I wonder if a map of solid, light grey fills and white county borders would have helped showcase the blue lines and thus trade flows a little bit better. After all, the problem is especially noticeable in the eastern half of the United States where we have much geographically smaller counties.
Hat tip to friend and former colleague Michael Schaefer for sharing the article in question.
A few weeks ago we said farewell to John Bercow as Speaker of the House (UK). Whilst I covered the election for the new speaker, I missed the opportunity to post this piece from the BBC. It looked at Bercow’s time in office from a data perspective.
The piece did not look at him per se, but that era for the House of Commons. The graphic below was a look at what constituted debates in the chamber using words in speeches as a proxy. Shockingly, Brexit has consumed the House over the last few years.
At least climate change has also ticked upwards?
I love the graphic, as it uses small multiples and fixes the axes for each row and column. It is clean, clear, and concise—just what a graphic should be.
And the rest of the piece makes smart use of graphical forms. Mostly. Smart line charts with background shading, some bar charts, and the only questionable one is where it uses emoji handclaps to represent instances of people clapping the chamber—not traditionally a thing that happens.
Content wise it also nailed a few important things, chiefly Bercow’s penchant for big words. The piece did not, however, cover his amazing sense of sartorial style vis-a-vis neckties.
Overall a solid piece with which to begin the weekend.
Credit for the piece goes to Ed Lowther & Will Dahlgreen.
In a recent Washington Post piece, I came across a graphic style that I am not sure I can embrace. The article looked at the political trifecta at state levels, i.e. single political party control over the government (executive, lower legislative chamber, and upper legislative chamber). As a side note, I do like how they excluded Nebraska because of its unicameral legislature. It’s also theoretically non-partisan (though everybody knows who belongs to which party, so you could argue it’s as partisan as any other legislature).
At the outset, the piece uses a really nice stacked bar chart. It shows how control over the levers of state government have ebbed and flowed.
You can pretty easily spot the recent political eras by the big shifts in power.
It also uses little black lines with almost cartoonish arrowheads to point to particular years. The annotations are themselves important to the context—pointing out the various swing years. But from an aesthetic standpoint, I have to wonder if the casualness of the marks detracts from the seriousness of the content.
Sometimes the whimsical works. Pie charts about pizza pies or pie toppings can be whimsical. A graphic about political control over government is a different subject matter. Bloomberg used to tackle annotations with a subtler and more serious, but still rounded curve type of approach. Notably, however, Bloomberg at that time went for an against the grain, design forward, stoic business serious second approach.
Then we get to a choropleth map. It shows the current state of control for each state.
X marks the spot?However, here the indicator for recent party switches is a set of x’s. These have the same casual approach as the arrows above. But in this case, a careful examination of the x’s indicates they are not unique, like a person drawing a curve with a pen tool. Instead these come from a pre-determined set as the x’s share the exact same shape, stroke lengths and directions.
In years past we probably would have seen the indicator represented by an outline of the state border or a pattern cross-hatching. After all, with the purple being lighter than the blue, the x’s appear more clearly against purple states than blue. I have to admit I did not see New Jersey at first.
Of course, in an ideal world, a box map would probably be clearer still. But the curious part is that the very next map does a great job of focusing the user’s attention on the datapoint that matters: states set for potential changes next November.
Pennsylvania is among the states…
Here the states of little interest are greyed out. The designers use colour to display the current status of the potential trifecta states. And so I am left curious why the designers did not choose to take a similar approach with the remaining graphics in the piece.
Overall, I should say the piece is strong. The graphics generally work very well. My quibbles are with the aesthetic stylings, which seem out of place for a straight news article. Something like this could work for an opinion piece or for a different subject matter. But for politics it just struck a loud dissonant chord when I first read the piece.
Credit for the piece goes to Kate Rabinowitz and Ashlyn Still.
Last we looked at the revenge of the flyover states, the idea that smaller cities in swing states are trending Republican and defeating the growing Democratic majority in big cities. This week I want to take a look at something a few weeks back, a piece from CityLab about the elections in Virginia, Kentucky, and Mississippi.
There’s nothing radical in this piece. Instead, it’s some solid uses of line charts and bar charts (though I still don’t generally love them stacked). The big flashy graphic was this, a map of Virginia’s state legislative districts, but mapped not by party but by population density.
Democrats now control a majority of these seats.
It classified districts by how how urban, suburban, or rural (or parts thereof) each district was. Of course the premise of the article is that the suburbs are becoming increasingly Democratic and rural areas increasingly Republican.
But it all goes to show that 2020 is going to be a very polarised year.
Well, everyone, we made it to Friday. So let’s all reflect on how many things we did on our mobile phones this week. xkcd did. And it’s fairly accurate. Though personally, I would only add that I did not quite use my mobile for a TV remote. Unless you count Chromecasting. In that case I did that too.
What about boarding passes?
If I have to offer a critique, it’s that it makes smart use of a stacked bar chart. I normally do not care for them, but it works well if you are only stacking two different series in opposition to each other.
The World Series began Tuesday night. But, as many people reading this blog will know, baseball is not exactly a global sport. But is it really? CityLab looked at the origin of Major League Baseball players and it turns out that almost 30% of the players today are from outside the United States. They have a number of charts and graphics that explore the places of birth of ball players. But one of the things I learned is just how many players hail from the Dominican Republic—since 2000, 12% of all players.
There are quite a few players from countries around the Caribbean.
The choropleth here is an interesting choice. It’s a default choice, so I understand it. But when so many countries that are being highlighted are small islands in the Caribbean, a geographically accurate map might not be the ideal choice. Really, this map highlights from just how few countries MLB ball players originate.
Fortunately the other graphics work really well. We get bar charts about which cities provide MLB rosters. But the one I really enjoy is where they account for the overall size of cities and see which cities, for every 100,000 people, provide the most ballplayers.
One of the other things I want to pick on, however, is the inclusion of Puerto Rico. In the dataset, the designers included it as a foreign country. When, you know, it’s part of the United States.