index.knit

It’s these functions we have not yet really been paying much attention to.

Instead, we’ve been focusing on those that map the data to the visual channel, right?

Here I’m showing the same pseudo-code we’ve seen before, but now I’m circling the functions where you code how this non-data ink looks. The things on the right.

So these all provide help and context for our audiences to read our data encodings.

In fact, when we use the sort of styled themes, like “theme_minimal” or “theme_void” or “theme_tufte” or “theme economist,” all these themes do is alter the default settings in the function “theme”.

So if you want to know how a named theme creates a certain look, you can look at it’s settings. Let me show you now.

[SHOW THEM]

All this leads to the idea of how much of data-ink and non-data ink improves versus inhibits explaining the graphic to others.

We’re all now quite familiar with what Doumont said of our goals in communicating with others. It is to

“get our audiences to pay attention to, understand, and be able to act upon a maximum of messages, given constraints.”

And we will be practicing doing that for one audience, the analytics executive, in our individual homework number three.

That audience is assumed to understand the details and history of the organization they work at, and assumed to understand data science concepts, perhaps, in some cases, even better than we do. What will be new to them is our proposed project and reason for proposing it, right?

What about an audience is external to the organization?

It’s typically broader and a more general audience. This means we assume they likely know less about our organization than the analytics executive, right?

Because they don’t have daily exposure to it. Perhaps they haven’t even yet been introduced. Or maybe they know more. That depends on your external audience and purpose of selecting them.

This external, broader or more general audience may also not, on average, have the depth of training in data science, even if some individuals do.

How can we communicate with such mixed audiences?

As Doumont suggests, we should try for every part of the communication to provide something interesting to those more knowledgable in our target group while helping those less familiar, the generalists, in your target group understand too.

Doumont, if you recall, discusses these mixed audiences, and gave us a short example on how we can approach that communication.

[POLL AVAILABLE BELOW RE “MESSAGE”]

In Doumont’s first version, and I’ve pulled this example from your reading earlier in the semester from his chapter Fundamentals. It states,

“We worked with IR.”

If some in the audience do not know what IR is, you confuse them and they lose interest. You alienate them in your communication.

Let’s read the second version.

“We worked with IR. IR stands for Information Resources and is a new department.”

What do you think of this? Does it help those in your mixed audience that did not know what IR stood for? How? What is that second sentence? A definition? You’re telling them a definition, right?

But what about those in the audience that already know what IR stands for?

How do they feel when they hear you telling them a definition they already know? Maybe they feel you do not understand them because if you did, you would know that they know? You’re not speaking with them from their point of view.

These first two sentences demonstrate the challenges of a mixed audience.

Doumont suggests a solution. We should weave in explainers into our sentences — and we will discuss this in the next few weeks — that help the generalists in our audience, but give the specialist something new or interesting to consider. Does this third sentence do that?

This idea is directly applicable to graphics, too, and we’ll get to that in a bit.

Let’s start with titles.

Aren’t titles an overall annotation to a graphic? While it is more common to see a title as a generic description of the data, that’s a waste of space. Instead, we should use titles to explain what we’re trying to show with the graphic. If the point is to show some pattern in the data, then use the title to explain the pattern as shown in the graphic.

I’ve contributed this graphic as part of our class example of the Dodgers, which we will begin to consider next week.

This audience is the Dodgers marketing executive, though of course, it could be used in a particular context for other audiences.

If we look only at the graphic on the left, does it explain what it is or what the marketing executive should take away from it?

What about on the right? Now you haven’t seen the additional context this came from yet. But take a moment to review it.

Does the title explain what the data shows as opposed to only what it is? Does it have a point? Explain.

It’s the same reason we will use informative titles in our future assignments, like the memo and proposal. The same reason we should use informative headers instead of generic headers in our proposal. Does the difference make sense? Questions about that?

Where else can we explain? Where else can we annotate?

How about directly on the graphic itself! Let’s see another example.

I’ve pulled this example from the source’s twitter feed as part of a Storytelling with Data Challenge. On the left, is the data encoding without explanations. We have no idea what these encodings show, right?

On the right, I’ve shown the title now, and it at least does explain a little about the patterns in the data, not just what the data is: the rise and fall, right? But more than that, we see annotations directly on the graphic. What do these mini paragraphs with lines pointing to the encodings on the graphic do?

Would you agree that it places the data into context, helps show or implies cause and effect?

Now, the graphic does one more thing we’ve discussed before, right? It uses color to tie concepts together. How does this graphic use color?

Now these ideas are not just theory in communication. Let’s hear from well-known practitioners in the field.

All the practitioner experts agree. Amanda Cox is the Data Editor at the New York Times. Let’s read what she explains together.

“The annotation layer is the most important thing we do.” “Most important.”

Another author and alumnus of Columbia University recently published a book that I recommend on starting data visualization, and in it he writes,

“Although the primary focus on creating a visualization is the graphic elements—bars, points, or lines—the text we include in and around our graphs is just as important.”

Let’s hear from one more voice. Shirley Wu is a very impressive visualization designer. Her work has appeared in high profile publications. In her recent book, which is also a wonderful reference, she writes,

“Annotations are of vital importance. Often overlooked, annotations are one of the best ways to make a chart understandable to an audience. Underutilized in many data visualizations, annotations are the ideal way to highlight exactly those things that you, as the creator, want the audience to pay attention to.”

So as we build graphics to explain, I’m going to encourage you to place equal importance on your annotations as on your data encodings! All the concepts in writing we’ve discussed remain valid for these annotations!

[SCROLL DOWN TO USE A CAPTION CONTEST!]

We’ve mentioned that layering graphics is part of the grammar of graphics. And layering explicitly including explanations on graphics, and using things like font sizing, boldness, and negative space to create hierarchies of information.

So in terms of layering information together as a data graphic,

1. we consider how to scale our x and y coordinates, then
1. we decide what attributes of the observation to place on those coordinates.
1. Then we consider all the other visual channels, individually or in combination, we can use to encode other data attributes.
1. We label anything important for our audience to understand, and we include mini-paragraphs — explainers — directly on the graphic.

Finally, we add a message for the title, we try graying our or lightening things we need for context but aren’t the direct point, and we use color or connection, for example, to link the graphic with text.

By the way, I’m giving you the code that I used to make each of these so that you can study them and become more familiar with how we can implement layering. More generally, you should definitely study all the code I give you as I write it very precisely so that you can learn from it.

Questions so far?

All this is to steer us away from common but terrible advise to dumb down your graphical encodings. Don’t!!

Let’s start with this example. It’s from the New York Times Opinion Section, for a general audience. This is not a basic bar graph, right? Now it’s out of context, but take a moment to think about what kind of data it seems to be encoding.

One variable are categories of disease, and within the category, it uses position along the x axis to show racial differences in mortality of the disease. It uses position along the y axis to show overall rate, or how large a problem it is. It shades area as the interaction of these two variables.

Now on the graphic itself, the New York Times included a title, labels, and a few explainers, right?

Do you see any explanations on this graphic of how to read it? And that’s not all the context, either, right? This graphic is also within an article, and that article also helps with context, helps explain the content to their external, general audience. In the upcoming weeks, we will be discussing how to combine graphics and narrative.

By the way, Newspapers do separate out their papers into sections. So it may be that to some extent they have different external audiences for different sections. This graphic, again, is in the opinion section.

Let’s consider, then, a graphic from another section.

I’ve pulled this graphic example the business section. From an article on the growth of e-commerce. What types of encodings do we see?

Not so simple as a bar, single line, or pie chart, right?

Again, this is for a general, external audience. We don’t need to dumb down our graphics or show less data; we need to explain the encodings after we optimize them for the purpose we want to show.

And here, what do the authors do to explain the graphic? Do they gray out some data encodings as background context? Do they label the data directly? Do they explain by directly annotating the graphic? How do they use color? Is it used for a particular purpose?

Again, this graphic gains even more context within the document that it lives in.

Notice, also, that it fits within an overall narrative. A narrative that gives perhaps a little surprise. Ecommerce is growing, BUT, still a small component. Why: the resolution: it’s less labor intensive. This is starting to look like a narrative arc, right?

This graphic, I’m showing from the Science section. I’m showing it actually within the article itself, or at least part of the article.

Now this encoding may look at first a little like a line chart. But it’s different. What is it encoding, and how?

As lines go, this one is pretty unfamiliar to some general audiences. What forms of explanation and annotation do you see?

You can read the article from the link I’ve shared to get more context. I’ve also included a second link because this type of chart was used to study audience engagement with unusual graphics. Would they run away or ignore it?

The researchers learned that general audiences were intrigued by the unusualness and complexity, and would engage when the authors provided clear explanations.

I hope this provides evidence of my point tonight. Which is what? [optimize encodings, then explain for your audience].

Awesome!

Now, my question is, what’s the point of this graphic in this one-page news release, this important communication on the US economy?

Do the encodings optimally explain the point? And, can we do better?

So here, I’m throwing down a challenge. I’d like to place you into groups. I’ll give you the starting data that I pulled from this release to get you going. And I’d like each of your groups to try to redesign it. Now how should you use your time in the groups? Here’s my suggestion. One of two approaches:

First, spend a few minutes just discussing what you think works well or does not work well in this graphic for whatever purpose you see it serving.

Second, and here’s where you can decide one of two ways. One way is each of you within the group individually try to encode the graphic in a different way than what you see. You can either use paper and pencil or R/GGplot or both.

Each of you spend, say, 5 minutes getting started individually. If you still don’t know either, there is no time like now to practice. Alternatively, you could have one member share the screen and you can all collectively build one together.

Third, start working together as a group, show each other where you are in your re-design, if you are stuck, whether anyone in your group can give guidance on how to solve your conceptual or tool issue. The point is to cross-share ideas and help each other. Be patient. Be helpful.

Fourth, discuss which approaches from your group you collectively want to contribute to an overall class document. And each group must at least contribute one collectively to the class. If you can’t decide on just one, you may submit two. Along with your redesign, write one or two sentences that say what you tried and why you think it did or did not help.

Does that make sense? These can also be mistakes where you tried something and think it didn’t work. Just label it as a mistake, and in class discussion, you can talk about why you tried it and why you think it didn’t work.

Once the groups have contributed to the google shared doc, we’ll come back as a class.

Remember, this is just practice, and it’s ok to feel like you messed up, or whatever. Sometimes the graphic mistakes can also be fun to look, think through, and learn from.

[GROUP WORK]

Excellent! I love your attempts and examples. And I hope this is helping you to start thinking critically about how encodings and explanations can either help or distract. How they impact our audience’s understanding.

Let me show you a few ideas I had, too. Let’s go through three of them together.

[SCROLL DOWN TO SEE 3 REDESIGNS]