

Data Viz Considerations
Grayson White
Math 241
Week 1 | Spring 2026
Day 1 Lecture
Day 2 Lecture
Develop language to talk about the components of a graphic
Discuss considerations for good graphical design
Graphics consideration activity
PSET 0 assigned
The Grammar of Graphics

Leland Wilkinson wrote a book called “The Grammar of Graphics”

As part of his PhD in Statistics at Iowa State, Hadley Wickham wrote the R package ggplot2, which we will use to create static graphs.
data: dataset that contains the data
geom: geometric shape that the data are mapped to
aesthetic: visual properties of the geom
coord: coordinate system
scale: controls how data are mapped to the visual values of the aesthetic
guide: legend to help user convert visual display back to the data


Let’s practice deconstructing this graph using the grammar of graphics.

Let’s practice deconstructing this graph using the grammar of graphics.

For most data, there won’t be just one way to graph it. Decisions to be made:
Let’s discuss some considerations that can help guide these decisions. But…
“Data visualization is part art and part science. The challenge is to get the art right without getting the science wrong and vice versa.” – Claus Wilke
Recommendation: Try out different options and make sure to iterate!














Some aesthetics are ordinal. Some are not. Some can be both!

Our ability to perceive differences varies by aesthetic!


Generally, humans are good at making comparisons involving:
Generally, humans are NOT good at making comparisons involving:
Principle of proportional ink: The size of shaded areas need to be proportional to the data values they represent.

Principle of proportional ink: The size of shaded areas need to be proportional to the data values they represent.

Difficult to respect with spatial data. Why?

Instead of using geographic boundaries, pick a standardized shape and place “near” geographic location.

Data-ink ratio: “proportion of a graphic’s ink devoted to the non-redundant display of data-information.” – Edward Tufte


Credit: Mine Çetinkaya-Rundel
Consider including:
What to add depends greatly on the research question or the story you are telling!

Consider including:
Context should add both memorability and clarity.

Faceting is a great way to add another variable without over-complicating your graphic.
But only add additional variables that are useful to the story!

Over-plotting is very common in the Age of Big Data!
Example from my own work with the US Forest Inventory and Analysis Program




geom.Pros and cons of different approaches?
Which graph makes it easy to conclude that the ruling coalition (FDP + SPD) have a majority?

Which graph makes it easy to see how a company’s market share changes over time? (Warning: Fake data.)

Not all R color palettes have been vetted for color blindness.

Not all R color palettes have been vetted for color blindness.



Use white space to help separate elements.

Use a large enough font size!

15:00