When we visualise data, we present it in a form that is more engaging and easier to digest than plain numbers. Charts are the most common form of data visualisation and there are many different types to choose from. The charts that are most common are bar charts, scatterplots, horizontal bar charts, pie charts, histograms and line graphs.
The most effective use of these charts are those that are clear, simple and intentional.
- Clear: Charts are used to inform. If the chart is overly complicated or unfamiliar to the target audience then it is not fit for purpose.
- Simple: Good chart design lets the data speak for itself. The inclusion of unnecessary flair like 3D columns, shaded or outlined bars can all detract from you are trying to convey with the chart.
- Intentional: You should know what story you are trying to tell with your data and what you want your reader to take from it before embarking on the visualisation.
Good chart design should accentuate the data itself and the story you are trying to tell, unnecessary clutter, or overly complicated charts can often detract from the intended message.
Choosing Your Chart
Your choice of chart will be constrained the type of data you are working with, this excellent decision tree, created by 'from data to viz', lets you take a look at some of the most common data visualisations and their suitability for different data types.
Knowing your data type and the number of varibales that you are visualising can help you to chose the most effective chart. Data can fall into the following categories:
In addition to the type of data, your choice of chart will also influenced by the story you are telling with your data. Often, such stories fall into the following four categories:
- Comparison: These stories allow the reader to visually compare two or more points of data.
- Deviation: These charts provide a visual representation of the variation of the data about a standard reference point.
- Distribution: These provide a visual summary of the frequency which values occur within a variable or dataset.
- Relationship: These stories focus on visualising the relationship or correlation between two or more variables.
You may want to tell more than one story with a single dataset. To demonstrate each of the four stories in action we can use the data collected for the World Happiness Report 2019.
If we wanted to compare the average happiness of continents, we might use a bar chart. A horizontal bar chart is a good choice for this, the orientation allows you to label each element on the y axis efficiently, without rotation. For maximum clarity, bars are ordered prior to plotting.
We could also look at the deviation of each country in the Americas from the continent-wide average. A diverging bar chart is a good choice for this. In this instance the use of colour can accentuate positive and negative values.
If we wanted to look at the relationship between two variables which contribute to the happiness of a country we could use a scatter plot. The examples below plot each country’s Gross Domestic Product (GDP) per capita against the healthy life expectancy and the level of social support against the freedom to make life choices.
Finally, if we wanted to look at the distribution of global happiness we could use a histogram, histograms are one of the most common ways to represent the shape of a variable.