Data Storytelling
How to Choose the Right Chart or Graph for Your Data
“The greatest value of a picture is when it forces us to notice what we never expected to see.” — John Tukey
When business analysts and data scientists aren’t busy slicings, dicing, and modeling a huge amount of knowledge, they’re busy trying to effectively communicate their findings to decision-makers. — Abeer Aulakh
And that last part is the hardest!
In order to effectively communicate relevant information extracted from data, there has got to be an ideal combination of the proper charts and therefore the right story. This is where most analysts get it wrong. Either there’s an excessive amount of data within the presentation to return to any conclusion, or the findings are misaligned from the business.
This two-part series will explore different charts and graphs that analysts can use, and how they can develop storytelling as a skill.
In this part, we’ll specialize in charts which will be used for analyzing and presenting data. Let’s take a glance at what charts you’ll use to present your findings in an efficient way
Column
One of the most common types, column charts are used to show a comparison of items over a period of time or even a comparison among different items.
When creating column charts:
• Add color to bars to increase the impact. Overlying colors provide immediate insight and allow the viewer to quickly compare information
• Use these charts only when you have data that can be split into different categories
Bar
A bar chart is simply a horizontal column chart that should be used when the data label is long or you have a lot of items to compare.
When creating bar charts, follow the same tips as mentioned for column charts.
Line
Line charts, along with bar charts, are one of the most frequently used charts. They help connect individual data points and reveal trends over time.
When creating line charts:
• Use only solid lines
Dual Axis
A dual-axis chart contains a shared X-axis and two Y-axes. These charts are useful when you want to visualize a correlation between three data sets.
When creating dual-axis charts:
• Primary variable should be on the left side because we are naturally inclined to look to the left
• Use contrasting colors for the data sets
Pie
Pie charts are used to show the numeric proportion of information, which is represented in percentage. The sum of all segments in a pie chart equals 100%.
When creating pie charts:
• Use them only to show proportions
• Make sure that the slice value adds up to 100%
• Limit the number of slices to single digits. If you have more proportions, consider using a bar chart instead
Map
When you have location data, maps are a great way to chart out that data. Maps can be used to show geocoded data like the number of accidents by zip code, sales, and export-import by country, etc.
When creating maps:
• Use maps as a filter by combining them with other relevant data and then use it to drill down further into your data
• Layer maps with bubble charts to show the concentration of data. This helps measure and interpret the geographical impact of different data points
Scatter Plot
Scatter plots are a great way to uncover how different pieces of information relate to each other. They can also be used to show distribution trends, and are effective when looking for outliers.
When creating scatter plots:
• Add a trend line to make the correlation in your data stronger, but limit the number to two so that the data is easy to understand
• Add filters so that you can drill down into your data
Area
Simply put, an area chart is a line chart with the space between the x-axis and the line filled with a color. They are helpful for analyzing overall and individual trends.
When creating area charts:
• Make your charts easy to read by not displaying more than four categories
• Use transparent color so that the information isn’t cluttered
Bubble
Bubble charts are similar to scatter plots but contain a third data series which is represented by the size of the bubbles. If your data set contains three data series, consider using a bubble chart instead of a scatter plot.
When creating bubble charts:
• Use circular shapes
• Size bubbles according to area
Stacked Bar
Stacked bar charts are great when you have a number of different items that you want to compare.
When creating stacked bar charts:
• Use contrasting colors for clarity
Heat Map
When you want to compare data across two categories, a heat map is an excellent choice. It lets you compare data using color and lets you see the strongest and weakest categories in your data.
When creating heat maps:
• Vary the size of squares to show the third element, like the size of bubbles in a bubble chart
• Use a single color and saturate/desaturate to show changes
Bullet
Bullet charts are great when you have to track progress towards a goal and compare a primary measure to another one.
When creating bullet charts:
• Highlight the progress of your data by using contrasting colors
• Combine bullets with other charts in your dashboard to show progress
Histogram
Histogram charts are used when you have to see how your data is distributed across groups. By grouping your data and plotting it on vertical stacked bar charts, you can see the distribution according to different categories.
When creating histogram charts:
• To make sense of different groupings, create a variety of histograms to determine the most useful groupings
• let users drill down into different categories and explore the data with a filter
Classifying charts to facilitate better chart selection
The choice of charts is a function of the questions you’re attempting to answer, the data type, number of variables involved, variable type, the real-estate available, and several other reasons. However, if you intend to show relationships, distribution, comparison or composition here’s a simplified representation of chart types mapped to various conditions.