Skip to main content
Physics LibreTexts

2.7.1: Graphing Data

  • Page ID
    97198
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)
    Learning Objectives
    • Read and interpret data from tables and pictographs.
    • Read and interpret data from bar graphs and histograms.

    Introduction

    A nurse is collecting blood type data from her patients. When a new patient is checked in, the nurse does a simple finger-prick test to see whether the patient’s blood is type A, B, AB, or O. (These are the four possible blood types. Each one also carries a + or - to represent the RH factor, but for our purposes, let’s just track the type, not the + or -.) She tracks her results by creating a two-column table with the patient’s name and blood type.

    Name Blood Type
    Dominique A
    Ilya O
    Raul AB
    Madison O
    Philip AB
    Samuel B
    Josefine O
    Brett O
    Paula B
    Leticia AB

    The information in this table is an example of data, or information. In this case, the nurse has gathered a fair amount of data about her patients’ blood types. By analyzing the data, she can learn more about the range of patients that she serves.

    Data helps us make many kinds of decisions. Organizing data into graphs can help us get a clear picture of a situation and can often help us make decisions based on the picture. So how do you take data and make a picture out of it? Let’s take a look.

    Pictures of Data

    Let’s return to the data set used previously. If the nurse wanted to represent the data visually, she could use a pictograph. Pictographs represent data using images. This visual presentation helps illustrate that for the data in her table, Type O blood is the most common and Type A blood is the least common.

    Screen Shot 2021-05-17 at 2.19.55 PM.png

    Interested by the results of this small survey, the nurse continues to document the blood types of her patients until she has surveyed 100 people. She puts all of this data in a table, but she finds that it is hard for her to quickly identify what the data is telling her. She decides to make another pictograph using a different scale.

    Screen Shot 2021-05-17 at 2.20.39 PM.png

    To read this pictograph, all you need is the scale, which, in this case, is the number of people that each blood drop symbol represents. In this graph, each blood drop represents 5 people. There are six drops next to Type A, so \(\ 5 \cdot 6=30\) people had Type A blood. The table below shows the rest of the information.

    Blood Type Number of People
    Type A \(\ 6 \text { drops } \cdot 5 \text { people }=30 \text { people }\)
    Type B \(\ 5 \text { drops } \cdot 5 \text { people }=25 \text { people }\)
    Type AB \(\ 2 \text { drops } \cdot 5 \text { people }=10 \text { people }\)
    Type O \(\ 7 \text { drops } \cdot 5 \text { people }=35 \text { people }\)
    Example
    The pictograph below shows the number of medals earned at an international competition. How many medals did Japan earn?
    Country Medals
    Japan MMMMM
    Argentina M
    Germany MMMMMM
    Egypt MMMM
      M = 4 medals
    Solution
      Look at the scale of the pictograph. Each M represents 4 medals.
    \(\ 5 \cdot 4=20\) Japan has 5M symbols, so the total number of medals is \(\ 5 \cdot 4=20\).

    Japan earned 20 medals.

    Exercise

    Which table accurately represents the data shown in the pictograph below?

    Employee Hourly wage
    Wayne $ $ $ $ $
    Sarah $ $ $ $ $ $ $
    Leigh $ $ $ $
      $ = $4
    1. Employee Hourly wage
      Wayne $5
      Sarah $7
      Leigh $4
    2. Employee Hourly wage
      Wayne $20
      Sarah $22
      Leigh $19
    3. Employee Hourly wage
      Wayne $10
      Sarah $14
      Leigh $8
    4. Employee Hourly wage
      Wayne $20
      Sarah $28
      Leigh $16
    Answer
    1. You identified the correct number of symbols, but it looks like you forgot to factor in the scale. Each dollar sign represents $4, so multiply the number of dollar signs in a row by 4. The correct answer is Table D.
    2. You correctly identified that Wayne earns $20, since the table indicates that he has 5 dollar signs, and each dollar sign represents $4. Sarah’s row shows that she earns seven dollar sign symbols, so she earns $28, not $22. The correct answer is Table D.
    3. You identified the correct number of symbols, but it looks like you used the wrong scale. Each dollar sign represents $4, not $2. Multiply the number of dollar signs in a row by 4. The correct answer is Table D.
    4. Each dollar sign symbol represents $4, so if you multiply the number of dollar signs in a row by $4, you will find that Wayne earns $20, Sarah earns $28, and Leigh earns $16.

    Bar Graphs

    Representing data as pictures doesn’t always make sense either. Bar graphs are an alternative (and popular) way to represent data sets, especially those with large amounts of data or which do not lend themselves well to individual symbols. In a bar graph, the number of items in a data category is represented by the height or length of bars.

    As when reading pictographs, paying attention to the scale is essential: small differences in the height of two bars can sometimes represent thousands of dollars, for example!

    Let’s consider an example. Here is some information about the average life span of five animals in the wild, presented in a table.

    Source: National Geographic, accessed July 2011
    Animal Average Life Span in the Wild (Years)
    Zebra 25
    Giant Panda 20
    Cheetah 11
    Baboon 30
    African Elephant 70

    This data is fine in a table, but presenting it as a bar graph helps the viewer compare the different life spans more easily. Look at the bar graph below. In this example, the animals are listed on the left side of the graph (also called the y-axis), and the life span in years is listed on the bottom (the x-axis). The graph shows the information by the length of the bar associated with each animal name.

    Screen Shot 2021-05-17 at 3.06.13 PM.png

    Bar graphs are generally used to compare quantities, not to determine exact quantities, especially when the scale is large, as in the next graph.

    The bar graph below shows total revenue for four fictional companies in 2011. Notice that the scale, on the y-axis, is very large; each horizontal line represents an increase of $500,000. For this reason, it is difficult to tell exactly how much money each company made in 2011. However, comparing the bars is straightforward. Glancing at the data, you can tell that Lumbertown earned the most (a little over $2,500,000), while Machine Systems earned the least (about one million less, at just over $1,500,000).

    Screen Shot 2021-05-17 at 3.12.55 PM.png

    You can also use bar graphs to showcase multiple pieces of information about a specific situation. For example, let’s show the next year’s projected revenue for each company on the graph that you just looked at. You can leave the existing bars in the graph and just add four more.

    The blue columns remain, but now they are accompanied by four new red columns that represent the projected revenue for these companies. Again, this data could be expressed in a table. With a bar graph, you gain ease of quick comparison, but lose the detail of the exact values. Looking at this graph tells you that while Lumbertown has the highest revenue for 2011, it is projected to decrease. Conversely, Machine Systems is projected to increase its revenue. Seeing data visually can help you understand the story that the data is telling about a situation.

    Screen Shot 2021-05-17 at 3.13.51 PM.png

    Example
    Use the graph to list the drinks from the most number of calories to the least number of calories (serving size: 16 ounces).

    Screen Shot 2021-05-17 at 3.16.22 PM.png

    Solution

      The y-axis shows total calories, and the x-axis shows the drink. The taller the bar, the more calories the drink has.
    \(\ \text { Juice Blend } \approx 275\) The juice blend contains over 250 calories, so it has the most calories per serving.

    \(\ 2 \% \text { Milk } \approx 245\)

    \(\ \text { Soda } \approx 230\)

    Soda and 2% milk are both between 200 and 250 calories, but the bar for 2% milk is taller, so it must contain more calories.
    \(\ \text { Sports Drink } \approx 125\) Sports drink has the shortest bar; it contains about 125 calories.

    From most to least number of calories per serving:

    Juice Blend, 2% Milk, Soda, Sports Drink

    Example
    Based on the graph below, which player’s rebounding increased the most from 2009 to 2010?

    Screen Shot 2021-05-17 at 3.34.06 PM.png

    Source: WNBA.com, accessed July 2011

    Solution
    Jackson and Brunson

    The y-axis shows rebounds per game, and the x-axis shows the player’s name. A taller bar represents more rebounds per game by the player.

    This graph shows two sets of data: one for 2009, in blue, and one for 2010, in red. To compare the data from one year to the next, compare the heights of the two bars for each player.

    Two players had higher rebound averages in 2010 than they did in 2009. This is indicated by the red bar being taller than the blue bar. The other players’ red bars are shorter, so their rebounds decreased.

    Comparing the sizes of the increases, you can tell that Brunson increased her per game rebounding more than Jackson did.

    The player whose rebounding increased the most from 2009 to 2010 was Brunson.

    Histograms

    The data sets you have looked at so far have shown categorical data. Categorical data is data that can be separated into categories. For instance, data about eye color (brown, blue, green), blood type (O, A, B, AB), and the type of computer you use at work or home (PC or Mac) are all categorical.

    Quantitative data is sometimes called numerical data, because it is data that is represented by numbers. Quantitative data sets consist of quantities such as age (1, 2, 23, 34, 77...), test scores (90, 95, 100, 72...), and height (55 inches, 50 inches, 68 inches, ...). Notice that all of the data here are numbers.

    Continuous quantitative data can be graphed using a histogram. A histogram resembles a bar graph, but instead of having categories along the axis, it has numbers listed in order and usually grouped in intervals (such as 0, to 10, 11 to 20, and so on). While the bars in bar graphs can have space between the bars, the bars of a histogram are usually touching, as the data is continuous.

    Let’s look at how you could display the set of test score data listed below.

    Name Score (0 to 100)
    Alex 81
    Beatriz 73
    Celia 79
    Donnie 91
    Erykah 87
    Fred 79
    Gigi 81
    Helene 84
    Irma 88
    Joelle 96

    Looking at the scores, you can tell that some people scored in the 70s, some scored in the 80s, and some scored in the 90s. You can organize the data in a histogram with the continuous data from 0 to 100, by using the intervals to 71 to 80, 81 to 90, and 91 to 100.

    Screen Shot 2021-05-17 at 4.20.54 PM.png

    There are only three intervals for this histogram and ten data points. For a histogram to be meaningful, it should include 100 data points or more and 7 intervals or more. (For this reason, many histograms are created using a variety of technical tools.) The histogram then takes on interesting shapes that can provide a lot of information.

    Even with only 10 data points, you find a much more meaningful histogram by using smaller intervals (such as 71 to 75, 76 to 80, 81 to 85, 86 to 90, 91 to 95, and 96 to 100).

    Screen Shot 2021-05-17 at 4.24.57 PM.png

    This second graph shows a bit of a curve to the data. Nobody scored 70 or lower, and the data peaks in the 81 to 85 range. Increasing the number of intervals has an effect on the shape of the graph, and helps us see some trends that are in the data.

    Example
    Based on the histogram below, how many students are between 151 and 180 centimeters tall?

    Screen Shot 2021-05-17 at 7.51.25 PM.png

    Solution

    Number of students in each interval:  
    \(\ \begin{array}{r}
    151 \text{ to } 155,\ 1\\
    156 \text{ to }160,\ 2\\
    161 \text{ to } 165,\ 6\\
    166 \text{ to } 170,\ 2\\
    171 \text{ to }175,\ 0\\
    176 \text{ to }180,\ 0
    \end{array}\)
    Each interval in this histogram is 5 centimeters.
    \(\ 1+2+6+2+0+0=11\) Add the number of students.

    11 students are between 151 and 180 centimeters tall.

    Exercise

    A teacher made this histogram to track scores on a recent math test. How many students scored in the 91 to 100 range?

    Screen Shot 2021-05-17 at 8.00.07 PM.png

    1. 3
    2. 6
    3. 9
    4. 22
    Answer
    1. Incorrect. Three students scored between 96 to 100 on the test, but if the teacher wants to find out how many students scored between 91 to 100, then she also has to consider the number of students who scored between 91 to 95. The correct answer is 9.
    2. Incorrect. Six students scored between 91 to 95 on the test, but if the teacher wants to find out how many students scored between 91 to 100, then she also has to consider the number of students who scored between 96 to 100. The correct answer is 9.
    3. Correct. Six students scored between 91 to 95 on the test, and 3 students scored between 96 to 100. The total number of students who scored between 91 to 100 is 9.
    4. Incorrect. It looks like you calculated the total number of students who took the test. To find the number who scored between 91 to 100, look at the intervals for 91 to 95 and 96 to 100. The correct answer is 9.

    Summary

    Data is mathematical information. Mathematical data is often recorded in tables to organize, or spreadsheets to organize and sort. Graphs can help you see the data visually, which can help you to better understand the data. A pictograph is a graph that uses symbols to represent data. Bar graphs show the frequency of categorical data, using bars instead of symbols. Histograms are similar to bar graphs in that they are constructed out of a series of bars. However, histograms represent continuous quantitative (numerical) data, which can illustrate trends as well as other more advanced attributes.


    This page titled 2.7.1: Graphing Data is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by The NROC Project via source content that was edited to the style and standards of the LibreTexts platform.