πŸ“Š

Frequency Distribution Overview

Sep 4, 2025

Overview

This lecture covers how to organize and summarize collected data using frequency distributions, including definitions, step-by-step construction, and how to represent these distributions in graphs.

Frequency Distributions

  • A frequency distribution organizes data into groups (classes) and counts (frequencies).
  • Classes are groups of data values; frequencies are the counts for each class.
  • Example: Hair color grouped as blonde, brown, black with counts for each.
  • Frequency distribution shows trends and patterns not easily seen in raw data.

Constructing a Frequency Distribution

  • Determine the number of classes (groups) appropriate for the data set.
  • Calculate class width: (maximum value - minimum value) Γ· number of classes, then round up to the nearest whole number.
  • Choose a starting point (usually the minimum data value) for the first lower class limit.
  • Create classes by adding the class width to each lower class limit.
  • Upper class limit is one less than the next lower class limit.
  • Always round class width up to ensure all data is covered.

Key Components of Frequency Distributions

  • Lower class limit: Smallest value in a class.
  • Upper class limit: Largest value in a class.
  • Class midpoint: (Lower class limit + Upper class limit) Γ· 2.
  • Class boundary: Value between an upper class limit and the next lower class limit (used for histograms).

Relative and Cumulative Frequency Distributions

  • Relative frequency = class frequency Γ· total number of data items (often shown as a percentage).
  • Cumulative frequency adds up frequencies as you move through the classes, showing the running total.

Graphical Representations

  • Histograms are bar charts with touching bars, displaying frequencies or relative frequencies for each class.
  • Use class midpoints or boundaries for the horizontal axis, frequencies for the vertical.
  • Relative frequency histograms swap counts for percentages.
  • Cumulative frequency histograms plot cumulative totals.
  • Normal distribution has a central peak with data rising and then falling symmetrically; not all data sets are normal.

Key Terms & Definitions

  • Frequency Distribution β€” Table showing classes of data and corresponding counts.
  • Class β€” Group or range within a data set.
  • Class Width β€” Difference between two lower class limits.
  • Lower Class Limit β€” Smallest value in a class.
  • Upper Class Limit β€” Largest value in a class.
  • Class Midpoint β€” Average of a class’s lower and upper limits.
  • Class Boundary β€” Value between upper limit of one class and lower limit of the next, used to prevent gaps in histograms.
  • Relative Frequency β€” Proportion of data within a class (as a percentage).
  • Cumulative Frequency β€” Running total of frequencies up to a given class.
  • Histogram β€” Bar graph with adjoining bars representing frequency distributions.
  • Normal Distribution β€” Data distribution with a central peak and symmetrical rise and fall.

Action Items / Next Steps

  • Complete Section 1.5, page 34–35, problems 1, 2, and 5–25 by Monday.
  • Review how to construct and interpret frequency, relative frequency, and cumulative frequency distributions.
  • Practice making histograms from sample data using both class midpoints and boundaries.