Overview
This lecture covers how to find and interpret the five-number summary and use it to identify potential outliers in skewed data sets using the concept of fences and the interquartile range (IQR).
Five-Number Summary
- The five-number summary includes: minimum, Q1 (first quartile), median, Q3 (third quartile), and maximum.
- To compute it, enter data into your calculator, use "Stat" > "Calc" > "1-VarStat," and scroll to find the five-number summary.
- Example: For quiz scores 28, 24, 27, 30, 19, 20, the five-number summary is 19, 20, 25.5, 28, 30.
Outliers and Fences
- Outliers are data values significantly outside the rest of the data.
- Z-scores and the empirical rule work for symmetric data; for skewed data or with outliers, use fences based on quartiles.
- The interquartile range (IQR) is calculated as Q3 - Q1.
- The lower fence is Q1 - 1.5 * IQR; values below this are potential outliers.
- The upper fence is Q3 + 1.5 * IQR; values above this are potential outliers.
Identifying Outliers: Example
- Given Q1 = 1.61, Q3 = 2.68, minimum = 1.01, maximum = 6.81, median = 2.65.
- IQR = 2.68 - 1.61 = 1.07.
- Lower fence: 1.61 - 1.5 * 1.07 = 0.005; no data values fall below this, so no lower outliers.
- Upper fence: 2.68 + 1.5 * 1.07 = 4.285; values 5.22 and 6.81 are above this, so they are outliers.
Key Terms & Definitions
- Five-number summary — Set of five descriptive statistics: minimum, Q1, median, Q3, maximum.
- Quartile (Q1/Q3) — Values splitting data into quarters; Q1 is 25th percentile, Q3 is 75th percentile.
- Median — Middle value of an ordered data set.
- Interquartile range (IQR) — Difference between Q3 and Q1, measuring data spread.
- Fence — Boundaries (Q1 - 1.5IQR and Q3 + 1.5IQR) used to identify outliers.
- Outlier — Data value outside the fences, considered unusually high or low.
Action Items / Next Steps
- Practice finding the five-number summary and fences using your calculator for different data sets.
- Identify any outliers using the lower and upper fence methods.
- Review section 3.5 for more details on outlier detection.