Standard Deviation (s): Square root of the variance, representing the average distance from the mean.
Example: Given data: 120, 80, 90, 110, 95.
Mean (X̄) = 99 mmHg.
Variance (s²) = 255 mmHg².
Standard Deviation (s) = √255 ≈ 16 mmHg.
Conclusion: Larger s indicates more spread in data.
Population vs. Sample
Sample: Subset of a population used to estimate population parameters.
Population: Entire group of interest.
Sample Statistics: Estimates of population parameters (e.g., mean, standard deviation).
Population Parameters: True values of interest (e.g., population mean (µ), population standard deviation (σ)).
Representation: Simple random sampling ensures every subset is equally likely to be chosen.
Importance of Sample Size
Effect on Estimates: Larger samples provide better estimates of population parameters.
Sample Statistics: Mean and standard deviation depend on sample size for accuracy.
Predictability: Larger samples do not predict direction (increase or decrease) of sample estimates but provide a better approximation to the population parameters.
Degrees of Freedom
Explanation: The number of values that can vary in a dataset after a constraint (e.g., mean) is imposed.
Example: If sample mean is known and n readings are provided, only n-1 values are free to vary.
Division by n-1: Corrects bias in estimate of population variance from a sample (Bessel's correction).
Range vs. Standard Deviation
Range: Difference between maximum and minimum values.
Limitation: Range increases with sample size due to the likelihood of extreme values.
Standard Deviation: Provides a consistent measure of variability regardless of sample size.
Conclusion
Summary Measures: Mean, median, and standard deviation are key summaries but do not alone tell the whole story of continuous data.
Next Steps: Explore visualization of continuous data to understand the overall distribution and shape.