Hypothesis Testing for Population Means

Sep 17, 2024

Lecture Notes: Testing Hypotheses About a Single Mean

Key Concepts

Objective: Make an inference about a population mean ( \mu ) given a sample.
Comparison: Population mean ( \mu ) vs. hypothesized mean ( \mu_0 ).
Assumptions: Continuous variable ( x ) is normally distributed with mean ( \mu ) and variance ( \sigma^2 ).

Test Types

1. When Population Variance is Known

Test Used: Z-test

2. When Population Variance is Unknown

Test Used: T-test
Common Scenario: More frequent as population variance is rarely known.

Hypotheses Forms

Two-sided:
- Null Hypothesis (( H_0 )): ( \mu = \mu_0 )
- Alternative Hypothesis (( H_a )): ( \mu \neq \mu_0 )
One-sided:
- If testing ( \mu < \mu_0 ), ( H_0: \mu \geq \mu_0 )
- If testing ( \mu > \mu_0 ), ( H_0: \mu \leq \mu_0 )
- Note: Null hypothesis is opposite in direction from the alternative.

Test Statistic: Z-Test

Formula: [ z = \frac{\bar{x} - \mu_0}{\frac{\sigma}{\sqrt{n}}} ]
- ( \bar{x} ): Sample mean
- ( \mu_0 ): Theorized value
- ( \sigma ): Standard deviation (known)
- ( n ): Sample size
Interpretation: Z-statistic follows the standard normal distribution under null hypothesis.

Example: IQ of LA Residents

Objective: Check if mean IQ differs from national average of 100.
Known Variance: 225 (standard deviation = 15)
Sample: 7 LA residents
Result:
- Sample mean ( = 99.5 )
- Z-statistic ( = -0.07559 )
- P-value ( = 0.93974 )
- Conclusion: Do not reject the null hypothesis.

Conclusion Statements

Components:
- Restate the test's purpose.
- Report sample statistics.
- Indicate statistical significance.
- Specify directionality.
- Report the comparator and p-value.

Example: Coin Flips by PM510 Students

Objective: Compare students' streaks of heads/tails to theoretical expectations.
Population Mean and SD: 6.977432 and 1.7926
Sample Mean: 5.112903
Result:
- Z-statistic ( = -8.189956 )
- P-value < 0.00001
- Conclusion: Reject null hypothesis.

T-statistic

Used when: Population variance unknown.
Formula: Similar to Z, but uses sample variance.
Distribution: Follows Student's t-distribution with ( n-1 ) degrees of freedom.

Example: CD4 Levels in HIV Patients

Objective: Check if mean differs from 400 (cutoff)
Sample: 10 patients
Result:
- Sample mean ( = 305.5 )
- T-statistic ( = -2.98835 )
- P-value ( = 0.01524 )
- Conclusion: Mean is statistically different from 400.

Example: White Blood Cell Count in Leukemia

Objective: Check if mean > 7500 (cutoff)
Challenge: Data not normally distributed.
Solution: Log transformation
Result:
- Sample mean (log transformed) ( = 0.6765 )
- T-statistic ( = -1.753 )
- Adjust for direction with one-sided test: P-value ( = 0.9565 )
- Conclusion: Do not reject null hypothesis.

Important Note on One-sided Tests

SPSS Caution: SPSS output for one-sided tests can be misleading.
Approach: Use two-sided p-value as a starting point and adjust based on alignment of observed vs. hypothesized means.

Summary

Z-test: For known variance, compare ( \mu ) and ( \mu_0 ).
T-test: For unknown variance, uses sample data to estimate.
P-value Significance: Helps conclude whether to accept or reject the null hypothesis.

Full transcript