Coconote
AI notes
AI voice & video notes
Try for free
🌍
Hypothesis Testing in Plastic Pollution Study
Nov 18, 2024
Chapter 13: Case Study on Plastic Pollution and Hypothesis Testing
Overview
Focus on a case study regarding plastic pollution.
Introduction to hypothesis testing for sample means.
One-sample t-test: testing if a population mean equals a known constant.
Two-sample t-test: testing if two populations have the same sample mean.
Problem 1: Corporations and Plastic Pollution
Discussion on which corporations contribute the most to plastic pollution.
Hypothesis Testing and T Distribution
Review of the T distribution in hypothesis testing.
Comparison to hypothesis testing of proportions.
Introduction of null and alternate hypotheses.
Identification of assumptions for hypothesis testing.
Data Set on Plastics
Data set generated by an international volunteer effort.
Break Free from Plastic
initiative: volunteer-based brand audits.
In 2020, 15,000 volunteers from 55 countries conducted audits.
Top polluters named: Coca-Cola, PepsiCo, Nestle, Unilever, Mondelez International.
Understanding Plastic Types
Video link and data set links provided in course module.
Types of plastics: PET, PP, PS, PVC.
Recycling Codes
on plastics indicate recyclability.
Challenge: Not all plastics are recyclable multiple times.
Analyzing the Data Set
Different types of plastics listed in the data set.
Categories such as HDPE and LDPE mentioned.
Importance of understanding data headings and categories.
Dashboard Analysis
Use of a dashboard to analyze data quickly.
Selection of a country for deeper analysis.
Example with the United States:
Total plastics recorded in 2019 and 2020.
Identification of top polluters.
Problem Solving and Data Manipulation
Steps provided for deeper data analysis, if time permits.
Pre-prepared data set focusing on Coca-Cola.
Research Question: Coca-Cola's Plastic Pollution
Coca-Cola claims an average of 275 items per country.
Research question: Is the actual average different from 275?
Hypothesis Testing Steps
Part A:
Testing a mean (average) not a proportion.
Part B:
Use the T distribution for hypothesis testing.
T Distribution and Conditions
Understanding T distribution: many T distributions exist based on sample size.
Conditions for one-sample t-test:
Random sample assumption.
Sample size threshold (n ≥ 30) or normally distributed data.
Visualization
Creation of a histogram to visualize the data.
Description of the data's shape and spread.
Noted skewness and outliers in the data.
Verifying Conditions
Verification of random sampling and sample size adequacy.
Writing Hypotheses
Null and alternate hypotheses for the research question:
Null Hypothesis (H0): μ = 275
Alternate Hypothesis (HA): μ ≠ 275
Definition of μ as the average number of plastic items per country from Coca-Cola.
Two-Sample Hypothesis Example
New research question comparing 2019 and 2020 data.
Determination of a two-sample t-test.
Hypotheses involve μ1 and μ2 for different years.
μ1 for 2019 and μ2 for 2020.
Conclusion
Overview of the statistical approach to addressing real-world issues like plastic pollution.
Encouragement to use reputable sources and understand data collection methods.
📄
Full transcript