📊

Applied Statistics: Data Process and Representation

Mar 12, 2025

Applied Videos - Stats Year 1: Data Collection, Processing, and Representation

Overview

  • Covers first-year applied statistics: chapters 1, 2, and 3.
  • Focus on data collecting, processing, and representing.
  • Combined chapters due to overlap and common exam questions.

Chapter 1: Data Collection

Key Terminologies

  • Population: Entire set of items; items are sampling units.
  • Census: Measures every member of a population; accurate but costly and time-consuming.
    • Disadvantage: Testing can destroy the item.
  • Sampling Frame: List of sampling units; not always available.

Sampling Methods

Random Sampling

  • Simple Random Sampling: Equal chance for each unit; requires sampling frame.
  • Systematic Sampling: Selects every Kth unit; requires sampling frame.
  • Stratified Sampling: Reflects population groups; needs clear strata and sampling frame.

Non-Random Sampling

  • Opportunity Sampling: Based on availability; easy but may not be representative.
  • Quota Sampling: Fills quotas through availability; not necessarily representative.

Data Types

  • Qualitative Data: Non-numerical (e.g., colors, words).
  • Quantitative Data: Numerical data.
    • Discrete: Can take certain values (e.g., shoe sizes).
    • Continuous: Can take any value within a range (e.g., foot length).

Exam Questions Examples

  • Identifying sampling methods based on scenarios.
  • Application of sampling techniques considering the presence of a sampling frame.

Chapter 2: Data Processing

Measures of Location

  • Median: Position is number of terms divided by two.
  • Quartiles: Q1 (lower), Q3 (upper); position determined by dividing data.
  • Percentiles: Calculated similarly to quartiles.
  • Mean: Sum of values divided by number of values.

Measures of Spread

  • Variance and Standard Deviation: Measures spread of data.
    • Formulas: Mean of squares minus square of the mean.

Coding

  • Adjusting mean and standard deviation by transforming data.

Example Problem

  • Use of cumulative frequency and linear interpolation for finding measures like the interquartile range and percentiles.

Chapter 3: Data Representation

Focus on Histograms

  • Histograms: Area is proportional to frequency; scaling factor involved.
  • Integration with Curves: Finding area under a frequency polygon or curve equals frequency.

Practical Application

  • Completing histograms with given data.
  • Use of calculators for statistical calculations (mean, standard deviation).

Summary Tips

  • Understand how data collection concepts integrate with data processing and representation.
  • Practice exam questions to reinforce learning and application skills.
  • Use technology, like calculators, to streamline calculations and ensure accuracy.