๐Ÿ“Š

Expected Counts in Contingency Tables

Aug 13, 2025

Overview

This lecture explains how to calculate and interpret expected counts in contingency tables, particularly in the context of a chi-square test for independence using an experiment on music and memory retention.

Experimental Design and Observed Counts

  • Students attempted to memorize information under different music conditions: liked music, disliked music, or no music.
  • Confounding variables (like music volume) were controlled, and students were randomly assigned to groups.
  • Students were classified as having "high retention" or "low retention" based on their memory performance.
  • The observed counts for each group were provided (e.g., 10, 14, 11, 15, 18, 7).

Calculating Expected Counts

  • If the null hypothesis (music does not affect retention) is true, retention rates should be similar across all groups.
  • Calculate overall high retention rate: 39/75 = 0.52 (52%); low retention: 36/75 = 0.48 (48%).
  • Expected count for each cell = percentage (from the whole group) ร— group size (e.g., 0.52 ร— 24 = 12.48 for high retention, liked music group).
  • Repeat this for each combination of group and retention type.

The Expected Count Formula

  • Alternative formula: (row total ร— column total) รท grand total.
  • This formula accounts for different group sizes and matches the previous method when computed.
  • Used in most statistical software for computing expected counts in contingency tables.

Computing the Chi-Square Statistic

  • For each cell: (observed count โ€“ expected count)ยฒ รท expected count.
  • Sum these values for all cells to get the chi-square test statistic.
  • Ensure observed and expected counts are matched correctly for each cell.

Table Structure and Degrees of Freedom

  • The example uses a 2ร—3 contingency table (two retention levels, three music groups, not counting totals).
  • Degrees of freedom = (number of rows โ€“ 1) ร— (number of columns โ€“ 1) = 2โ€“1 ร— 3โ€“1 = 2.

Key Terms & Definitions

  • Observed Count โ€” The actual number of cases counted in each category of the contingency table.
  • Expected Count โ€” The number of cases expected in each category if the null hypothesis is true.
  • Null Hypothesis โ€” The assumption that variables (e.g., music and retention) are independent.
  • Chi-Square Statistic โ€” A sum of squared differences between observed and expected counts, divided by expected count.
  • Degrees of Freedom โ€” (Rows โ€“ 1) ร— (Columns โ€“ 1), used in significance testing.

Action Items / Next Steps

  • Practice calculating expected counts and chi-square statistics using sample tables.
  • Review how to determine degrees of freedom for contingency tables.
  • Prepare for problems applying these concepts to different experimental designs.