Regression Analysis: A statistical technique to evaluate the relationship between a continuous dependent variable (outcome) and one or more independent variables (predictors).
Main Objectives:
Evaluate the overall model fit (like the F test in ANOVA).
Measure the unique effect of each predictor on the outcome (beta coefficient).
Types of Statistical Methods in Regression
General Linear Model:
Assumes normality of residuals.
Requires a continuous dependent variable.
Includes analyses like ANOVAs, ANCOVAs, MANOVAs, MANCOVAs.
Generalized Linear Model:
For non-continuous data (nominal and ordinal).
Includes binary and multinomial logistic regression.
Uses a link function to transform the dependent variable for linear modeling.
Choosing a Regression Model
Dependent Variable Type:
Scale variable: Use Linear Regression.
Non-scale variable: Use Generalized Linear Model approaches (e.g., logistic regression).
Linear Regression Analytical Definition
Used to determine if a set of predictor variables effectively predict a continuous outcome variable.
Example: Predicting systolic blood pressure using age and weight.
Assumptions of Linear Regression
Multicollinearity:
Predictor variables should not be highly correlated.
Assessed using Variance Inflation Factors (VIFs); values >10 indicate high multicollinearity.
Homoscedasticity:
Consistent variance of residuals across predictor values.
Tested using Levene’s test or scatter plots.
Independent Observations:
Each observation should be independent of others.
Normality:
Residuals should follow a normal distribution.
No Significant Outliers:
Outliers should be identified visually or analytically.
Formulating a Research Question for Linear Regression
Research Question: Do the independent variables predict the continuous dependent variable?
Hypotheses:
Null: Independent variables do not predict the dependent variable.
Alternative: Independent variables do predict the dependent variable.
Interpreting Linear Regression Output
Beta Coefficients (B):
Effect of predictor on the outcome variable.
Standardized Beta Coefficients (β):
Effect size in standard deviations.
Standard Error:
Uncertainty of beta coefficients.
Confidence Interval:
Range where the true beta value likely falls.
T-statistic & p-value:
Significance of predictor influence on outcome.
Regression Equation
Allows prediction of the dependent variable based on independent variables.
Example equation:
Ŷ = 13.24 + 0.50 * (self-acceptance) where Ŷ is the predicted outcome.
Writing up Regression Results
Describe the overall model fit (F statistic, p-value, R-squared).