measures a population stat…
that is different than the “null” value.
(“Null” usually being “not different than zero,” “no
effect,”
“no difference,” “no information,” etc.)
\[Y = b_{0} + b_{1}X_{1} + e\]
\[Y = b_{0} + b_{1}X_{1} + b_{2}X_{2} + e\]
\[Y = b_{0} + b_{1}X_{1} + b_{2}X_{2} + e_{1} + e_{2}\]
For now, let’s focus on …
Sources of Variance
\[Y = b_{0} + b_{1}X_{1} + ( b_{ZIP}X_{ZIP} + b_{Salary}X_{Salary} ) + e \]
\(R^2_{\text{1st Model}} \ = b_{0} + b_{1}X_{1} \qquad \qquad \qquad \qquad \qquad \qquad \quad + e\)
\(R^2_{\text{2nd Model}} = b_{0} + b_{1}X_{1} + ( b_{ZIP}X_{ZIP} + b_{Salary}X_{Salary} ) + e\)
\(\text{Difference} = R^2_{\text{2nd Model}} - R^2_{\text{1st Model}}\)
Non-Ostensible Variables
\(\text{Difference} = R^2_{\text{2nd Model}} - R^2_{\text{1st Model}}\)
\[\text{Difference in Model Fit} =\]
\[ AIC_{\text{Model without Family}} - AIC_{\text{Model with Family}}\]
\[\text{Diff. in Model Fit} = 2020 - 2000 = 20\]
\(\ \ \ \ b_{1}X_{1}\)
\(\ \ \ \ b_{1}X_{1} + ( b_{ZIP}X_{ZIP} + b_{Salary}X_{Salary} )\)
¹ ZIP could have more dfs, depending on how many ZIP codes are in that variable, but let’s assume it’s 1 dfs (thus two levels for ZIP, either dummy or just two ZIP codes.)