$$\min_{\hat{\beta}_0, \hat{\beta}_1, \hat{\beta}_2, \cdots, \hat{\beta}_k} \sum_{i=1}^{n} \Big[\underbrace{Y_i - (\hat{\beta}_0 + \hat{\beta}_1 X_{1i} + \hat{\beta}_2 X_{2i} + \cdots + \hat{\beta}_k X_{ki})}_{u_i}\Big]^2$$
Math FYI: in linear algebra terms, a regression model with n observations of k independent variables:
$$\mathbf{Y} = \mathbf{X}\boldsymbol{\beta} + \mathbf{u}$$
$$\underbrace{\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}}_{\mathbf{Y}\,(n \times 1)} = \underbrace{\begin{pmatrix} x_{1,1} & x_{1,2} & \cdots & x_{1,k} \\ x_{2,1} & x_{2,2} & \cdots & x_{2,k} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n,1} & x_{n,2} & \cdots & x_{n,k} \end{pmatrix}}_{\mathbf{X}\,(n \times k)} \underbrace{\begin{pmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_k \end{pmatrix}}_{\boldsymbol{\beta}\,(k \times 1)} + \underbrace{\begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_n \end{pmatrix}}_{\mathbf{u}\,(n \times 1)}$$
The OLS estimator for $\beta$ is $\hat{\beta} = (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{Y}$
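As a quick sanity check, here is a minimal R sketch computing $\hat{\beta} = (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{Y}$ "by hand" and comparing it to lm(). It assumes the CASchool data frame used throughout these slides is already loaded; the variable names (testscr, str, el_pct) are the ones from the class size example below.

# OLS "by hand" via beta_hat = (X'X)^(-1) X'Y
# (assumes the CASchool data frame from the class size example is loaded)
Y <- CASchool$testscr
X <- cbind(1, CASchool$str, CASchool$el_pct)  # column of 1's for the intercept

beta_hat <- solve(t(X) %*% X) %*% t(X) %*% Y
beta_hat
coef(lm(testscr ~ str + el_pct, data = CASchool))  # same estimates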
Appreciate that I am saving you from such sorrow
$$\hat{\beta}_j \sim N\big(E[\hat{\beta}_j],\; se(\hat{\beta}_j)\big)$$
As before, $E[\hat{\beta}_j] = \beta_j$ when $X_j$ is exogenous (i.e. $cor(X_j, u) = 0$)
We know the true $E[\hat{\beta}_j] = \beta_j + \underbrace{cor(X_j, u)\dfrac{\sigma_u}{\sigma_{X_j}}}_{\text{O.V. Bias}}$
If $X_j$ is endogenous (i.e. $cor(X_j, u) \neq 0$), $\hat{\beta}_j$ contains omitted variable bias
We can now try to quantify the omitted variable bias
Suppose the true model is:
$$Y_i = \beta_0 + \beta_1 X_{1i} + \beta_2 X_{2i} + u_i$$
What happens when we run a regression and omit $X_{2i}$?
Suppose we estimate the following omitted regression of just $Y_i$ on $X_{1i}$ (omitting $X_{2i}$):+
$$Y_i = \alpha_0 + \alpha_1 X_{1i} + \nu_i$$
+ Note: I am using $\alpha$'s and $\nu_i$ only to denote that these are different estimates than the true model's $\beta$'s and $u_i$
Key Question: are $X_{1i}$ and $X_{2i}$ correlated?
Run an auxiliary regression of $X_{2i}$ on $X_{1i}$ to see:+
$$X_{2i} = \delta_0 + \delta_1 X_{1i} + \tau_i$$
If $\delta_1 = 0$, then $X_{1i}$ and $X_{2i}$ are not linearly related
If $|\delta_1|$ is very big, then $X_{1i}$ and $X_{2i}$ are strongly linearly related
+ Note: I am using $\delta$'s and $\tau$ to differentiate the estimates for this model.
Substituting the auxiliary regression in for $X_{2i}$ in the true model:
$$\begin{aligned}
Y_i &= \beta_0 + \beta_1 X_{1i} + \beta_2 X_{2i} + u_i \\
Y_i &= \beta_0 + \beta_1 X_{1i} + \beta_2(\delta_0 + \delta_1 X_{1i} + \tau_i) + u_i \\
Y_i &= \underbrace{(\beta_0 + \beta_2\delta_0)}_{\alpha_0} + \underbrace{(\beta_1 + \beta_2\delta_1)}_{\alpha_1} X_{1i} + \underbrace{(\beta_2\tau_i + u_i)}_{\nu_i}
\end{aligned}$$
which is exactly the omitted regression:
$$Y_i = \alpha_0 + \alpha_1 X_{1i} + \nu_i$$
$$\alpha_1 = \beta_1 + \beta_2\delta_1$$
The true effect of $X_1$ on $Y_i$: $\beta_1$
The true effect of $X_2$ on $Y_i$: $\beta_2$
The omitted variable bias is $\beta_2\delta_1$; it exists only when both conditions for O.V. bias hold:
$Z_i$ (here, $X_{2i}$) must be a determinant of $Y_i$ $\implies \beta_2 \neq 0$
$Z_i$ must be correlated with $X_{1i}$ $\implies \delta_1 \neq 0$
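A minimal simulation sketch of this result (all variable names and parameter values here are hypothetical, not from the class size data): with true $\beta_1 = 2$, $\beta_2 = 3$, and $\delta_1 = 0.5$, omitting $X_2$ pushes the estimated slope on $X_1$ toward $\beta_1 + \beta_2\delta_1 = 3.5$.

# Hypothetical simulation: the omitted regression slope goes to beta_1 + beta_2 * delta_1
set.seed(42)
n  <- 10000
x1 <- rnorm(n)
x2 <- 1 + 0.5 * x1 + rnorm(n)          # auxiliary relationship: delta_1 = 0.5
y  <- 5 + 2 * x1 + 3 * x2 + rnorm(n)   # true model: beta_1 = 2, beta_2 = 3

coef(lm(y ~ x1 + x2))["x1"]  # about 2   (unbiased)
coef(lm(y ~ x1))["x1"]       # about 3.5 (= beta_1 + beta_2 * delta_1)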
## 
## Call:
## lm(formula = testscr ~ str + el_pct, data = CASchool)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -48.845 -10.240  -0.308   9.815  43.461 
## 
## Coefficients:
##              Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 686.03225    7.41131  92.566  < 2e-16 ***
## str          -1.10130    0.38028  -2.896  0.00398 ** 
## el_pct       -0.64978    0.03934 -16.516  < 2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 14.46 on 417 degrees of freedom
## Multiple R-squared:  0.4264, Adjusted R-squared:  0.4237 
## F-statistic:   155 on 2 and 417 DF,  p-value: < 2.2e-16
$$\widehat{\text{Test Score}}_i = 686.03 - 1.10\,STR_i - 0.65\,\%EL_i$$
## 
## Call:
## lm(formula = testscr ~ str, data = CASchool)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -47.727 -14.251   0.483  12.822  48.540 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 698.9330     9.4675  73.825  < 2e-16 ***
## str          -2.2798     0.4798  -4.751 2.78e-06 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 18.58 on 418 degrees of freedom
## Multiple R-squared:  0.05124, Adjusted R-squared:  0.04897 
## F-statistic: 22.58 on 1 and 418 DF,  p-value: 2.783e-06
$$\widehat{\text{Test Score}}_i = 698.93 - 2.28\,STR_i$$
## 
## Call:
## lm(formula = el_pct ~ str, data = CASchool)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -20.823 -13.006  -6.849   7.834  74.601 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept) -19.8541     9.1626  -2.167  0.03081 *  
## str           1.8137     0.4644   3.906  0.00011 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 17.98 on 418 degrees of freedom
## Multiple R-squared:  0.03521, Adjusted R-squared:  0.0329 
## F-statistic: 15.25 on 1 and 418 DF,  p-value: 0.0001095
$$\widehat{\%EL}_i = -19.85 + 1.81\,STR_i$$
"True" Regression
^Test Scorei=686.03−1.10 STRi−0.65 %EL
"Omitted" Regression
^Test Scorei=698.93 −2.28 STRi
"Auxiliary" Regression
^%ELi=−19.85+1.81 STRi
"True" Regression
^Test Scorei=686.03 −1.10 STRi−0.65 %EL
"Omitted" Regression
^Test Scorei=698.93 −2.28 STRi
"Auxiliary" Regression
^%ELi=−19.85+1.81 STRi
α1=β1 + β2δ1
"True" Regression
^Test Scorei=686.03 −1.10 STRi −0.65 %EL
"Omitted" Regression
^Test Scorei=698.93 −2.28 STRi
"Auxiliary" Regression
^%ELi=−19.85+1.81 STRi
α1=β1 + β2δ1
The true effect of STR on Test Score: -1.10
The true effect of %EL on Test Score: -0.65
"True" Regression
^Test Scorei=686.03 −1.10 STRi −0.65 %EL
"Omitted" Regression
^Test Scorei=698.93 −2.28 STRi
"Auxiliary" Regression
^%ELi=−19.85+ 1.81 STRi
α1=β1 + β2δ1
The true effect of STR on Test Score: -1.10
The true effect of %EL on Test Score: -0.65
The relationship between STR and %EL: 1.81
"True" Regression
^Test Scorei=686.03 −1.10 STRi −0.65 %EL
"Omitted" Regression
^Test Scorei=698.93 −2.28 STRi
"Auxiliary" Regression
^%ELi=−19.85+ 1.81 STRi
α1=β1 + β2δ1
The true effect of STR on Test Score: -1.10
The true effect of %EL on Test Score: -0.65
The relationship between STR and %EL: 1.81
So, for the omitted regression:
α1=−2.28 =−1.10 + −0.65 ( 1.81 )
"True" Regression
^Test Scorei=686.03 −1.10 STRi −0.65 %EL
"Omitted" Regression
^Test Scorei=698.93 −2.28 STRi
"Auxiliary" Regression
^%ELi=−19.85+ 1.81 STRi
α1=β1 + β2δ1
The true effect of STR on Test Score: -1.10
The true effect of %EL on Test Score: -0.65
The relationship between STR and %EL: 1.81
So, for the omitted regression:
α1=−2.28 =−1.10 + −0.65 ( 1.81 )
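We can verify this identity directly in R with the three regressions shown above. This is a sketch assuming the CASchool data frame is loaded; the object names (true_reg, omitted_reg, aux_reg) are just labels I am introducing here.

# Check alpha_1 = beta_1 + beta_2 * delta_1 using the three regressions above
true_reg    <- lm(testscr ~ str + el_pct, data = CASchool)  # "true" regression
omitted_reg <- lm(testscr ~ str, data = CASchool)           # "omitted" regression
aux_reg     <- lm(el_pct ~ str, data = CASchool)            # "auxiliary" regression

coef(omitted_reg)["str"]                                                 # alpha_1, about -2.28
coef(true_reg)["str"] + coef(true_reg)["el_pct"] * coef(aux_reg)["str"]  # same value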
$$var(\hat{\beta}_j) = \underbrace{\frac{1}{1-R^2_j}}_{VIF} \times \frac{(SER)^2}{n \times var(X_j)}$$
$$se(\hat{\beta}_j) = \sqrt{var(\hat{\beta}_j)}$$
+ See Class 2.5 for a reminder of the variance formula with just one variable.
Multicollinearity: two (or more) regressors are correlated with each other, i.e. $cor(X_j, X_l) \neq 0$ for some $j \neq l$
Multicollinearity between X variables does not bias OLS estimates
Multicollinearity does increase the variance of an estimate by the Variance Inflation Factor (VIF):
$$VIF = \frac{1}{(1-R^2_j)}$$
Example: Suppose we have a regression with three regressors ($k=3$):
$$Y_i = \beta_0 + \beta_1 X_{1i} + \beta_2 X_{2i} + \beta_3 X_{3i}$$
There is an auxiliary regression (and an $R^2_j$) for each regressor:
$$\begin{aligned}
R^2_1 \text{ for } X_{1i} &= \gamma_0 + \gamma_1 X_{2i} + \gamma_2 X_{3i} \\
R^2_2 \text{ for } X_{2i} &= \zeta_0 + \zeta_1 X_{1i} + \zeta_2 X_{3i} \\
R^2_3 \text{ for } X_{3i} &= \eta_0 + \eta_1 X_{1i} + \eta_2 X_{2i}
\end{aligned}$$
$$VIF = \frac{1}{(1-R^2_j)}$$
$R^2_j$ is the $R^2$ from an auxiliary regression of $X_j$ on all other regressors (the other $X$'s)
$R^2_j$ tells us how much the other regressors explain regressor $X_j$
Key Takeaway: if the other $X$ variables explain $X_j$ well (high $R^2_j$), it will be harder to isolate the effect of $X_j$ on $Y_i$, and so $var(\hat{\beta}_j)$ will be higher
Baseline: $R^2_j = 0 \implies$ no multicollinearity $\implies VIF = 1$ (no inflation)
Larger $R^2_j \implies$ larger $VIF$
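For instance (a made-up value of $R^2_j$, not from our data): if the other regressors explain half of the variation in $X_j$, then
$$R^2_j = 0.5 \implies VIF = \frac{1}{1 - 0.5} = 2$$
so $var(\hat{\beta}_j)$ is twice what it would be with no multicollinearity.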
ggplot(data = CASchool, aes(x = str, y = el_pct)) +
  geom_point(color = "blue") +
  geom_smooth(method = "lm", color = "red") +
  labs(x = "Student to Teacher Ratio",
       y = "Percentage of ESL Students") +
  theme_classic(base_family = "Fira Sans Condensed", base_size = 20)
CAcorr_ex <- subset(CASchool, select = c("testscr", "str", "el_pct"))

# Make a correlation table
cor(CAcorr_ex)

##            testscr        str     el_pct
## testscr  1.0000000 -0.2263628 -0.6441237
## str     -0.2263628  1.0000000  0.1876424
## el_pct  -0.6441237  0.1876424  1.0000000
# our multivariate regression
elreg <- lm(testscr ~ str + el_pct, data = CASchool)

# use the "car" package for the VIF function
library("car")

# syntax: vif(lm.object)
vif(elreg)

##      str   el_pct 
## 1.036495 1.036495
The variance of $\hat{\beta}$ on str increases by 1.036 times due to multicollinearity with el_pct
The variance of $\hat{\beta}$ on el_pct increases by 1.036 times due to multicollinearity with str
# run auxiliary regression of x2 on x1
auxreg <- lm(el_pct ~ str, data = CASchool)

# use broom package's tidy() command (cleaner)
library(broom)  # load broom
tidy(auxreg)    # look at reg output
term        | estimate   | std.error | statistic | p.value
(Intercept) | -19.854055 | 9.1626044 | -2.166857 | 0.0308099863
str         |   1.813719 | 0.4643735 |  3.905733 | 0.0001095165
glance(auxreg) # look at aux reg stats for R^2
r.squared  | adj.r.squared | sigma    | statistic | p.value      | df | logLik    | AIC
0.03520966 | 0.03290155    | 17.98259 | 15.25475  | 0.0001095165 | 2  | -1808.502 | 3623.003
# extract our R-squared from aux regression (R_j^2)
aux_r_squared <- glance(auxreg) %>% pull(r.squared)
aux_r_squared  # look at it
## [1] 0.03520966
# calculate VIF manually
our_vif <- 1 / (1 - aux_r_squared)  # VIF formula
our_vif
## [1] 1.036495
Example: For our Test Scores and Class Size example, what about district expenditures per student?
CAcorr2 <- subset(CASchool, select = c("testscr", "str", "expn_stu"))

# Make a correlation table
corr2 <- cor(CAcorr2)

# look at it
corr2

##            testscr        str   expn_stu
## testscr  1.0000000 -0.2263628  0.1912728
## str     -0.2263628  1.0000000 -0.6199821
## expn_stu 0.1912728 -0.6199821  1.0000000
ggplot(data = CASchool, aes(x = str, y = expn_stu)) +
  geom_point(color = "blue") +
  geom_smooth(method = "lm", color = "red") +
  scale_y_continuous(labels = scales::dollar) +
  labs(x = "Student to Teacher Ratio",
       y = "Expenditures per Student ($)") +
  theme_classic(base_family = "Fira Sans Condensed", base_size = 20)
$cor(\text{Test score}, expn) \neq 0$
$cor(STR, expn) \neq 0$
Omitting expn will bias $\hat{\beta}_1$ on STR
Including expn will not bias $\hat{\beta}_1$ on STR, but will make it less precise (higher variance)
Data tells us little about the effect of a change in STR holding expn constant
expreg <- lm(testscr ~ str + expn_stu, data = CASchool)
summary(expreg)

## 
## Call:
## lm(formula = testscr ~ str + expn_stu, data = CASchool)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -47.507 -14.403   0.407  13.195  48.392 
## 
## Coefficients:
##               Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 675.577174  19.562222  34.535   <2e-16 ***
## str          -1.763216   0.610914  -2.886   0.0041 ** 
## expn_stu      0.002487   0.001823   1.364   0.1733    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 18.56 on 417 degrees of freedom
## Multiple R-squared:  0.05545, Adjusted R-squared:  0.05092 
## F-statistic: 12.24 on 2 and 417 DF,  p-value: 6.824e-06
vif(expreg)
##      str expn_stu 
## 1.624373 1.624373
Including expn_stu increases the variance of $\hat{\beta}_1$ by 1.62 times
$se(\hat{\beta}_1)$ on str increases from 0.48 to 0.61 when we add expn_stu

library(huxtable)
huxreg("Model 1" = school_reg,
       "Model 2" = expreg,
       coefs = c("Intercept" = "(Intercept)",
                 "Class Size" = "str",
                 "Expenditures per Student" = "expn_stu"),
       statistics = c("N" = "nobs",
                      "R-Squared" = "r.squared",
                      "SER" = "sigma"),
       number_format = 2)
                         | Model 1    | Model 2   
Intercept                | 698.93 *** | 675.58 ***
                         | (9.47)     | (19.56)   
Class Size               | -2.28 ***  | -1.76 **  
                         | (0.48)     | (0.61)    
Expenditures per Student |            | 0.00      
                         |            | (0.00)    
N                        | 420        | 420       
R-Squared                | 0.05       | 0.06      
SER                      | 18.58      | 18.56     
*** p < 0.001; ** p < 0.01; * p < 0.05.
$$\widehat{Sales} = \hat{\beta}_0 + \hat{\beta}_1\,\text{Temperature (C)} + \hat{\beta}_2\,\text{Temperature (F)}$$
$$\text{Temperature (F)} = 32 + 1.8 \times \text{Temperature (C)}$$
$cor(\text{temperature (F)}, \text{temperature (C)}) = 1$
$R^2_j = 1$ implies $VIF = \frac{1}{1-1}$: we would be dividing by zero, so $var(\hat{\beta}_j)$ is undefined
This is fatal for a regression: OLS cannot separately estimate the coefficients on two perfectly collinear regressors
Example:
$$\widehat{TestScore}_i = \hat{\beta}_0 + \hat{\beta}_1 STR_i + \hat{\beta}_2\,\%EL_i + \hat{\beta}_3\,\%ES_i$$
%EL: the percentage of students learning English
%ES: the percentage of students fluent in English
$\%ES = 100 - \%EL$
$|cor(\%ES, \%EL)| = 1$
# generate %EF variable from %EL
CASchool_ex <- CASchool %>%
  mutate(ef_pct = 100 - el_pct)

CASchool_ex %>%
  summarize(cor = cor(ef_pct, el_pct))

## cor
##  -1
ggplot(data = CASchool_ex, aes(x = el_pct, y = ef_pct)) +
  geom_point(color = "blue") +
  labs(x = "Percent of ESL Students",
       y = "Percent of Non-ESL Students") +
  theme_classic(base_family = "Fira Sans Condensed", base_size = 20)
mcreg <- lm(testscr ~ str + el_pct + ef_pct, data = CASchool_ex)
summary(mcreg)

## 
## Call:
## lm(formula = testscr ~ str + el_pct + ef_pct, data = CASchool_ex)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -48.845 -10.240  -0.308   9.815  43.461 
## 
## Coefficients: (1 not defined because of singularities)
##              Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 686.03225    7.41131  92.566  < 2e-16 ***
## str          -1.10130    0.38028  -2.896  0.00398 ** 
## el_pct       -0.64978    0.03934 -16.516  < 2e-16 ***
## ef_pct             NA         NA      NA       NA    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 14.46 on 417 degrees of freedom
## Multiple R-squared:  0.4264, Adjusted R-squared:  0.4237 
## F-statistic:   155 on 2 and 417 DF,  p-value: < 2.2e-16
R ignores one of the multicollinear regressors (ef_pct) if you include both in a regression

$\hat{\beta}_j$ on $X_j$ is biased only if there is an omitted variable ($Z$) such that:
$Z$ is a determinant of $Y_i$
$Z$ is correlated with $X_j$
$var[\hat{\beta}_j]$ and $se[\hat{\beta}_j]$ measure the precision of the estimate:
$$var[\hat{\beta}_j] = \frac{1}{(1-R^2_j)} \times \frac{SER^2}{n \times var[X_j]}$$
Again, how well does a linear model fit the data?
How much variation in $Y_i$ is "explained" by variation in the model ($\hat{Y}_i$)?
$$Y_i = \hat{Y}_i + \hat{u}_i \implies \hat{u}_i = Y_i - \hat{Y}_i$$
$$SER = \sqrt{\frac{SSE}{n-k-1}}$$
A measure of the spread of the observations around the regression line (in units of Y), the average "size" of the residual
Only new change: we divide by $n-k-1$ due to using $k+1$ degrees of freedom to first estimate $\beta_0$ and then all of the other $\beta$'s for the $k$ regressors1
1 Again, because your textbook defines $k$ as including the constant, the denominator would be $n-k$ instead of $n-k-1$.
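A minimal sketch checking this in R for the multivariate regression elreg from above (so $n = 420$ and $k = 2$), using sigma() to pull the SER that summary() reports:

# SER "by hand": sqrt(SSE / (n - k - 1)) for the elreg model above
n   <- nobs(elreg)              # 420
k   <- 2                        # two regressors: str and el_pct
SSE <- sum(residuals(elreg)^2)

sqrt(SSE / (n - k - 1))         # about 14.46
sigma(elreg)                    # the "Residual standard error" from summary()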
$$R^2 = \frac{ESS}{TSS} = 1 - \frac{SSE}{TSS} = (r_{X,Y})^2$$
Problem: $R^2$ of a regression increases every time a new variable is added (it reduces SSE!)
This does not mean adding a variable improves the fit of the model per se; $R^2$ gets inflated
We correct for this effect with the adjusted $R^2$:
$$\bar{R}^2 = 1 - \frac{n-1}{n-k-1} \times \frac{SSE}{TSS}$$
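And a short sketch computing $\bar{R}^2$ by hand for elreg and comparing it to what summary() reports (again assuming the CASchool data and elreg object from above):

# Adjusted R^2 "by hand": 1 - (n-1)/(n-k-1) * SSE/TSS
n   <- nobs(elreg)
k   <- 2
SSE <- sum(residuals(elreg)^2)
TSS <- sum((CASchool$testscr - mean(CASchool$testscr))^2)

1 - (n - 1) / (n - k - 1) * (SSE / TSS)   # about 0.4237
summary(elreg)$adj.r.squared              # same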
## 
## Call:
## lm(formula = testscr ~ str + el_pct, data = CASchool)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -48.845 -10.240  -0.308   9.815  43.461 
## 
## Coefficients:
##              Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 686.03225    7.41131  92.566  < 2e-16 ***
## str          -1.10130    0.38028  -2.896  0.00398 ** 
## el_pct       -0.64978    0.03934 -16.516  < 2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 14.46 on 417 degrees of freedom
## Multiple R-squared:  0.4264, Adjusted R-squared:  0.4237 
## F-statistic:   155 on 2 and 417 DF,  p-value: < 2.2e-16
$R^2$ (R calls it "Multiple R-squared") went up
"Adjusted R-squared" went down

elreg %>% glance()
r.squared | adj.r.squared | sigma | statistic | p.value | df | logLik | AIC | BIC | deviance | df.residual |
0.426 | 0.424 | 14.5 | 155 | 4.62e-51 | 3 | -1.72e+03 | 3.44e+03 | 3.46e+03 | 8.72e+04 | 417 |