Table of contents
1. Fixed Effects Model
1.1 Motivation
- The fixed effects model aims to address a class of endogeneity issues characterized by the presence of unobserved variables that are correlated with both the independent and dependent variables, and are assumed to remain constant over time.
- For example, different states in America may have different age distributions, but the age distribution within each state tends to be relatively stable and does not change over time.
- Failure to include these variables may lead to omitted variable bias.
- We need panel data in this case.
1.2 Model Specification
- In this model, represents unobserved time-invariant variables, which are also referred to as the fixed effect in the context of fixed effects models. is called the idiosyncratic error, which represents unobserved time-variant variables.
- Model settings: , leading to .
- Next: how to estimate in this case? what additional assumptions are needed? We first illustrate the estimation process, which is application-oriented, and then briefly discuss the assumptions required, focusing on the theoretical aspects.
1.3 Model Estimation
Two different estimation processes lead to two different but equivalent estimators.
1) Within Estimator
Let:
Then:
Hence, we can regress on . The fixed effects estimator is also called the within estimator.
2) LSDV Estimator
We can also formulate the model into following form:
In this case, we can regress on . This is called the least squares dummy variables regression (LSDV) . It can be proved that within estimator is identical to LSDV estimator.
1.4 Strong Exogeneity
In order to get consistent estimators in FEM case, need to be strictly exogenous:
This is a pretty strong assumption!
1.5 Time Fixed Effects & Entity Fixed Effects
What we have discussed pertains to what we call entity fixed effects, where the fixed effects remain constant over time but vary across entities. It can be expressed as in the model. Another type of fixed effects is time fixed effects, where the fixed effects remain constant across entities but vary over time. The model just changes the entity fixed effects into time fixed effects :
The estimation approach is identical to that of the model with entity fixed effects.
However, we should remember that although we employ an entity and time fixed effects model, we are still unable to completely eliminate all omitted variable bias, as there may still be some omitted variables that vary across entities and change over time.
1.6 R Example
- 10.1 Panel Data ~ 10.6 Drunk Driving Laws and Traffic Deaths
- 10.7 Exercises | Introduction to Econometrics with R
2. Random Effects Model
2.1 Motivation
- The random effects model aims to address the issue of autocorrelated error terms, which arises because the error term includes one component that does not vary over time, leading to correlation of the error terms within the same entity.
- Autocorrelated error terms do not affect unbiasedness or consistency, but it does make the standard errors of OLS estimates and other test statistics invalid, thereby rendering various statistical tests unreliable.
2.2 Model Specification
The model form is the same as that of the fixed effects model:
However, the settings are quite different:
- In this model, also means unobserved time-invariant variables but it represents one part of the error term ——the time-invariant part. represents the time-variant part of the error term.
- In this context, some people say that one main difference between the Fixed Effects Model and the Random Effects Model is that the former treats as fixed, while the latter treats as random.
- Besides: and this is another main difference with FEM. It means that REM is not designed for dealing with endogeneity issues caused by unobserved variables. Instead, it is for addressing the autocorrelated errors problem.
2.3 Model Estimation
Why are the error terms correlated here? let
We can see that the error terms from different times within the same individual are correlated. Hence, we have to resort to GLS to get a good estimate for . The transformation multiplier for GLS is:
Then do the data transformation:
We can use multiple ways to estimate λ and then transform the data. Finally, use OLS to do estimation on the transformed data, getting .