
Simple linear regression model
Sampling distribution
First, we define the true population regressi on function as follows
y = 1 + 3z + ε (7)
where ε is the error term, which is normally distributed with a zero
mean and a constant variance. z is an indepe nd e nt variable that
follows a beta distribution. Note thaty is a linear combination of
(ε), which means that y is expected to be normally distributed.
We rand oml y sample 2,000 samples of 50 conversations without
replacement from a large population of 15000 observations. We
store the OLS estimators from the 2000 regr ess ion s to compute the
sampling distributions of the constant and the slope.
For large samples, we prove that the OLS estimators are normally
distributed and BLUE (see the RStudio example for i nst ruc ti ons
on the Monte Carlo simulations
).