Queen's University at Kingston

HyperMetricsNotes

regress File 4


C. Derivation and Numerical Properties of OLS Estimates for the LRM

    Now that we have fully specified the LRM, the first problem is how to estimate the values of $\beta_1$ and $\beta_2$ from the sample information. There are many ways to approach the problem of estimation in the context of the LRM. But under our assumptions, they usually end up pointing toward very similar solutions. Under different assumptions, however, various approaches to estimating the PRE can give very different results.

  1. The Least Squares Approach

    Given any estimates $\hat\beta_1$ and $\hat\beta_2$ of $\beta_1$ and $\beta_2$, define the following terms:
    Predicted value of $Y_i$:
    $\hat{Y_i} \equiv \hat\beta_1+\hat\beta_2 X_i$
    Prediction error:
    $e_i \equiv Y_i - \hat{Y_i}$
    Sum of squared errors (SSE, also called RSS):
    $SSE \equiv \sum_{i=1}^N e_i^2$
    The Ordinary Least Squares estimates of $\beta_1$ and $\beta_2$ are defined as
    the particular values $\hat\beta^{OLS}_1$ and $\hat\beta^{OLS}_2$ of $\hat\beta_1$ and $\hat\beta_2$ that minimize SSE for the sample data.
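
    Written out in terms of the trial estimates, the function being minimized is
    $$SSE(\hat\beta_1,\hat\beta_2) = \sum_{i=1}^N \left(Y_i - \hat\beta_1 - \hat\beta_2 X_i\right)^2$$
    so the OLS estimates are the solution to a minimization problem in two unknowns.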

    Graphical Representation of OLS Terminology

    The Week 3 tutorial gives you a chance to experiment with fitting a straight line to a sample.

  2. Derivation of OLS Estimates for the LRM

    In class we derive the OLS estimates as:
    $$\eqalign{\hat\beta_2^{OLS} &= {\sum_{i=1}^N (X_i-\bar X)(Y_i-\bar Y)\over \sum_{i=1}^N(X_i-\bar X)^2}\cr \hat\beta_1^{OLS} &=\bar Y - \hat\beta_2^{OLS} \bar X\cr}$$

    Note: Because we won't be considering other estimators for several weeks, the superscript OLS will be dropped. Unless noted otherwise, $\hat\beta_2$ refers to the OLS estimate of $\beta_2$, and likewise for $\hat\beta_1$.

    The solutions come from solving the two normal equations
    $$\eqalign{\sum_{i=1}^N e_i &= 0\cr \sum_{i=1}^N e_i X_i &= 0\cr}$$
    which in turn are simply the first order conditions for minimizing SSE.
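
    To fill in that step: substituting $e_i = Y_i - \hat\beta_1 - \hat\beta_2 X_i$ into SSE, differentiating with respect to each trial estimate, and setting the derivatives to zero gives
    $$\eqalign{{\partial SSE\over\partial\hat\beta_1} &= -2\sum_{i=1}^N (Y_i - \hat\beta_1 - \hat\beta_2 X_i) = -2\sum_{i=1}^N e_i = 0\cr {\partial SSE\over\partial\hat\beta_2} &= -2\sum_{i=1}^N X_i\,(Y_i - \hat\beta_1 - \hat\beta_2 X_i) = -2\sum_{i=1}^N e_i X_i = 0\cr}$$
    and dividing each line by $-2$ yields exactly the two normal equations above.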

  3. Numerical Properties of OLS Estimates

    You should be able to derive each of the properties below on your own. The Week 3 tutorial gives you the opportunity to verify these properties numerically, but they are also easy to prove symbolically.

    Property 1
    The regression line passes through the sample means of X and Y
    $\bar Y = \hat\beta_1 + \hat\beta_2 \bar X$
    Property 2
    The mean of predicted Y values in the sample equals the sample mean of Y values
    $ \sum_{i=1}^N {\hat Y_i / N} = \bar Y$
    Property 3
    The mean (and sum) of prediction errors is zero
    $\sum_{i=1}^N e_i = \sum_{i=1}^{N} e_i / N = 0$
    Property 4
    The prediction errors are uncorrelated with the predicted Y values
    $\sum_{i=1}^N e_i \hat Y_i = 0$
    Property 5
    The prediction errors are uncorrelated with the sample values of X
    $\sum_{i=1}^N e_i X_i = 0$
    Property 6
    The OLS estimates are linear functions (weighted sums) of the Y values

    $$\hat\beta_2 = \sum_{i=1}^N k_i Y_i = \sum_{i=1}^N k_i y_i$$

    $$\hat\beta_1 = \sum_{i=1}^N (1/N - k_i\bar X) Y_i$$
    where for each observation i we define:
    $$\eqalign{x_i &\equiv X_i - \bar X\cr y_i &\equiv Y_i - \bar Y\cr k_i &\equiv {x_i \over \sum_{j=1}^N x_j^2}\cr}$$
    The weights sum to zero, $\sum k_i = 0$ (because the deviations $x_i$ themselves sum to zero), and $\sum k_i^2 = {1\over \sum_{i=1}^N x_i^2}$. These two properties of the OLS weights $k_i$ will prove useful when we prove the Gauss-Markov Theorem later on.
    Each of these properties springs directly or indirectly from the normal equations.
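
    A quick way to verify Properties 2 through 5 for any sample is to let Stata compute the predicted values and prediction errors after a regression. A minimal sketch (assuming a dataset with variables Y and X is in memory, as in the numerical example of the next section; output omitted):

      . regress Y X                          Computes the OLS estimates
      . predict Yhat                         Generates predicted Y from the last regression
      . predict e, resid                     Generates the prediction errors (residuals)
      . summarize Y Yhat e                   Mean of Yhat equals mean of Y; mean of e is 0
      . correlate e Yhat X                   Correlations of e with Yhat and with X are 0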

  4. Examples of OLS Estimates

    1. Purely Numerical

      The following illustrates computing OLS estimates for a small sample (N=5). The Stata commands that generate a variable for each step needed to compute the OLS estimates are shown, but there is no need to do this in practice: the regress command and its options do it all automatically, as illustrated at the end of the example.
      
      *
      *  Calculating OLS estimates of regression coefficients
      *
      . input X Y
            X    Y
        1.  1   .75
        2.  2   2.25
        3.  3   4
        4.  4   2.5
        5.  5   5.5
        6. end
      
      . egen xbar = mean(X)                   Computes mean of X, puts in xbar
      . egen ybar = mean(Y)                  Computes mean of Y, puts in ybar
      . gen x = X-xbar                        Generates deviation from mean
      . gen y = Y-ybar
      . gen xsq = x*x                         Generates deviation squared
      . egen sxsq = sum(xsq)                  Generates sum of squared deviations
      . gen k = x/sxsq                        Generates weight on Y
      . gen kY = k*Y                         Generates k times Y (term in b2hat)
      . egen b2hat = sum(kY)                  Generates b2hat (OLS estimate)
      . gen b1hat = ybar - b2hat*xbar         Generates b1hat (OLS estimate)
      . gen Yhat = b1hat + b2hat*X           Generates predicted Y (on the OLS line)
      . gen e = Y - Yhat                      Generates prediction error or residual
      . gen esq = e*e                         Generates OLS error square
      
      . list, nodisplay noobs
      
       X   Y   xbar ybar x    y   xsq sxsq   k      kY  b2hat b1hat Yhat    e    esq
      --- ---- -------- ---  ---- --- ----  ----  ----- ----- ----- -----  ----- ----
      1.0 0.75 3.0 3.0 -2.0 -2.25 4.0 10.0 -0.20 -0.150 0.975 0.075 1.050 -0.300 0.09
      2.0 2.25 3.0 3.0 -1.0 -0.75 1.0 10.0 -0.10 -0.225 0.975 0.075 2.025  0.225 0.05
      3.0 4.00 3.0 3.0  0.0  1.00 0.0 10.0  0.00  0.000 0.975 0.075 3.000  1.000 1.00
      4.0 2.50 3.0 3.0  1.0 -0.50 1.0 10.0  0.10  0.250 0.975 0.075 3.975 -1.475 2.17
      5.0 5.50 3.0 3.0  2.0  2.50 4.0 10.0  0.20  1.100 0.975 0.075 4.950  0.550 0.30
      
      . regress Y X
      
        Source |       SS       df       MS                  Number of obs =       5
      ---------+------------------------------               F(  1,     3) =    7.88
         Model |     9.50625     1     9.50625               Prob > F      =  0.0674
      Residual |     3.61875     3     1.20625               R-square      =  0.7243
      ---------+------------------------------               Adj R-square  =  0.6324
         Total |      13.125     4     3.28125               Root MSE      =  1.0983
      
      ------------------------------------------------------------------------------
             Y |      Coef.   Std. Err.       t     P>|t|      [95% Conf. Interval]
      ---------+--------------------------------------------------------------------
             X |       .975   .3473111      2.807   0.067      -.1302989    2.080299
         _cons |       .075   1.151901      0.065   0.952      -3.590862    3.740862
      ------------------------------------------------------------------------------
      
      
      The top part of this example shows how you could use Stata as a calculator to compute OLS estimates from data. Each "generate" statement is a step in the formulas for the OLS estimates. However, once you understand what the formulas mean, there is no need to compute OLS estimates in this way: Stata does it automatically with its "regress" command. If you look at the "Coef." column you will see that the estimates match the b1hat and b2hat values computed above. The estimated coefficient on X is $\hat\beta_2$ and the coefficient on the constant (=1) is $\hat\beta_1$. Notice that when computing OLS estimates Stata never knows the true population parameters, so "Coef." is short for "OLS Coefficient Estimates".

      Stata also reports the SSE in the output table, although it calls it the Residual Sum of Squares (RSS): it is the entry in the SS column of the Residual row. The interpretation of all the values in the table will become apparent as we go along.
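
      As a quick arithmetic check on the table above: the Total SS is the sum of the Model and Residual SS ($13.125 = 9.50625 + 3.61875$), each MS is the corresponding SS divided by its df ($1.20625 = 3.61875/3$), and the Root MSE is the square root of the Residual MS ($1.0983 \approx \sqrt{1.20625}$).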

    2. A Substantial Example

      We return to the cigarette data:
      
      
      . gen female = dvsex - 1               dvsex was coded 1,2 not 0,1
      . regress cigs female
      
        Source |       SS       df       MS                  Number of obs =     600
      ---------+------------------------------               F(  1,   598) =    4.33
         Model |      403.44     1      403.44               Prob > F      =  0.0378
      Residual |  55669.2533   598  93.0923969               R-squared     =  0.0072
      ---------+------------------------------               Adj R-squared =  0.0055
         Total |  56072.6933   599  93.6105064               Root MSE      =  9.6484
      
      ------------------------------------------------------------------------------
          cigs |      Coef.   Std. Err.       t     P>|t|      [95% Conf. Interval]
      ---------+--------------------------------------------------------------------
        female |      -1.64   .7877918     -2.082   0.038      -3.187175    -.092825
         _cons |   6.533333    .557053     11.728   0.000       5.439315    7.627351
      ------------------------------------------------------------------------------
      
      Compare the estimated coefficients from the regression above to the earlier analysis of the conditional expectation of cigarettes. Can you use equation E7 to explain the connection?
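
      To explore the connection numerically, note that the fitted value from this regression is $\hat\beta_1 = 6.533333$ when female = 0 and $\hat\beta_1 + \hat\beta_2 = 6.533333 - 1.64 = 4.893333$ when female = 1, and compare these to the sample means of cigs within each group. A minimal sketch of one way to get those group means in Stata (assuming the variables above are still in memory; output omitted):

      . sort female
      . by female: summarize cigs            Sample mean of cigs for female=0 and female=1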


    This document was created using HTX, a (HTML/TeX) interlacing program written by Chris Ferrall.
    Document Last revised: 1997/1/5