
related variables.
is the value of the
variable for the
observation.
For example,
might be income,
might be the price of milk,
might be number of children, etc. Then
equals the number of children in
household (or observation) 3. We begin numbering the X variables at 2 to hold a
place for the constant term.
is an unknown population parameter. Notice that the
simple linear regression model is the special case of the multiple regression model,
k=2. Since an economic variable like Y is undoubtedly influenced by many
potentially observable influences (like price, income) there is perhaps a better way
to interpret any linear regression model as a special case of a larger (greater k)
model. In particular, we should think of the smaller model as a difference in the
observed sample information. If we do not observe all the factors that influence
Y, then we have a a sample restricted in the X variables that we can observe. We
do not necessarily have the wrong model if we only include one variable in
the regression when there should be many.
The sample for a multiple regression model is written:
refers
to variable number 2), but we naturally write an equation on one line (or row),
forcing us to have
refer to a column rather than a row. With the notation
above, all the sample information is represented by one matrix equation.
in the LRM
is log-consumption of milk and
is the log of household income, then
is the income elasticity
of milk demand.