100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached
logo-home
College notes Data Analysis for EBE (30K215-B-6) after midterm $8.67   Add to cart

Class notes

College notes Data Analysis for EBE (30K215-B-6) after midterm

 62 views  5 purchases
  • Course
  • Institution

This document contains all slides of the lectures + explanation of the lecturer (very detailed) + all r-codes with explanation (how to get it and what it means) + the output in R-studio.

Preview 4 out of 56  pages

  • May 9, 2022
  • 56
  • 2020/2021
  • Class notes
  • Pavel cizek
  • All classes
avatar-seller
CHAPTER 22: Multiple linear Regression, Model violations

Motivation:

•The market-model example:
(Y = ‘daily stock price of Heineken’ on X= ‘daily price of AEX’)
-model requirements were checked graphically
-transformation of Y and X into daily returns (%) was suggested
-visual observations can be misleading
–proper tests are needed

•Amazon ebook sales: no checks have been done!
(Y = `dollar sales from published ebooks’ on X= `ebookprice’)

•Baseball teams’ performance: no checks have been done!
(Y= `runs per season’ on X= `on-base and slugging percentages’)

•Wage differences: no significant differences detected (H0). Is it due to H0 being valid, small sample
size, or invalid assumptions?

22.1 Collinearity (=if the correlation between 1 explanatory variable and linear combination of some
other explanatory variables is very strong, it can lead to collinearity)

-does not influence SSE and hence the usefulness of the model
-but interpretation of the regression coefficient becomes harder
-the values of t-tests are biased towards zero
-proving the individual significances may be hard

What can be done? (against collinearity)
-only take action if necessary (collinearity isn’t always the case, there is a possibility of it)
-possible action: remove a perpetrating variable from the model or transform them into linearly
independent components
-if caused by squared or interaction terms, the problem can occasionally be solved by switching to
centered variables (if it is possible), that is, using

22.3: Non-linearity

Is the linearity in the basic assumption E ( Y )=β 0 + β 1 X appropriate?
Consequences? Model and estimates are incorrect IF LINEARITY IS VIOLATED!
What can be done? Find a correct model specification (for example logarithms, or dummies, etc)

 This can often be detected by studying the residuals

The existence of non-linearity can be tested as follows:
-estimate the original model E ( Y )=β 0 + β 1 X 1+ ..+ β k X k
-create the variable of the accompanying predictions ŷ
-extend the original model by including the square of the prediction (for example, with coefficient γ =
gamma!):

, First estimate
the normal model, after that
extend the model with PREDICT2
with using the cbind function
 conclusion: model should be
extended to a non-linear one!



22.2: Heteroskedasticity (if homoskedasticity is violated!)




Or of its second-order counterpart with interactions. The usefulness of this model, H 0 : E ( ε 2 ) =γ 0
indicates the presence of heteroskedasticity (if the x_K’s are not equal to 0, there is
homoskedasticity)

What can be done?

,- Heteroskedasticity-consistent standard errors can be used to obtain confidence intervals/tests
for parameter values
- Weighted least squares (not addressed here!)
not discussed in
lecture, because
there is
homoskedasticity
here!




Aux model is
explained by a linear
of quadratic function!
 it is gamma0 +
gamma1X1
 or gamma1X1 +
gamma 2 X1^2




Third step: regress aux model on price e-book (first option above). Alternative: regress aux model on
price e-book and square of e-book price! (=second option above!). We have to look to F-statistic and
its p-value to check whether the auxiliary model is useful

, Possible solutions as H 0 :γ =0 is rejected (because p-value < any reasonable alpha!):

- Heteroskedasticity consistent standard errors
- Weighted least squares estimation, that is, standardizing data so that errors become
homoscedastic

This is still the amazon example, and now we know there is heteroskedasticity!




standard output =
valid under homo- AND
heteroskedasticity! BUT,
standard error, t-value and
p-value are only valid
under homoscedasticity (if
obtained with lm-
command!)

 = alternative procedure
how to obtain the errors
that are also valid under heteroskedasticity! (ESTIMATED ARE FOR BOTH EQUAL!)

22.3 Non-normality (= not crucial for outcome!)

Consequences:

-the LS estimators are generally not normally distributed
-the LS estimators are not optimal anymore
-the statistical conclusions thus cannot be trusted
-however, these problems are less serious for large sample sizes (CLT implies that the LS-estimators
are approximately normal) with the main exception being prediction intervals

 Non-normality can be detected with the Kolgomorov-Smirnov, Shapiro-Wilk, or Lilliefors test and
other test procedures (see chapter 24)

What can be done?

- A perfect remedy does not exist

The benefits of buying summaries with Stuvia:

Guaranteed quality through customer reviews

Guaranteed quality through customer reviews

Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.

Quick and easy check-out

Quick and easy check-out

You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.

Focus on what matters

Focus on what matters

Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller Economiestudentje. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $8.67. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews)

60281 documents were sold in the last 30 days

Founded in 2010, the go-to place to buy study notes for 14 years now

Start selling
$8.67  5x  sold
  • (0)
  Add to cart