An introduction to assumptions of Regression: A gentle introduction to residual analysis
12 views 0 purchase
Course
STA2604
Institution
University Of South Africa (Unisa)
An in-depth yet elementary exposition of regression and its power in statistical inference. Suitable for researchers in diverse fields of intellectual endeavor interested in exploring relationships between variables.
Standard Assumptions of Multiple-Linear
Regression: A Gentle Introduction to Residual
Analysis – Checking for Violations and Remedies
(Compiled by Emmanuel Makotsi-BSc Mathematics & Statistics)
(emmanuek5050@gmail.com ; +27621456161)
Assumption 1: Correct functional Form Assumption: Linear relationship
The model specified from the given data may be correct or incorrect.
In linear regression, the mean of the response yi is a linear combination of the
predictors x1i , · · · , xk1.
In simple words, the relationship between the predictor and
dependent(outcome/response) variables must be linear.
The linearity assumption can be tested with scatter plots or the Multiple R but these
are not sufficient as a quadratic relationship may appear linear on a scatter plot with
a high correlation. A more robust way to check for any violations is to use residuals.
If the functional form of a regression model is incorrect, the residual plots
constructed by using the model often display a pattern suggesting the form of a
more appropriate model. For instance, if we use a simple linear regression model
when the true relationship between y and x is curved, the residual plot will have a
curved appearance.
We illustrate the above with the output below.
The table below shows a bivariate sample data, that is ordered pairs of two variables
(x and y).
Note that y is the response/outcome variable and x is the predictor.
, The outputs below show the scatter plot indicating the relationship between the two
variables and the Pearson’s correlation coefficient is also shown with other sample
statistics.
Model Fit Measures
Model R R²
1 0.981 0.963
Scatterplot
At first glance, both the scatter plot and the Pearson’s correlation coefficient suggest a strong linear
relationship, but looking at the residuals plot below shows that a quadratic model will fit the data more
appropriately than a linear model.
Residuals plot
The benefits of buying summaries with Stuvia:
Guaranteed quality through customer reviews
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller MAKOTSI. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $57.93. You're not tied to anything after your purchase.