100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached
logo-home
Regression Summary $2.96   Add to cart

Summary

Regression Summary

 1 view  0 purchase
  • Course
  • Institution

Summary of Regression notes.

Preview 2 out of 15  pages

  • November 12, 2023
  • 15
  • 2023/2024
  • Summary
avatar-seller
STA2020F Regression Summary Notes, 2019



Regression for STA2020F
William T. Msemburi1

Abstract
These notes provide a summary of the content covered in the regression section of the
STA2020F 2019 course. The purpose of these notes is to provide an overview of the regres-
sion core material in a condensed format to assist in the revision of the concepts - reinforcing
what is taught in more detail in the course lectures and what is described in the regression
section slides. In addition, relevant RStudio code and examples are shown.
1 Lecturer, Department of Statistics, University of Cape Town, South Africa



Contents Scatter Diagrams
A first step in any correlation/regression analysis
1 Correlation 1 should be a plot of the data. The basic graphical
2 Simple Linear Regression 2 technique for the two-variable situation is the scat-
ter diagram. In general the data refer to a num-
3 Multiple (Linear) Regression 5
ber of individuals, each of which provides observa-
4 Model building 10 tions on two variables. In the scatter diagram each
5 Logistic regression 12 variable is allotted one of two co-ordinate axes and
each observation defines a point, of which the co-
ordinates are the observed values of the two vari-
1. Correlation ables. The scatter diagram gives a compact illustra-
tion of the relationship between the two variables.
Introduction For example we may want to quantify the relation-
Regression and correlation analyses are techniques ship between the age of a child (months) and a
for quantifying the relationship between two or more child’s height (cm):
continuous variables. You make this kind of re-
lationships in your head all the time, for example Table 1. Age and Height data
when you calculate the age of a child based on her
height, you are assuming the older she is, the taller Age Height
she will be. We shall concentrate on situations in 18 76.1
which the general trend is linear; that is, as one vari- 19 77.1
able changes the other variable follows on the aver- 20 78.1
age a trend which can be represented by a straight 21 78.2
line. Our objectives in the Correlation section are 22 78.8
to learn about: 23 79.7
24 79.9
25 81.1
1. Visualising data using a scatter-plot. 26 81.2
2. The definition and properties of the correla- 27 81.8
tion coefficient. 28 82.8
3. Estimating and interpreting the correlation 29 83.5
coefficient.

, Regression for STA2020F — 2/15


In RStudio we input the data and generate the 2. Simple Linear Regression
plots using:
Introduction
Age = c(18,19,20,21,22,23,24,25, Simple Linear Regression is a technique for de-
26,27,28,29) scribing the relationship between the values of two
Height = c(76.1,77.1,78.1,78.2,78.8, continuous variables that enables us to predict the
79.7,79.9,81.1,81.2,81.8,82.8,83.5) value of one variable for an individual when we
only know the value of the other variable. Lin-
plot(Age,Height, main="Height ˜ Age")
ear regression refers to a straight line relationship
between variables. Simple linear regression refers
Height ~ Age
to the situation where there is only one explanato-
ry/independent variable. The method fits a straight
line to the data that in some sense gives the “best”
82




prediction of y for any value of x. This will be a
line that minimises the distance between the data
and the fitted line. Our objectives in the Simple
Height

80




Linear Regression section are to learn:
1. The parameters of the mathematical model
78




for such a relationship
2. How to estimate the parameters of this model
using RStudio
76




18 20 22 24 26 28 3. How to interpret the meanings of the esti-
Age
mated parameters
We observe that there is positive linear relationship 4. How to test if the relationship is statistically
and proceed to quantify it. significant
5. How to predict values of y given a new value
Correlation coefficient of the explanatory variable x.
The strength of the linear association between two 6. How to set up confidence intervals for the es-
variables y and x can be summarised by the sample timates and predicted values.
correlation coefficient, rxy . For n pairs of observa-
tions (yi , xi ), i = 1 . . . n, the correlation coefficient The Linear Model
is defined by: We assume

∑n (xi − x̄)(yi − ȳ) SSxy y = β0 + β1 x + ε where ε ∼ N(0, σ 2 )
rxy = √ n i =√
∑i (xi − x̄) ∑i (yi − ȳ)
2 n 2 SSx SSy In this expression:
where x̄ = the mean of the n values for x and ȳ = the • y =⇒ Response /Dependent variable,
mean of the n values for y. Note that the correlation • β0 =⇒ Intercept/Expected value of y when
coefficient is a dimensionless quantity between -1 x = 0,
and +1. In RStudio this correlation coefficient • β1 =⇒ Regression coefficient/Slope of the
is estimated using line/ Amount of change in y for a 1 unit change
in x,
cor(Age, Height) • x =⇒ Predictor/Independent variable,
• ε =⇒ Error term (since not all points will
## [1] 0.9942309 lie on the line exactly).
In this course, we do not perform any hypothesis Estimates are derived using RStudio as shown
tests related to rxy . in Figure 1.

The benefits of buying summaries with Stuvia:

Guaranteed quality through customer reviews

Guaranteed quality through customer reviews

Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.

Quick and easy check-out

Quick and easy check-out

You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.

Focus on what matters

Focus on what matters

Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller claudiacloete112. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $2.96. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews)

76800 documents were sold in the last 30 days

Founded in 2010, the go-to place to buy study notes for 14 years now

Start selling
$2.96
  • (0)
  Add to cart