ARMS Grasple notes + SPSS output for the SPSS exam

All the theory covered in SPSS for ARMS! It also contains SPSS tables and all the steps you need to carry out for each analysis. I have also added extra information from the lectures and from the internet to make things clearer. If you read this document you do not need the Grasple less...


Preview 4 out of 41  pages

  • March 10, 2021
  • 41
  • 2020/2021
ARMS GRASPLE notes 2020-2021


Introduction
Regression models describe the relationship between variables by fitting a line to the observed data.
Linear regression models use a straight line, while logistic and nonlinear regression models use a
curved line. Regression allows you to estimate how a dependent variable changes as the
independent variable(s) (the predictors) change.
-> We use linear regression to make predictions about linear relations.

Simple linear regression is used to estimate the relationship between two quantitative variables (you
only have one independent / predictor variable). You can use simple linear regression when you want
to know:

1. How strong the relationship is between two variables (e.g. the relationship between rainfall
and soil erosion).
2. The value of the dependent variable at a certain value of the independent variable (e.g. the
amount of soil erosion at a certain level of rainfall).


Correlation
= If two variables are correlated, a change in one of the variables tends to go together with a
change in the other variable.
= In other words: changes in one variable vary along with changes in another variable.

Characteristics of a correlation
-> A correlation is always between exactly two variables, not fewer or more.
-> (Pearson) correlations can only be calculated for variables at interval/ratio level.


Correlation coefficient / Pearson R
= standardized number to assess the strength of a linear relationship.
= because it is a standardized measure, the strengths of different relationships can be compared.
= Pearson R can only be calculated for quantitative variables.

R = +1 (or -1): maximum strength of a linear relation between two variables.
R = 0: no correlation / linear relation between two variables. This does not mean that there is no
relation, only that it is not a linear relationship (it could be non-linear).
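
The two boundary cases can be checked by hand. The following is a minimal sketch in plain Python, not SPSS output; the function name `pearson_r` and the data are made up for illustration. It shows R = +1 and R = -1 for perfectly linear data, and R = 0 for a relation that is perfect but curved.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equally long lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares of the deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical data, chosen only to illustrate the boundary cases.
x = [-2, -1, 0, 1, 2]
r_linear = pearson_r(x, [2 * v + 1 for v in x])   # perfect positive linear relation
r_negative = pearson_r(x, [-3 * v for v in x])    # perfect negative linear relation
r_curved = pearson_r(x, [v ** 2 for v in x])      # perfect, but non-linear relation

print(r_linear, r_negative, r_curved)
```

In the curved case Y is completely determined by X, yet the coefficient is zero, because Pearson R only measures the linear part of a relation.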

Correlation is not the same as causation
A correlation does not mean that the movement in one variable causes the other variable to move as
well. Correlation suggests an association between two variables: if two variables are correlated,
this means that changes in one variable vary along with changes in another variable. Causality means
that one variable directly brings about a change in the other. A correlation may hint at causality,
but on its own it does not show a cause-and-effect relationship.




Linear regression




1. Intercept (b0): the point where the regression line crosses the Y-axis.
2. Slope (b1): if X increases by 1, by how much does Y increase?
The ^ above Y means that it is not the observed Y-value but the predicted one.
If the X-value is zero, the predicted Y-value is equal to the intercept. In this
example: if X = zero, Y is 40 because the intercept is 40.


SPSS-output for another example
Estimated intercept = 49.050
Slope = 3.466
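
With these two estimates the regression line has the usual form: predicted Y = b0 + b1 * X. The sketch below fills in the intercept and slope from the SPSS output above. The function name `predict` and the X-values are assumptions for illustration, because the notes do not say which variables X and Y are.

```python
B0 = 49.050  # estimated intercept, taken from the SPSS output
B1 = 3.466   # estimated slope, taken from the SPSS output

def predict(x):
    """Predicted Y-value (Y with a ^) for a given X-value."""
    return B0 + B1 * x

# Hypothetical X-values, only to show how the line is used.
y_hat_at_zero = predict(0)    # equal to the intercept
y_hat_at_ten = predict(10)    # 49.050 + 3.466 * 10

print(y_hat_at_zero, y_hat_at_ten)
```

At X = 0 the prediction is the intercept itself, and each extra unit of X adds the slope, 3.466, to the predicted Y.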

Error = residual




Residual = observed data – predicted data (fitted value)

We look for the line with the smallest possible sum of squared errors: this gives the linear
regression model that fits the data best.
= least squares method.
-> this method is used to estimate the parameters of the linear regression model. When we
square the errors, they are always positive, so they do not cancel each other out. This way we can
look for the line that results in the smallest possible sum of squared errors.
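
The least squares idea can be sketched in plain Python. This is an illustration on made-up data, not the SPSS procedure itself; the names `fit_line` and `sum_squared_errors` are hypothetical. It uses the standard formulas for simple regression: slope = sum of cross-products / sum of squares of X, and intercept = mean of Y - slope * mean of X.

```python
def fit_line(xs, ys):
    """Least squares estimates (b0, b1) for a simple linear regression."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1

def sum_squared_errors(xs, ys, b0, b1):
    """Sum of squared residuals (observed minus predicted) for a given line."""
    return sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))

# Hypothetical data for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

b0, b1 = fit_line(x, y)
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]

sse_best = sum_squared_errors(x, y, b0, b1)
sse_shifted = sum_squared_errors(x, y, b0 + 0.5, b1)   # same slope, moved up
sse_steeper = sum_squared_errors(x, y, b0, b1 + 0.2)   # same intercept, steeper

print(b0, b1, sse_best, sse_shifted, sse_steeper)
```

The raw residuals add up to zero because positive and negative errors cancel, which is why the errors are squared before they are summed. Any other line, such as the shifted or steeper one, gives a larger sum of squared errors.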


Goodness of fit
= how well the model (the prediction based on the independent variable) fits the observed data.
For example: R-squared

R-squared
= proportion of variance of the response variable that is explained by the predictor variable / the
model.
-> Value between 0 and 1.
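
As a sketch of how this proportion can be computed: R-squared = 1 - (sum of squared residuals / total sum of squares of Y). The example below uses made-up data and a hypothetical helper `r_squared`; SPSS reports the same quantity in its Model Summary table.

```python
def r_squared(xs, ys):
    """Proportion of variance in ys explained by a simple linear regression on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Least squares estimates of slope and intercept.
    b1 = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
          / sum((x - mean_x) ** 2 for x in xs))
    b0 = mean_y - b1 * mean_x
    ss_res = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))  # unexplained
    ss_tot = sum((y - mean_y) ** 2 for y in ys)                     # total
    return 1 - ss_res / ss_tot

# Hypothetical data for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r2 = r_squared(x, y)

print(round(r2, 3))
```

For these data the residual sum of squares is 2.4 and the total sum of squares is 6, so the model explains 60% of the variance in Y.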

If the R-squared is very small, this does not mean that there is no meaningful relationship between
the two variables. The relationship could still be practically relevant, even though it does not
explain a large amount of variance.

If the R-squared is very large, this does not mean that the model is useful for predicting new
observations. A very large R-squared could be due to the sample, and the model might not predict
well in a different sample.
-> the large R-squared can be caused by the specific sample.


R-squared in the SPSS-output below = 8.3%

Week 1 Linear Regression
Assumptions before performing a multiple regression
1. Measurement level
One condition for a multiple regression is that the dependent variable is a continuous measure
(interval/ratio). The independent variable / predictor should be continuous or dichotomous (nominal
with two categories, for example: gender, male = 1, female = 0).
2. Linearity
The second condition for a multiple regression is that there are linear relationships between the
dependent variable and all continuous independent variables.
-> you should make a scatterplot of each IV and the DV to check whether the relationship is linear. If
it is, it is possible to include this variable as an independent variable within the analysis.
3. Absence of outliers.
-> check this assumption with a scatterplot.
-> if there is an outlier, remove it from the data set.
-> produce a new scatterplot and compare the old and the new one.
-> if you cannot tell from the two scatterplots whether the relationship is stronger or weaker with
or without the outlier, you can also compare the R-squared values: the larger the R-squared, the
stronger the relationship.
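
The comparison of R-squared with and without an outlier can be sketched as follows. The data are made up for illustration: the last point is placed far below the trend of the others. The helper `r_squared` is hypothetical and not part of SPSS.

```python
def r_squared(xs, ys):
    """Proportion of variance in ys explained by a simple linear regression on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    b1 = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
          / sum((x - mean_x) ** 2 for x in xs))
    b0 = mean_y - b1 * mean_x
    ss_res = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

# Hypothetical data: five points close to a straight line plus one outlier (6, 1.0).
x_all = [1, 2, 3, 4, 5, 6]
y_all = [2.1, 3.9, 6.2, 7.8, 10.1, 1.0]

r2_with_outlier = r_squared(x_all, y_all)
r2_without_outlier = r_squared(x_all[:-1], y_all[:-1])  # drop the last point

print(round(r2_with_outlier, 3), round(r2_without_outlier, 3))
```

In this made-up example a single outlier pulls the R-squared down from close to 1 to close to 0, which is the kind of difference the two scatterplots are meant to reveal.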




[SPSS output: explained variance by the model (the IV) with outlier.]

[SPSS output: explained variance by the model (the IV) without outlier.]
