100% tevredenheidsgarantie Direct beschikbaar na je betaling Lees online óf als PDF Geen vaste maandelijkse kosten 4.2 TrustPilot
logo-home
Samenvatting

Summary Statistical Methods for the Social Sciences

Beoordeling
-
Verkocht
6
Pagina's
18
Geüpload op
12-12-2020
Geschreven in
2020/2021

This is a summary of the book Statistical Methods for the Social Sciences (fifth edition) by Alan Agresti. This book is used by Statistics 2, in the second year of studying Psychology at the University of Groningen.











Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Documentinformatie

Heel boek samengevat?
Nee
Wat is er van het boek samengevat?
H9 tm/ h12
Geüpload op
12 december 2020
Bestand laatst geupdate op
30 december 2020
Aantal pagina's
18
Geschreven in
2020/2021
Type
Samenvatting

Voorbeeld van de inhoud

Statistical Methods for the Social Sciences
Chapter 9; Linear Regression and Correlation
Regression analysis Methods for analyzing association between quantitative response and
explanatory variables.
We present three different, but related, aspects of regression analysis;

1. We investigate whether an association exits between the variables by testing the hypothesis
of statistical independence
2. We study the strength of their association using the correlation measure of association
3. We estimate a regression equation that predicts the value of the response variable from the
value of the explanatory variable

9.1; Linear Relationships
Linear function The formula y= α + βx expresses observations on y as a linear function
of observations on x. The formula has a straight-line graph with slope
β (beta) and y-intercept α (alpha).
Y= response variable and x= explanatory variable

We analyze how values of y tend to change from one subset of the population to another, as defined
by values of x.

At x=0, the equation y= α + βx simplifies to y= α + βx = α + β (0) = α.

The slope β equals the change in y for a one-unit increase in x. The larger the absolute value of β, the
steeper the line.

- When β is positive, y increases as x increases- the straight line goes upwards. When a
relationship between two variables follows a straight line with β>0, the relationship is said to
be positive.
- When β is negative, y decreases as x increases. The straight line then goes downward, and
the relationship is said to be negative.

When β=0, the graph is a horizontal line. The value of y is constant and does not vary as x varies, the
two variables are independent.

An association does not imply causation.

9.2; Least Squares Prediction Equation
Scatterplot A plot of the n observations as n points. The scatterplot provides a
visual check of whether a relationship is approximately linear.
Regression outlier The point falls quite far from the trend that the rest of the data
follow. The line seems to be pulled up toward that point and away
from the center of the general trend of points. An observation is
called influential if removing it results in a large change in the
prediction equation. Unless the sample size is large, an observation
can have a strong influence on the slope if its x-value is low or high
compared to the rest of the data and if it is a regression outlier.
Residual For an observation, the difference between an observed value and the
predicted value of the response variable, y− y is called the residual.
The prediction errors are called residuals.
Least squares estimate The least squares estimates a and b are the values that provide the

, prediction equation ^y =a+bx for which the residual sum of squares,
SSE= ∑ ( Y − ^y )2, is a minimum.
To estimate the line y= α + βx we use ^y =a+bx . This formula is called the prediction equation.

Σ ( x−x ) ( y− y )
b= 2
Σ ( x−x )
a= y−b x
- A positive residual results when the observed value y is larger than the predicted value y , so
y− y> 0.
- A negative residual results when the observed value is smaller than the predicted value. The
smaller the absolute value of the residual, the better is the prediction, since the predicted
value is closer to the observed value.

In a scatterplot, the residual for an observation is the vertical distance between its point and the
prediction line.

We summarize the size of the residuals by the sum of their squared values. This quantity, denoted by
SSE (sum of squared errors), is SSE= Σ ( y−^y )2. The better the prediction equation, the smaller the
residuals tend to be and, hence, the smaller SSE tends to be.

The prediction line ^y =a+bx is called the least squares line, because it is the one with the smallest
sum of squared residuals.

The least squares line;

- Has some positive residuals and some negative residuals, but the sum (and mean) of the
residual equals 0
- Passes through the point ( x , y )

9.3; The Linear Regression Model
Deterministic For the linear model y=α + βx , each value of x corresponds to a
single value of y. Such a model is said to be deterministic. It is
unrealistic in social science research, because we do not expect all
subjects who have the same x-value to have the same y-value,
instead, the y-values vary.
Conditional distribution It is the conditional distribution of the y-values at x=12. A separate
conditional distribution applies for those with x=13.
Probabilistic model A probabilistic model for the relationship allows for variability in y at
each value of x.
Expected Value of y Let E(y) denote the mean of a conditional distribution of y. The symbol
E represents expected value.
Regression function A regression function is a mathematical function that describes how
the mean of the response variable changes according to the value of
an explanatory variable.
Conditional standard The linear regression model has an additional parameter σ describing
deviation the standard deviation of each conditional distribution. That is, σ
measures the variability of the y-values for all subjects having the
same x-value.
An equation of the form E ( Y )=α + βx that relates values of x to the mean of the conditional
distribution of y is called a regression function.

, The function E ( Y )=α + βx is called a linear regression function, because it uses a straight line to
relate the mean of y to the values of x.

The estimate of σ uses SSE= Σ ( y−^y )2, which measures sample variability about the least squares
2
line. The estimate is s= SSE = Σ y− y .
√ n−2
(

^)
n−2
The term (n – 2) in the denominator of s is the degrees of freedom (df) for the estimate. When a
regression equation has p unknown parameters, then df = n – p. The equation E ( y )=α +β x has two
parameters (α ∧β ), so df= n – 2.
2
Σ ( y− y ) .It differs from the
Estimate of the population standard deviation of a variable y is

standard deviation of conditional distribution of y, for a fixed value of x.
n−1


9.4; Measuring Linear Association; The Correlation
Correlation The correlation between variables x and y, denoted by r, is
Σ ( x−x )( y − y )
r=

√ ( Σ ( x−x ) )( ∑ ( y− y )2 )
2

Correlation is a The correlation relates to the slope b of the prediction equation
standardized slope sx
^y =a+bx by r = ( )
sy
b.

The slope b of the prediction equation tells us the direction of the association. The slope does not
directly tell us the strength of the association. The slope is useful for comparing effects of two
predictors having the same units.
2
Σ ( x−x )
sx=
√ n−1
2
∑ ( y− y )
sy=
√ n−1
If the sample spreads are equal ( s x =s y ), then r=b. Because of the relationship between r and b, the
correlation is also called the standardized regression coefficient for the model E ( y )=α+ β x .

- Correlation is valid only when a straight-line model is sensible for the relationship between x
and y. Since r is proportional to the slope of a linear prediction equation, it measures the
strength of the linear association
- −1 ≤r ≤ 1. The correlation, unlike the slope b, must fall between -1 and +1
- R has the same sign as the slope b. This holds because their formulas have the same
numerator, relating to covariation of x and y, and positive denominators. Thus, r>0 when the
variables are positively related, and r<0 when the variables are negatively related.
- R=0 for these lines having b=0. When r=0, there is not a linear increasing or linear decreasing
trend in the relationship.
- r =±1 when all the sample points fall exactly on the prediction line these correspond to
perfect positive and negative linear associations.
- The larger the absolute value of r, the stronger the linear association.

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
isabelvdb Rijksuniversiteit Groningen
Bekijk profiel
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
572
Lid sinds
5 jaar
Aantal volgers
363
Documenten
0
Laatst verkocht
3 dagen geleden
Bachelor Psychologie &amp; Master Klinische Psychologie RUG

Hoi! Aan de Rijksuniversiteit Groningen heb ik de Bachelor Psychologie (2019 t/m 2022) en de Master Klinische Psychologie (2022 t/m 2023) gevolgd. Tijdens mijn studie heb ik uitgebreide samenvattingen gemaakt van bijna alle boeken en colleges die je kan vinden op mijn profiel :) Momenteel ben ik werkzaam als psycholoog en doe ik veel psychodiagnostisch onderzoek, en voer ik behandelingen uit (vaak in combinatie met e-health). Daarnaast ben ik sinds oktober 2023 in opleiding tot GZ-psycholoog. Mocht je vragen hebben over de inhoud van een samenvatting, hoor ik het graag!

Lees meer Lees minder
4,2

46 beoordelingen

5
18
4
20
3
6
2
1
1
1

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen