Correlation versus regression
Correlation and regression both rely on the same kind of calculations, but whatever correlation can
do, regression can do as well but also much more. When you’re making a regression, don’t refer to
the outcomes as a correlation!
Regression analysis
Technique to understand and quantitatively summarize relationships among variables.
Learn the basis of this technique.
Learn how to apply this technique.
Learn how to interpret this technique.
Relations between variables
Dependent variable Y = Variable to be explained.
Independent variable X = Explanatory variable.
Regress Y on X.
Causal effect is often hypothesized, but not necessarily.
o Positive and negative effects.
Examples in public administration
What is the relationship between:
Civil servant motivation and output?
Municipality spending and economic growth?
Law enforcement effort and crime rates?
Management strategies and school success?
Correlation coefficient (rho) or r
Degree of strength of (linear) association between two variables.
Is the standardized covariation between two variables X and Y.
The covariation between two variables is the way that we put those two things together.
When we have more of one, do we have more or less of the other?
Standardization with respect to scale (variation in X and variation in Y).
The correlation coefficient is also standardized, because we want some kind of metric that
tells us the same kind of information regardless of what me measure. This way the result will
always range between -1 and 1.
What is a variance?
The variance is the degree of difference in scores. It shows how far a score is relative to the average.
Product of deviances
Covariance (X, Y) = Sum of product of deviances in X and Y for all data points i.
, Variance (X) = Sum of squared deviances in X.
Variances (Y) = Sum of squared deviances in Y.
Interpretation
The correlation coefficient is a statistic/ numerical summary of the strength of a linear relationship
between X and Y.
Ranges from -1 to +1.
+1 means strong positive correlation or strong positive (linear) relationship.
-1 means strong negative correlation.
0 means no (linear) relationship.
Additional interpretation
The slope of the regression line.
The correlation coefficient can’t distinguish the difference between the lines in the middle row, but
can distinguish the difference between the lines in the top row.
Even though there’s a pattern in the bottom row, the correlation coefficient can’t tell us anything
about the existence of those patterns, because correlations coefficients measure the strength of the
linear relationships between X and Y. The bottom row doesn’t contain linear relationships.
,Perfect positive correlation
No correlation
Scale of Y is smaller
, Y does not vary
Not so perfect correlation
Outlier effect for small n
Voordelen van het kopen van samenvattingen bij Stuvia op een rij:
Verzekerd van kwaliteit door reviews
Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!
Snel en makkelijk kopen
Je betaalt supersnel en eenmalig met iDeal, creditcard of Stuvia-tegoed voor de samenvatting. Zonder lidmaatschap.
Focus op de essentie
Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!
Veelgestelde vragen
Wat krijg ik als ik dit document koop?
Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.
Tevredenheidsgarantie: hoe werkt dat?
Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.
Van wie koop ik deze samenvatting?
Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper VHouten. Stuvia faciliteert de betaling aan de verkoper.
Zit ik meteen vast aan een abonnement?
Nee, je koopt alleen deze samenvatting voor €6,44. Je zit daarna nergens aan vast.