Correlation and Regression Analysis


An Introduction to Correlation and Regression Analysis


  • January 7, 2025
  • 5
  • 2024/2025
  • Class notes
  • Olivia Podolak Lewandowska
  • All classes
sobikaaravi
Correlation & Regression

Definitions
➔ correlation and regression are statistical techniques used to examine relationships between variables
◆ correlation: determines the strength of an association between two quantitative variables
◆ simple regression: predicts one quantitative dependent variable from an independent variable
● dependent variable = y or the criterion
● independent variable = x or the predictor
◆ multiple regression: predicts one criterion from multiple predictors

Pearson’s Product-Moment Correlation Coefficient (r)
➔ r is the coefficient that represents the strength and the direction of the linear relationship between two variables
◆ correlation of a sample = “Pearson’s r” or just “r”
● lowercase “r” because uppercase R represents the multiple correlation coefficient used in multiple regression
◆ correlation of a population = ρ (rho)
◆ absolute value of r determines strength of the relationship between x and y
● the magnitude of the value (regardless of positive or negative)
● r = 0; no linear correlation
● r = -1 or 1; perfect correlation
○ very rare/highly unlikely
○ means x can perfectly predict y
◆ the sign (positive or negative) determines the direction of the relationship
● - value = negative relationship (as x increases, y tends to decrease)
● + value = positive relationship (as x increases, y tends to increase)
➔ the values for r or rho are always between -1 and 1 (inclusive)
➔ think of r as a way of looking at how closely the data clusters around the regression line
◆ scatterplots are useful for this (review of scatterplots in the slides from lecture)
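
To make the definition of r concrete, here is a minimal Python sketch of the usual sample formula (sum of cross-products divided by the square root of the product of the two sums of squares). The function name pearson_r and the hours/score data are made up for illustration and do not come from the lecture.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Sample Pearson correlation for paired observations xs and ys."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squared deviations from the means.
    sp_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable has zero variance")
    return sp_xy / sqrt(ss_x * ss_y)

# Hypothetical data: hours studied (x) and quiz score (y).
hours = [1, 2, 3, 4, 5]
score = [2, 4, 5, 4, 5]
r = pearson_r(hours, score)
print(f"r = {r:.4f}")  # sign gives the direction, absolute value gives the strength
```

Points that fall exactly on an upward-sloping line give r = 1, and points on a downward-sloping line give r = -1, which matches the "perfect correlation" case above.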

Testing A Correlation
➔ to determine whether a sample correlation reflects only sampling error or an actual relationship in the population, you would have to run a t-test
➔ first step = state the hypotheses and the degrees of freedom
◆ for two-tailed test
● H0 = no correlation between the 2 variables (ρ = 0)
● HA = there is a correlation between the 2 variables (ρ ≠ 0)
◆ for one-tailed test
● state the directionality of the correlation (HA: ρ > 0 or HA: ρ < 0)
◆ DF = n - 2
● n = the number of paired (x, y) observations
➔ second step = state the assumptions
◆ the DV and the IV are both normally distributed
◆ there are no outliers in either the DV or the IV; no bivariate outliers
● bivariate outliers = outliers when considering both the variables together
● correlations are not resistant to outliers, especially when the n is small
◆ the DV and IV are linearly related
● correlations only capture linearity
● no way to actually test this, would just have to look at a scatterplot
○ hence for this class, just state that it’s assumed
◆ the correlation between the two variables must be significant to run the simple regression
● the regression equation is only calculated if the correlation is significant, in other words if the null is rejected
➔ third step = find Pearson’s r
➔ fourth step = test the correlation by comparing the computed t to t-critical
➔ fifth step = if the null is rejected (i.e. significance is found), calculate the regression equation
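
The fourth step can be sketched in Python as follows. The notes do not give the test statistic, so this assumes the standard formula t = r·√(n − 2) / √(1 − r²) with df = n − 2; the class may present it in a different form. The function names and the values r = 0.8 and n = 10 are hypothetical, and t-critical is passed in by hand, as it would be when read from a t-table.

```python
from math import sqrt

def correlation_t(r, n):
    """Return (t, df) for testing H0: rho = 0, given a sample r and n pairs."""
    if n < 3:
        raise ValueError("need at least 3 pairs so that df = n - 2 is positive")
    if not -1 < r < 1:
        raise ValueError("t is undefined when |r| >= 1")
    df = n - 2
    # Assumed standard formula: t = r * sqrt(n - 2) / sqrt(1 - r^2)
    t = r * sqrt(df) / sqrt(1 - r ** 2)
    return t, df

def reject_null_two_tailed(t, t_critical):
    """Two-tailed decision: reject H0 when |t| exceeds t-critical."""
    return abs(t) > t_critical

# Hypothetical example: r = 0.8 from n = 10 pairs, so df = 8.
t, df = correlation_t(0.8, 10)
t_critical = 2.306  # two-tailed, alpha = .05, df = 8, read from a t-table
print(f"t({df}) = {t:.3f}, reject H0: {reject_null_two_tailed(t, t_critical)}")
```

Only if the null is rejected would the fifth step (calculating the regression equation) follow.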

Effect Size for r
➔ effect size for r = r²
➔ similar to Cohen’s d
◆ d measures the magnitude of the difference between two means
◆ r² measures the magnitude of the relationship between the two variables
● the proportion of variance explained
➔ r² explains how much of the variance in one of the variables (y) can be explained by its relationship with the other variable (x), the rest being attributed to error
◆ e.g. r² = 0.9025; hence 90.25% of the variance in weight can be explained by the relationship between age and weight, the rest being attributed to error
◆ basically looking at the overlap between variable x and variable y
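
The age and weight example can be reproduced with a few lines of Python. The notes only give r² = 0.9025, so r = 0.95 below is an assumption (r = -0.95 would give the same r²); the function name variance_explained is also just illustrative.

```python
def variance_explained(r):
    """Return (explained, unexplained) proportions of variance for a given r."""
    if not -1 <= r <= 1:
        raise ValueError("r must be between -1 and 1")
    r_squared = r ** 2
    # Whatever is not explained by the x-y relationship is attributed to error.
    return r_squared, 1 - r_squared

# Assumed r = 0.95 for the age/weight example (only r^2 is given in the notes).
explained, unexplained = variance_explained(0.95)
print(f"explained: {explained:.2%}, attributed to error: {unexplained:.2%}")
```

Because r is squared, the sign is lost: r² describes how much variance is shared, not the direction of the relationship.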
