Research Methods in Communication Science (S_RPPS)
Summary
Summary - Research Methods in Communication Science (S_RPPS)
17 views 0 purchase
Course
Research Methods in Communication Science (S_RPPS)
Institution
Vrije Universiteit Amsterdam (VU)
The "Research Methods" summary is a comprehensive guide covering six weeks of theoretical concepts and practical instructions for using R. From regression basics to advanced statistical techniques like MANOVA, this condensed resource is an essential tool for exam preparation, offering everything yo...
Research Methods in Communication Science (S_RPPS)
All documents for this subject (9)
Seller
Follow
isiswoutersen
Content preview
FINAL EXAM RESEARCH METHODS
Week 1 Regression Introduction and Multiple Regression
Linear regression: Y = a + b * x
Residuals = the difference between each observation and the model fitted to the
data (all observations)
Least squares method = a method to find the best fitting line, that reduces the error
(residuals) as much as possible. The line with the smallest possible residuals.
R-squared = the proportion of variance in the dependent variable that can be
explained by the independent variable. Shows how well the data fit in the regression
model. Number between 0 and 1, 1 is the best possible model in which all variance in
the dependent variable is explained.
F-test = tests whether the independent variables together explain a significant part of
the variance of Y.
T-test = tests whether a predictor variable makes a significant contribution to the
regression model. Provides some idea of how well a predictor predicts the outcome
variable.
T = b / SE, check the critical value (1.96, 1.645) for significance. Smaller? Not
significant
When we compare the effect of different variables in the same regression, we use
the standardized coefficient (beta). Beta is independent measuring unit.
Standardized X and Y: mean is 0 and SD is 1. The constant is always 0.
A multiple linear regression has always only one outcome variable and there can
be more than 2 predictors.
Statements:
- The sample is representative of its population and the observations are
independent
- There exists a linear relationship between the independent variables and the
dependent variable
- The residuals follow the normal distribution
Multicollinearity: when there are strong correlations between two or more predictors
in a regression model.
Variance Inflation Factor (VIF): a measure of the amount of multicollinearity in a set
of multiple regression variables. Rule:
Tolerance < 0.1 and VIF > 10, there is multicollinearity.
,Regression in R
Use a Chi-squared test to check if there is a significant relationship between two
variables.
F = mean square regression / mean square residual (get these numbers from the
output of ANOVA)
, Week 2 Regression with categorical predictors and Regression assumptions
Dummy variable: a variable that can take two values, 1 (presence of an attribute) and
0 (absence of an attribute)
For a variable with k categories, we make k-1 dummy variables.
To test the difference between these two dummies, use a t-test. The p-value is
(almost) the same as the difference.
Reference category is coded with 0 in all dummies. You choose the category that
makes sense to be a reference or the control condition.
Steps to check if a categorical variable is important:
- Run two model (one without the variable, one with)
- Look at the R-squared in both models
- Compare the R-squared with a F-test
- Check if this is significant. If it’s not, it does not increase the proportion of
variance of y explained by the regression model
Regression with categorical variables (dummies) is the same as one-way ANOVA.
Hierarchical regression = in each step the researcher adds one (or more) variables.
Multiple regressions where you add new things each time. This step should
theoretically make sense: does the R-squared increase in each step?
Regression assumptions: when can you generalize the results?
- Linearity, the relationship between X and U must be linear for each value of
the other X’s.
Check with scatterplot (bivariate regression) or residuals plot (multiple
regression). The red line should be approximately on the 0-line.
- Effect of normality of residuals, the distribution is normal.
Central limit theorem = parameters are normally distributed if you have a large
enough sample
Check with histogram.
- Homoscedasticity, the spread of residues stays the same across all values of
Y.
Check with residuals plot/graph, randomly scattered in the plot.
- Independence of residuals/errors, residues from the various observations
are correlated
Check with Durbin-Watson test. Test values <1 or >3 indicate autocorrelation.
Equal to 2 = no autocorrelation.
If linearity is violated, homoscedasticity is (usually) also violated.
Assumptions violated:
- Linearity, when there is a pattern of deviation from the 0 line in the plot.
Regression coefficients cannot be trusted
- Normality, histogram. Not normal is not a problem if the sample is large
enough (>30).
The benefits of buying summaries with Stuvia:
Guaranteed quality through customer reviews
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller isiswoutersen. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $7.50. You're not tied to anything after your purchase.