Garantie de satisfaction à 100% Disponible immédiatement après paiement En ligne et en PDF Tu n'es attaché à rien
logo-home
Samenvatting van de hoorcollege(s) van Quantitative Methods €8,49
Ajouter au panier

Resume

Samenvatting van de hoorcollege(s) van Quantitative Methods

 0 fois vendu
  • Cours
  • Établissement

Samenvatting van alle hoorcolleges tijdens het vak Quantitative Methods. Per thema is het samengevat waarbij de literatuur ook is raadgepleegd.

Aperçu 3 sur 24  pages

  • 11 janvier 2023
  • 24
  • 2022/2023
  • Resume
avatar-seller
Samenvatting Quantitative
methode
Hoorcolleges

Theme 1: Intro, Variables and Techniques, OLS
Inductive research is quantitative research, a research were you look at new methods or
create them. Deductive research is qualitative research, a research about the current
theories and methods.

Validity is whether an instrument measures what it sets out to measure.
Reliability is whether an instrument can be interpreted consistently across different situations.

There are different variable types and methods of analysis:
1. Response variable (dependent variable) VS explanatory variable (independent
variable)
Dependent variable is a variable thought to be affected by changes in an independent
variable. You can think of this variable as an outcome.
Independent variable is a concept that you are using to explain the concept. For
example: size, age
2. Manifest VS latent variable
Manifest variable is a concept that is measurable which is directly use in the analysis,
think of age or gender.
Latent variable is a concept that is not directly observable like globalizing –
international migration

It is important to know which variable you need to take and combine a level of measurement
There are three different levels of measurement:
- Nominal
The values are equal like the term ‘color’
We use a frequency and there is no order or mean.
- Ordinal
The values are not equal like the term ‘satisfaction or rank’
Ordinal values have an order, but this order may not be equal.
- Interval/Ratio (Metric)
Are things that can be measured like the mean, medium, weight or age.

Dependent variable Method
Metric (Interval + ratio) Linear regression (OLS)  Theme 1
Ordinal Ordered logit regression
Nominal Multinomial logistic regression
- Binary/binomial Logistic regression  Theme 2
Changes in time (metric) Time series analysis  Theme 4
Spatial dimension Spatial Analyses  Theme 4

Linear regression
- The conditional expectation of a continuous variable Y is expressed as a linear function
of the explanatory variables X1, X2
- Specific observations deviate randomly form the expected value, so we add a random
error term to the model

,- The regression line is estimated with help of the least squares method: take the line, for
which the sum of squared residuals is as small as possible. The residual is the difference
between the observed and the predicted value.

R-Square (goodness-of-fit) measures how well the model fits the observations, the share of
the variation of Y that is explained by the model. If the dots are perfectly on the linear line
than we would have a R-Square of 100%. Through the R-Square you can answer two
questions; can we use the model and how good does the model fit.

Check model assumptions before you draw the conclusions. You need to look if the model
has a ‘good fit’ before you use it.
1. The sample consists of independent observations. The indicators need to be
independent. For example age and size are independent.
2. A linear model is suitable, that is, the relationship between the dependent variable
and the independent variables is linear.
3. The variance of the residuals is equal for all possible values of the independent
variables (constant variance or homoscedasticity  similar variances in different
groups being compared)
The residuals observation needs to be around the 0-line throughout the spectrum,
otherwise the test are unreliable for a certain range
4. The residuals are normally distributed
The idea of a bell shaped distributed, the importance of it is that it is a requirement for
us to use hypotheses testing. The average shows us that one standard deviation
covers 2/3 of the whole observation. 95% of the distribution are with two standard
deviations. 2/3 standard deviations covers 100% of the distribution.
 Linear regression models that predict non-metric dependent variables fail to meet these
conditions. Therefor we use non-linear regression models; the discrete choice models.




Outliers are the dots that are further away from the linear line and any of the other dots. The
outliers are dots that don’t have a relation with other dots. The outliers have impact on the
model if there are too many. The outliers can make the model different because the linear
line can be lower because of the outliers.
How can you detect outliers? Look at the observations beyond three standard deviations of
the mean or look at the boxplots, histograms, probability plots or scatter plots.

Study impact of influential cases:
- Idea is to compare regression outcomes with an without influential cases
- SPSS: influence of case on individual coefficients (DFBETA) and on the model fit (DFFIT)
- Influential cases with Cook’s distance > 1
Cook’s distance on SPSS: if it is higher than 1, than there could be an outlier

Test for multicollinearity

, Multicollinearity is the correlation between two (or more) explanatory variables is too high
and you can’t explain which of the variable (X or Y) would cause the problem or is
connected. You can calculate it with the R (R < 0,8 or 0,9).
- The problem with multicollinearity is that standard errors of regression coefficients
increase. They are untrustworthy coefficients.
- There is a limits size of R
- The interpretation of relevance of individual explanatory variables becomes impossible

Rules of thumb for detection:
VIF > 10 (or tolerance < 0,1)  indicates serious problem of multicollinearity
VIF substantially higher than 1 (or tolerance < 0,2)  multicollinearity may be a problem.

Linear regression: Model extensions and alternative model specifications
1. Dummy variables
Dummy variables are to include qualitative variables in regression. It is mostly an upward
parallel shift of downward parallel shift. With the dummy variables you can better look at the
results and divide them in different groups. The dummy variables fix the model because you
can see which results in which categories is placed and you can easily see the difference.
With the fixed effects you filter the categories out of it. The downside of dummies is that it
filters the groups which makes it hard to analyze. The dummies captures everything and you
don’t see what it is really. You don’t know what is driving and what the main cause is. The
dummy contradicts the problem but not the story behind it. Dummies is reliable but you don’t
know what they capturing.
We always use on dummy less than the amount of categories because one dummy is the
reference category.

When do you need to use dummy variables?
Independent variable Use of dummy variable?
Continuous Not necessary
Ordinal Not necessary
Dichotomous Yes
Nominal (more than two categories) Create help variables using dummies
(number of dummies = number of
categories minus 1)

The fixed effects model
A fixed effects model includes a set of dummy variables each of which represents a group. It
is useful to distinguish between within group and between group effects and to control for
intergroup variation that blurs interpretation of effects that are central in study.

2. Interaction variables
We speak of an interaction if the effect of an independent variable is influenced by a second
independent variable. The interaction effect is the effect that the relationship between two
variables is influenced by an extra variable. The independent variable is influenced by a
second independent variable. In the linear model there will be an interaction term added to
divide the section of the variable. For example you can divide the variables in low prior
education and high prior education. We speak of an interaction when the regression line is
not parallel.

3. What to do in case of non-linearity?
1. Add non-linear term
Quadratic regression model
2. Transformation of variables
Logarithm, square foot, reciprocal of number
3. Other model specifications

Les avantages d'acheter des résumés chez Stuvia:

Qualité garantie par les avis des clients

Qualité garantie par les avis des clients

Les clients de Stuvia ont évalués plus de 700 000 résumés. C'est comme ça que vous savez que vous achetez les meilleurs documents.

L’achat facile et rapide

L’achat facile et rapide

Vous pouvez payer rapidement avec iDeal, carte de crédit ou Stuvia-crédit pour les résumés. Il n'y a pas d'adhésion nécessaire.

Focus sur l’essentiel

Focus sur l’essentiel

Vos camarades écrivent eux-mêmes les notes d’étude, c’est pourquoi les documents sont toujours fiables et à jour. Cela garantit que vous arrivez rapidement au coeur du matériel.

Foire aux questions

Qu'est-ce que j'obtiens en achetant ce document ?

Vous obtenez un PDF, disponible immédiatement après votre achat. Le document acheté est accessible à tout moment, n'importe où et indéfiniment via votre profil.

Garantie de remboursement : comment ça marche ?

Notre garantie de satisfaction garantit que vous trouverez toujours un document d'étude qui vous convient. Vous remplissez un formulaire et notre équipe du service client s'occupe du reste.

Auprès de qui est-ce que j'achète ce résumé ?

Stuvia est une place de marché. Alors, vous n'achetez donc pas ce document chez nous, mais auprès du vendeur evavelthuijs. Stuvia facilite les paiements au vendeur.

Est-ce que j'aurai un abonnement?

Non, vous n'achetez ce résumé que pour €8,49. Vous n'êtes lié à rien après votre achat.

Peut-on faire confiance à Stuvia ?

4.6 étoiles sur Google & Trustpilot (+1000 avis)

64670 résumés ont été vendus ces 30 derniers jours

Fondée en 2010, la référence pour acheter des résumés depuis déjà 15 ans

Commencez à vendre!

Récemment vu par vous


€8,49
  • (0)
Ajouter au panier
Ajouté