Samenvatting

Summary Applied Statistics I (424023-B-6) - STATISTICS PART

0 keer verkocht

Instelling
Tilburg University (UVT)

This summary covers all subjects seen in the statistics class given by Luc van Baest. However, it doesn't cover the methodology and SPSS part, so I recommend you take a look at the bundle that includes all three summaries (Statistics + Methodology + SPSS)

[Meer zien]

Voorbeeld 4 van de 39 pagina's

Bekijk voorbeeld

Heel boek samengevat? Ja
Geupload op 27 oktober 2021
Aantal pagina's 39
Geschreven in 2021/2022
Type Samenvatting

€7,39

Ook beschikbaar in voordeelbundel v.a. €12,99

In winkelwagen

Opslaan

100% tevredenheidsgarantie
Direct beschikbaar na je betaling
Lees online óf als PDF
Geen vaste maandelijkse kosten

Ook beschikbaar in voordeelbundel (1)

Methodology, Measurement & Statistics - 3 PARTS: METHODOLOGY + STATISTICS + SPSS

€ 15,77 € 12,99

3x verkocht

3 items

1. Samenvatting - Summary applied statistics i (424023-b-6) - statistics part
2. Samenvatting - Summary - methodology, measurement and statistics for pm students (424023-b-6) - meth...
3. Samenvatting - Summary spss lab sessions - methodology, measurement and statistics for pm students ...
Meer zien

PART 1: INTRODUCTION TO STATISTICS
We use “Statistics” to…

1) Describe/summarize data: descriptive statistics.
Reduce the data to understandable pieces of information.
2) Drawing inferences about populations: inferential statistics
In practice, we only have observations on a selection of cases from a larger population; we
need to evaluate whether the results in the sample are generalizable to the population.
3) Studying complex multivariate relationships: statistical modelling
In practice, we are interested in relationships between several variables. Statistics help to
uncover complex relations.

CH1: MEASUREMENT LEVELS & DATA
We make a basic distinction between 3 “types” of data, known as measurement levels:

1. Nominal Data: numbers express group membership.

 Marital status:

1= single
2= married
3= in a serious relationship, but no married
4= not specified otherwise

Nominal variables classify cases into two or more categories. Categories must be
exhaustive (all possibilities should be covered) and mutually exclusive (i.e., every case
fits into one category and one category only).

2. Ordinal Data: numbers express an ordering (less/more)
 Smoking intensity:

1= never
2= occasionally
3= regularly (at least 1 cigarette per day).
4= heavy (more than 5 cigarettes per day).

Numbers expresses more or less of a quantity, but the difference between 1 and 2 is not
the same in quantity than between 2 and 3, 3 and 4, and so on.

3. Interval and Ratio (Scale-Level)
 Data: numbers express differences in quantity using a common unit.
E.g., the difference between 70 and 80 IQ points is comparable to a difference between
100 and 110. Both span a difference of 10 units.

1

,Within scale data we can distinguish between ratio-level and interval-level:

 Ratio-level data have a “natural” zero-point (e.g., length, income). As a result, you can
compare the relative magnitude of things, e.g., you can say that one person is twice as tall
as another person.
 Interval level variables don’t have a natural zero point (e.g., “zero temperature is
meaningless), but it is arbitrarily chosen and can differ across scales (e.g., zero Fahrenheit
≠ zero Celsius).
With interval data one can compare intervals, but in contrast to ratio level one cannot
say, for example, that 30 degrees is two times hotter than 15 degrees….

Remarks:

 Both interval and ratio-level data are referred to as scale data. The idea is simple:
all variables that are not nominal or ordinal are treated as scale-level variables
(levels used in SPSS)

 Measurement level is a property of the measurement values, it is not (!) an
intrinsic property of a variable. E.g., you cannot say that “body height” has interval
level; length can be measured at different levels (think of your gym-class…).
So there is not necessarily a one-to-one relationship between measurement levels
and variables.

 Measurement levels determine the kind of statistics and statistical analyses you can
use meaningfully. For example, the mean of a nominal variable is meaningless
(e.g., “the average eye colour). Hence, for the analyses you should always respect
the measurement levels of the variables envisaged.

 Many of the commonly used statistical techniques assume scale data. However, for
many variables in the social sciences, it is not evident that data are interval level
(e.g., political interest?) Therefore, it is common practice to simply assume that we
have acquired interval data, without worrying too much if this is really true and this
turns out to be very useful.

CH2: DATA INSPECTION
Every analysis starts with data inspection (“getting to know your data”): its goal is to get
a clear picture of the data by examining one variable at the time (univariate), or pairs of
variables (bivariate):

To accomplish this goal, we use graphs and statistics. Which statistics and graphs are most
appropriate depends on the measurement level (i.e., whether the data are nominal,
ordinal, or scale level)?

2

,In general, we want to know more about…

 Central tendency: What are the most common values?

 Variability: How large are the differences between the subjects? Are there
extreme values in the sample?

 Bivariate Association: for each pair of variables, do they associate/co-vary (i.e.,
do low/large values on one variable go together with low/large values on the other
variable) PART 3

Ordinal/nominal data Scale-level data

CH3: CENTRAL TENDENCY
 Mode: the score that is observed most frequently.
 Median: the score that separates the higher half of data from the lower half

Example 1: ( N = unequal): 4, 5, 6, 7, 8, 9, 9 => median is 7

Example 2: ( N = equal): 4, 5, 6, 8, 9, 9 => median 7

 Mean M =
∑X= ∑ of all scores
N total number of scores
Example:

Resp. 1 2 3 4 5 6 7 8 9 10

grade 3 5 9 9 7 9 8 6 10 4

Central Tendency Measures
 Mean: 7 (70/10)
 Median: 3, 4, 5, 6, 7, 8, 9, 9, 9, 10 => 7.5
 Mode/modus: 9

3

, Bi-modal = two of the highest frequencies are equal

CH4: DEVIATION SCORES
Deviation score : the difference between a score Xi (score of respondent with number i)
and the Mean score.

- 3 respondents (N=3)
- Mean = 5
- Deviation scores = -2, -1, +3.

Sum of deviation to the left is
equal to sum of deviation to the right.

∑ ( xi−M )=0 (3-5) + (4-5) + (8-5) = 0

CH5: VARIABILITY
= not everyone scoring the same

 Sum of deviation scores from the mean = 0

Problem! When measuring variability it will always end up being 0; tells us nothing.
Solution Take the square of the sum of deviation scores to make it positive
 Σ(X – M)2
Notation  Sum of Squares SS = Σ(Xi – M)2

MEASURES OF VARIABILITY
For the exercise above:
VARIANCE: S2 =
∑ ( X −M ) ² = SS Scores (X) = 3,4,8
N−1 df Mean (M) = 5
Deviation scores = (X-M) = -2,
STANDARD VARIANCE (SD): -1, 3
S= √ S ²

MINIMUM/MAXIMUM: lowest/highest observed value S= √ 7=2.65

RANGE: maximum - minimum
NB: The larger the variance
the more different answers
you get.
INTERQUARTILE RANGE (IQR): ranges of scores that
encompass 50% of the middle observations; thus,
excluding the 25% lowest & 25% highest observations
4

Dit zijn jouw voordelen als je samenvattingen koopt bij Stuvia:

Bewezen kwaliteit door reviews

Studenten hebben al meer dan 850.000 samenvattingen beoordeeld. Zo weet jij zeker dat je de beste keuze maakt!

In een paar klikken geregeld

Geen gedoe — betaal gewoon eenmalig met iDeal, creditcard of je Stuvia-tegoed en je bent klaar. Geen abonnement nodig.

Direct to-the-point

Studenten maken samenvattingen voor studenten. Dat betekent: actuele inhoud waar jij écht wat aan hebt. Geen overbodige details!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper cmuyldermans. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €7,39. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 65004 samenvattingen verkocht

Opgericht in 2010, al 15 jaar dé plek om samenvattingen te kopen

Begin nu gratis

Laatst bekeken door jou

Tentamen (uitwerkingen) ·

(0)

SBB- ABO QUESTIONS WITH COMPLETE SOLUTIONS

Samenvatting ·

(0)

Samenvatting

Summary Applied Statistics I (424023-B-6) - STATISTICS PART

Document informatie

Onderwerpen

Gekoppeld boek

Meer samenvattingen voor studieboek

Geschreven voor

Verkoper

Voorbeeld van de inhoud