100% tevredenheidsgarantie Direct beschikbaar na betaling Zowel online als in PDF Je zit nergens aan vast
logo-home
Summary Statistics Pre-MSc RUG 2023/2024 €6,42   In winkelwagen

Samenvatting

Summary Statistics Pre-MSc RUG 2023/2024

1 beoordeling
 87 keer bekeken  14 keer verkocht

Summary for Statistics Pre-Master courses 2023/2024

Voorbeeld 4 van de 33  pagina's

  • 10 oktober 2023
  • 33
  • 2023/2024
  • Samenvatting
Alle documenten voor dit vak (1)

1  beoordeling

review-writer-avatar

Door: niekholterman • 1 maand geleden

Komt weinig overeen met het boek, mist veel informatie

avatar-seller
kirstenzjyh
Statistics Pre-MSc
Types of data
There are four main types of data: nominal, ordinal, interval, and ra6o. Each type has
specific characteris6cs that determine the kind of analysis that can be performed on it.

Nominal Data:
Nominal data consists of categories or labels without any inherent order. The categories are
dis6nct and mutually exclusive.
Examples:
• Colors (e.g., red, blue, green)
• Types of animals (e.g., cat, dog, bird)
• Marital status (e.g., single, married, divorced)

Ordinal Data:
Ordinal data involves categories with a meaningful order or ranking. The intervals between
categories are not necessarily equal.
Examples:
• Educa6on levels (e.g., high school, college, graduate school)
• Star ra6ngs for a movie (e.g., 1 star, 2 stars, 3 stars)
• Socioeconomic status (e.g., low, middle, high)

Interval Data:
Interval data has ordered categories with equal intervals between them.
However, it lacks a true zero point, meaning that ra6os (e.g., twice as much) cannot be
calculated.
Examples:
• Temperature in Celsius or Fahrenheit (e.g., 0°C doesn't mean "no temperature")
• IQ scores (e.g., 100 is not "twice as intelligent" as 50)

Ra3o Data:
Ra6o data has ordered categories with equal intervals, and it also has a true zero point.
This allows for meaningful ra6os and mathema6cal opera6ons.
Examples:
• Height (e.g., 0 cm represents "no height")
• Weight (e.g., 0 kg means "no weight")

These dis6nc6ons are crucial in choosing appropriate sta6s6cal analyses. For instance,
nominal data may only allow for frequency counts and percentages, while ra6o data can be
used in complex mathema6cal opera6ons like calcula6ng means and standard devia6ons.

Remember, the type of data you have will influence the choice of sta6s6cal tests and
analyses you can perform on that data.

,Measures of central tendency (loca3on)
Mode: The mode is the value that occurs most frequently in a dataset. Unlike the mean and
median, the mode can be used with both numerical and categorical data.
A dataset can have one mode (unimodal), more than one mode (mul6modal), or no mode if
all values occur with the same frequency.
The only measure of central tendency for nominal variables.

Median: The median is the middle value when a dataset is sorted in ascending order. If there
is an even number of observa6ons, the median is the average of the two middle values. The
median is less affected by extreme values compared to the mean, making it a more robust
measure in such cases.
To be used for ordinal and higher measurement levels.

Mean: The mean is the most commonly used measure of central tendency. It is calculated by
adding up all the values in a dataset and then dividing by the number of observa6ons.
To be used for interval and ra6o variables only.

Importance and Relevance:
1. Summariza3on: Measures of central tendency provide a concise summary of a dataset,
making it easier to understand and compare different datasets.

2. Interpreta3on: They help in interpre6ng the meaning of the data by providing a "typical"
value around which the data tends to cluster.

3. Comparison: Central tendency measures allow for easy comparison between different
datasets or different parts of a dataset.

4. Decision-Making: They play a crucial role in decision-making processes, especially in fields
like economics, finance, and public policy.

When to Use:
You would use measures of central tendency when you want to:
- Provide a representa6ve value that summarizes a dataset.
- Compare different datasets or parts of a dataset.
- Make informed decisions based on the central value of the data.
It's important to note that the choice of which measure to use depends on the nature of the
data and the specific ques6ons you're trying to answer. For example, if your data contains
extreme values, the median might be a more appropriate measure of central tendency than
the mean.


Measures of Variability
Measures of variability, also known as measures of dispersion, quan6fy the spread or
dispersion of data points in a dataset. They provide informa6on about the extent to which

,the data deviates from the central tendency (mean, median, or mode). There are several
common measures of variability:

Range: The range is the simplest measure of variability. It is the difference between the
maximum and minimum values in a dataset. While easy to calculate, it is sensi6ve to outliers
and may not provide a comprehensive understanding of dispersion.
Interquar3le Range (IQR): The IQR is the range between the first quar6le (25th percen6le)
and the third quar6le (75th percen6le) of the data. It gives a measure of the spread of the
middle 50% of the data, making it less sensi6ve to extreme values than the range.
Variance: The variance is the average of the squared differences between each data point
and the mean. It quan6fies the overall variability in the dataset. However, because it
involves squaring the differences, the variance is in squared units, which can be less
interpretable.
Standard Devia3on: The standard devia6on is the square root of the variance. It is oaen
preferred over the variance because it is in the same units as the original data, making it
more interpretable. It represents the average devia6on of each data point from the mean.

, Coefficient of Varia3on (CV): The CV is the ra6o of the standard devia6on to the mean,
expressed as a percentage. It is used to compare the rela6ve variability between different
datasets, especially when the means are different.

Voordelen van het kopen van samenvattingen bij Stuvia op een rij:

Verzekerd van kwaliteit door reviews

Verzekerd van kwaliteit door reviews

Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!

Snel en makkelijk kopen

Snel en makkelijk kopen

Je betaalt supersnel en eenmalig met iDeal, creditcard of Stuvia-tegoed voor de samenvatting. Zonder lidmaatschap.

Focus op de essentie

Focus op de essentie

Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper kirstenzjyh. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €6,42. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 67096 samenvattingen verkocht

Opgericht in 2010, al 14 jaar dé plek om samenvattingen te kopen

Start met verkopen
€6,42  14x  verkocht
  • (1)
  Kopen