Samenvatting

Samenvatting Statistiek tentamen

Beoordeling

Verkocht

Pagina's

Geüpload op

15-11-2021

Geschreven in

2018/2019

Samenvatting van de relevante hoofdstukken van statistiek

Instelling

Vak

Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Meld schending auteursrecht

Gekoppeld boek

James Mcclave, Terry Sincich Statistics

Uitgave:Onbekend
ISBN:9780130655981
Druk:9

Geschreven voor

Instelling: Universiteit Twente (UT)
Studie: Industrieel Ontwerpen
Vak: Statististics (202000186)

Alle documenten voor dit vak (1)

Documentinformatie

Heel boek samengevat?: Nee
Wat is er van het boek samengevat?: 1,2,3,4,5,6,7,8,9,14
Geüpload op: 15 november 2021
Aantal pagina's: 23
Geschreven in: 2018/2019
Type: Samenvatting

Onderwerpen

statistics
qualitative data
sampling distribution
box plot
probability
additive rule
random variable
probability distribution
random variables
uniform distribution
normal distribution

Voorbeeld van de inhoud

Chapter 1 – Statistics, Data and Statistical Thinking
1.3 Fundamental Elements of Statistics
Statistical methods are particularly useful for studying, analysing, and learning about populations of
experimental units.

An experimental unit is an object (e.g., person, thing, transaction, or event) about which we collect
data.
A population is a set of all units (usually people, objects, transactions, or events) that we are interested
in studying. Example: all working people in USA
In studying a population, we focus on one or more characteristics or properties of the units in the
population. We call such characteristics variables.

A variable is a characteristic or property of an individual experimental (or observation) unit in the
population. Age, gender, years of education etc.
Measurement is the process we use to assign numbers to variables of individual population units.
Census of the population is when measuring a variable for every unit of a population.
Sample is a subset of the units of a population.

1.4 Types of Data
All data can be classified as one of two general types:
- Quantitative data are measurements that are recorded on a naturally occurring numerical scale.
o The temperature at which each piece in a sample begins to melt
o The current unemployment rates
o The number of convicted murderers who receive the death penalty
- Qualitative data are measurements that cannot be measured on a natural numerical scale; they
can only be classified into one of a group of categories.
o A taste tester’s ranking of four brands of sauce (best, worst)
o The political party affiliation in a sample of 50 voters (Democrat, Republican)
o Closed at night (yes or no)

,Chapter 2 – Methods of Describing Sets of Data
2.1 Describing Qualitative Data
Class is one of the categories into which qualitative data can be classified.
Class frequency is the number of observations in the data set that fall into a particular class.
The class relative frequency is the class frequency divided by the total number of observations in the
data set. (Class frequency / n)
The class percentage is the class relative frequency x 100

Graphical Descriptive Methods for Qualitative Data
- Bar graph: the categories of the qualitative variable are represented by bars, where the height
of each bar is either the class frequency, class relative frequency or class percentage
- Pie Chart: the categories (classes) of the qualitative variable are represented by slices of a
pie. The size of each slice is proportional to the class relative frequency.
- Pareto Diagram: A bar graph with the categories (classes) of the qualitative variable
arranged by height in descending order from left to right.

2.2 Graphical Methods for Describing Quantitate Data
To describe, summarize and detect patterns in such data, we can use three graphical methods:
1. Dot plots: The numerical value of each measurement in the data set in
located on the horizontal scale by a dot. When data values repeat, the dots are
placed above another.
2. Steam-and-leaf display: The stem is the portion of the measurement to the
left of the decimal point, while the remaining portion, to the right of the decimal point,
is the leaf. The stems for the data set are listed in the second column. Then the leaf for
each observation is listed to the right.
3. Histograms. The possible numerial values of the quantitative variable are
partitioned into class intervals, each of which has the same width. These intervals from
the scale of the horizontal axis. The frequency or relative frequency of observations in
each class interval is determined. A vertical bar is placed over each class interval, with
the height of the bar equal to either the class frequency or class relative frequency.
2.3 Numerical Measures of Central Tendency
The mean of a set quantitative data is the sum of the measurements, divided by the number of
measurements contained in the data set. We denote the mean of a sample of measurements by 𝑥𝑥̅ . For
the mean of a population, we use a different symbol: µ

The median of a quantitative data set is the middle number when the measurements are arranged in
ascending (or descending) order. We denote the median of a sample of measurements, by M. For the
median of a population, we use a different symbol: η
A data set is said to be skewed if one tail of the distribution has
more extreme observations than the other tail.

The mode is the measurement that occurs most frequently in the
data set.

The measurement class containing the largest relative frequency is
called the modal class.

2.5 Numerical Measures of Variability
Range is equal to the largest measurement – the smallest
measurement.

Deviation is the distance between each measurement and the mean.

, The sample variance is for a sample of n measurements equal to the sum of the squared deviations
from the mean divided by n. The symbol 𝑠𝑠 2 is used to represent the sample variance.

The sample standard deviation, s, is defined as the positive square
root of the sample variance 𝑠𝑠 2 S = √ S²

s² = sample variance (sample variance)
s = sample standard deviation
σ = population standard deviation
σ² = population variance
Chebyshev rule applies to any data set, regardless of the shape of the frequency distribution of the
data.
Empirical rule: is a rule of thumb that applies to data sets with frequency distributions that are mound
shaped and symmetric.
Chebyshev’s rule Empirical rule
(µ-σ, µ+σ) At least 0% ≈68%
(µ-2σ, µ+2σ) At least 75% ≈95%
(µ-3σ, µ+3σ) At least 89% ≈All

2.6 Numerical Measures of Relative Standing
Pth percentile is a number such that p% of the measurements fall below that number and (100-p%) fall
above it.

Percentiles that partition a data set into four categories, each category containing exactly 25% of the
measurements are called quartiles.
The lower quartile (𝑄𝑄𝐿𝐿 ) – 25% of the data
The middle quartile (M) – the median of 50%
The upper quartile (𝑄𝑄𝑈𝑈 ) – 75% of the data

Z-score gives the relative location of the measurement.

Interpretation of z-scores for Mound-Shaped distributions of data
1. Approximately 68% of the measurements will have a z-score between the -1 and 1
2. Approximately 95% of the measurements will have a z-score between the -2 and 2
3. Approximately 99.7% of the measurements will have a z-score between the -3 and 3

2.7 Methods for Detecting Outliers: Box plots and z-scores
Outliers: is an observation that is unusually large or small relative to the other values in a data set.
Outliers typically are attributable to one of the following causes:
1. The measurement is observed, recorded, or entered into the computer incorrectly
2. The measurement comes from a different population
3. The measurement is correct but represents a rare event

€5,99

Krijg toegang tot het volledige document:

100% tevredenheidsgarantie

Direct beschikbaar na je betaling

Lees online óf als PDF

Geen vaste maandelijkse kosten

Maak kennis met de verkoper

nienkevanleeuwe

Maak kennis met de verkoper

nienkevanleeuwe Erasmus Universiteit Rotterdam

Bekijk profiel

Volgen

Verkocht

Lid sinds

4 jaar

Aantal volgers

Documenten

Laatst verkocht

0,0

0 beoordelingen

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper nienkevanleeuwe. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €5,99. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews) Afgelopen 30 dagen zijn er 44104 samenvattingen verkocht Opgericht in 2010, al 15 jaar dé plek om samenvattingen te kopen

Samenvatting Statistiek tentamen

Gekoppeld boek

Geschreven voor

Documentinformatie

Onderwerpen

Voorbeeld van de inhoud

Meer vakken binnen Universiteit Twente (UT) > Industrieel Ontwerpen

Maak kennis met de verkoper

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Niet tevreden? Kies een ander document

Betaal zoals je wilt, start meteen met leren

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?