100% tevredenheidsgarantie Direct beschikbaar na betaling Zowel online als in PDF Je zit nergens aan vast
logo-home
Experimental Design and Analysis - Summary Slides €16,99
In winkelwagen

Samenvatting

Experimental Design and Analysis - Summary Slides

 2 keer bekeken  0 keer verkocht

A summary of all the slides for the course Experimental Design and Analysis, MSc AI.

Voorbeeld 4 van de 76  pagina's

  • 31 december 2024
  • 76
  • 2023/2024
  • Samenvatting
Alle documenten voor dit vak (2)
avatar-seller
tararoopram
Experimental Design and Data Analysis - Summary


Lecture 0
What is experimental design?
● Experiments are performed with varied preconditions represented by ind. variables, also
referred to as input variables or predictor variables.
● The change in predictors is hypothesized to result in a change in one or more dep.
variables, also referred to as output or response variables.
● The experimental design may also identify control variables that must be held constant
to prevent external factors from affecting the results.
● Experimental design involves also planning the experiment under statistically optimal
conditions given the constraints of available resources.
● Main concerns in experimental design: validity, reliability, replicability, achieving
appropriate levels of statistical power and sensitivity.
● Ronald Fisher: The Arrangement of Field Experiments (1926) and The Design of
Experiments (1935).

Experimental design, randomization
● Statistics allows to generalize from data to a true state of nature, but statistical inference
requires assumptions and mathematical modeling.
● The data should be obtained by a carefully designed experiment (or at least it must be
possible to think about the data in this way).
● Any good design involves a chance element: “experimental units” are assigned to
“treatments” by chance, or by randomization. The purpose is to exclude other possible
explanations of an observed difference.
● We need probability to quantify the randomization. In practice, randomization is
implemented with a random number generator. In R:




Examples, observational studies
1. To compare two fertilisers we prepare 20 plots of land, apply the first fertiliser to 10
randomly chosen plots and the second one to the remaining plots. We plant a crop and
measure the total yield from each plot.
2. To compare two web designs we randomly select 50 subjects and measure the time
needed to find some information. All 50 subjects perform this task with both designs, but
for each subject the order of the two designs is based on tossing a coin.
3. If an experiment involves subjects, then it could be wrong to assign “task A” to the first
10 subjects who arrive and “task B” to the last 10. (There may be a reason for arriving
early.) Instead assign the tasks at random. Then an observed difference is due to the
task (or chance).


1

,Experimental Design and Data Analysis - Summary


a. Data obtained by registering an ongoing phenomenon, without randomization or
applying other controls, is called observational.
4. The incidence of lung cancer among 500 smokers is observed to be higher than among
500 non-smokers. Does this finding generalize to the full population? Does this show
that smoking causes lung cancer?

Probability distributions: continuous, discrete
● A probability distribution P determines the probability of different outcomes of a
random variable.
● Probability distributions for:
○ discrete random variables which have finite or countable sets of possible
outcome values (e.g., dice, coins, birthdays);
○ continuous random variables which have infinite sets of possible outcome
values (e.g., temperature, length).
● The corresponding probability distributions: continuous, discrete.
● Note: There are distributions which neither continuous nor discrete.

Probability density functions
● Examples of the probability density p of some continuous distributions
(realised also in R with some default parameter values):
○ normal distribution norm with parameters μ mean=0 and σ sd=1




○ exponential distribution exp with parameter λ (lambda=1)



○ uniform distribution unif with parameters minimum (min=a) and
maximum (max=b) of the support interval



○ Gamma distribution gamma with parameters shape shape and
rate rate=1.

Probabilities of events – continuous distribution
● If a random variable X has a distribution with the density p(x), then



● In other words, the probability to have an outcome in some interval I is the area under
the density function p(x) over that interval.




2

,Experimental Design and Data Analysis - Summary


● Example. For X ∼ N(0,1),




● In events for continuous distributions:
< or ≤ (> or ≥) does not matter.

Location and scale, normal density
● Two important characteristics of a population are location
(or mean) µ and scale (or standard deviation) σ.
● The normal density curve is given by



● The parameters µ and σ are the location and scale. Normal
distributions with different µ and σ are still similar in a way.
● Note: The normal curve is very specific! There are many
“bell shaped” curves that are not normal.

Other symmetric and asymmetric densities




Probabilities and quantiles
● If a random variable X is distributed according to a density curve, the probability P(X ≤ u)
is the (red) area under the density curve left of u.
● Likewise, P(X ≥ u) is the (green) area under the density curve right of u




3

, Experimental Design and Data Analysis - Summary


● For distribution P, the quantile of level α ∈ (0, 1) is the number qα such that P(X ≤ qα) =
α, the upper quantile uα such that P(X ≥ uα) = α.
● For the standard normal distribution, the quantile and upper quantile are usually denoted
by ξα and zα.

Probability of events – discrete distribution
● For discrete distributions we have a probability mass function p
○ p(x) = P(X = x).
● The probability to have an outcome in some set A is the sum



● Examples of discrete distributions are binomial and Poisson




Probability mass functions for some discrete distributions
● Discrete distributions (realised also in R):
○ Binomial distribution binom with parameters n size and p prob



○ Poisson distribution pois with parameter λ lambda




Cumulative distribution/probability function
● The cumulative distribution function (CDF) (sometimes also called cumulative
probability function) of a random variable X is F(u) = P(X ≤ u) = pdist(u,par) (continuos
and discrete)

● Continuous distr.:
● Any other probability can be computed via F(u), e.g., for any a ≤ b,
P(a < X ≤ b) = P(X ≤ b) − P(X ≤ a) = F(b) − F(a).




4

Voordelen van het kopen van samenvattingen bij Stuvia op een rij:

Verzekerd van kwaliteit door reviews

Verzekerd van kwaliteit door reviews

Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!

Snel en makkelijk kopen

Snel en makkelijk kopen

Je betaalt supersnel en eenmalig met iDeal, creditcard of Stuvia-tegoed voor de samenvatting. Zonder lidmaatschap.

Focus op de essentie

Focus op de essentie

Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper tararoopram. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €16,99. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 48298 samenvattingen verkocht

Opgericht in 2010, al 15 jaar dé plek om samenvattingen te kopen

Start met verkopen
€16,99
  • (0)
In winkelwagen
Toegevoegd