100% tevredenheidsgarantie Direct beschikbaar na betaling Zowel online als in PDF Je zit nergens aan vast
logo-home
Summary Statistical Data Analysis CH 6, 7, 8 €4,99   In winkelwagen

Samenvatting

Summary Statistical Data Analysis CH 6, 7, 8

 14 keer bekeken  0 keer verkocht

Everything you need to know for DSA chapters 6, 7 and 8!

Voorbeeld 2 van de 10  pagina's

  • 21 maart 2022
  • 10
  • 2021/2022
  • Samenvatting
Alle documenten voor dit vak (3)
avatar-seller
femkestokkink
Femke Stokkink Statistical Data Analysis
2666619
2022-03-21
Summary



1 Chapter 6: Nonparametric Methods
1.1 The One Sample Problem

Nonparametric tests:
• Nonparametric tests make no (parametric) assumptions about the underlying distribution F of the data.
So, for example no normality assumption.
• These tests are applicable for broad classes of distributions, and have actual level α. The distribution of
the test statistic under H0 is the same for each distribution F belonging to H0 .
• Nonparametric tests are robust with respect to the level: they have the intended level α for a large class
of distributions.
• Nonparametric tests are more efficient (have higher power) than parametric tests when the (normality)
assumptions are not fulfilled.

One sample problem: assume we have a sample X1 ,...,Xn from an unknown distribution F , and we want to
test the location of F . Which test would you use?

T-test:
• Assumption: X1 ,...,Xn ∼ N(µ,σ 2 )
• Null hypothesis H0 : µ = µ0

• Test statistic T = n X̄−µ
Sx
0



• Distribution: under H0 we have T ∼ tn−1 .
• This is a parametric test (assumes normality) for a composite H0 , consisting of all normal distributions
with expectation µ0

Sign Test:
• Assumption: underlying distribution F has a unique median m, such that P(Xi < m) = P(Xi > m) = 12 .
• Null hypothesis H0 : m = m0 . This is a composite null hypothesis.
• Test statistic T = #(Xi > m0 ) = Σni=1 1Xi >m0
• Distribution: under H0 we have T ∼ bin(n, 12 ). This is a nonparametric test, since T has this distribution
for all F in H0 .
• In case k of the Xi ’s are equal to m0 , delete these k values and perform the test conditionally on k values
equal to m0 , and T ∼ bin(n - k, 21 ) under H0 .

(Wilcoxon) Signed Rank Test:
• Assumption: Underlying distribution F is continuous and symmetric around m.
• Null hypothesis H0 : m = m0 . This is a composite null hypothesis.
• Test statistic V is based on the ranks Ri of the absolute differences |Xi - m0 |. V = Σni=1 Ri sgn(Xi − m0 )
• Distribution: relatively large values of V indicate that m is larger than m0 . Under H0 , V is distributed as
Σni=1 Qi Ri with
– Qi random variable, P(Qi = -1) = P(Qi = 1) = 12 .
– R1 ,...,Rn a random permutation of {1,...,n}.
• Since this distribution is the same for all distributions under H0 , this is a nonparametric test.
• Wilcoxon test is usually more powerfull


1

, 1.2 Asymptotic Efficiency

A more efficient test needs fewer observations to obtain the same power as the less efficient test. Like in the
case of (robust) estimators we will consider the asymptotic case for the number of observations n tending to
infinity, and we will see that the asymptotic variance (now of the test statistic) plays a role in determining the
asymptotic efficiency.

Let X1 ,...,Xn be observations from a distribution F . According to H0 , F belongs to a class F0 (for example, all
distributions with median m), whereas according to H1 , F belongs to a class F1 . The power of a test is the
function π(F ) = PF (H0 is rejected).

For a test to be good π(F ) should be small when F ∈ F0 and large when F ∈ F1 .

It is difficult to compare the power of two tests for every possible F , therefore we focus on shift alternatives:
alternatives that can be obtained by shifting a distribution that belongs to F0 over a certain distance θ. Such
alternatives therefore have a location that is shifted over this distance θ, whereas the scale stays the same. We
limit to right-sided testing problems, so θ > 0.

If, for example, one would consider hypotheses about the median, F0 could be a distribution with median m0 ,
and then F0 would have median m0 + θ. The power for the class of shift alternatives F0 can be written as
πn (θ) = Pθ (H0 is rejected).

For a suitable right-sided test the value πn (0) of the power function under H0 is small, whereas πn (θ) is ‘large’
for θ > 0, so under the alternative hypothesis.

Suppose that H0 is rejected for a large value of the test statistic Tn and assume Tn is asymptotically normally
distributed. This means that for large n, Tn is approximately distributed as N (µ(θ), σ 2 (θ)/n), when θ is the
true value of the parameter. here µ(θ) is the asymptotic mean and σ 2 (θ) the asymptotic variance of Tn
√ 0
The power of the test can then be rewritten to: πn (θ) ≈ 1 − Φ(ξ1−α − n µσ(0)
(0)
θ)

• ξ1−α is the 1-α quantile of the N(0,1) distribution.
µ0 (0)
• σ(0) is the slope of the test. The larger the slope, the better the test.

• A sequence of tests Tn is consistent when for a fixed α the power tends to 1 for each alternative when n
goes to ∞. In this case the sequence of tests is consistent for the shifted alternatives if µ(θ) > µ(0).

0
The asymptotic relative efficiency of the test Tn with respect to T̃m = are(Tn ,T̃m ) = ( µµ̃0 (0)/σ(0) 2
(0)/σ̃(0) ) =
m
n

• Here T̃m is the test statistic of a second test
• If are(Tn ,T̃m ) > 1, then Tn is more efficient than T̃m .


If for a certain sample the a.r.e of two tests is 2/π, then in order to obtain similar power in both tests, the
sample size ratio should be nntest1
test2
= π2 = 0.64

1.3 Two Sample Problems

Two samples can either be paired or independent:
• Paired samples: (X1 , Y1 ), (X2 , Y2 ),..., (Xn , Yn )
• Independent samples: X1 , X2 , ..., Xm and Y1 , Y2 , ..., Yn
For paired samples: if one is interested whether one sample is stochastically larger, consider differences Zi =
Yi - Xi . This then becomes a one sample problem.

Median test:
• Assumption: X1 , .., Xm ∼ F, Y1 , .., Yn ∼ G, F and G continuous and both are i.i.d.


2

Voordelen van het kopen van samenvattingen bij Stuvia op een rij:

Verzekerd van kwaliteit door reviews

Verzekerd van kwaliteit door reviews

Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!

Snel en makkelijk kopen

Snel en makkelijk kopen

Je betaalt supersnel en eenmalig met iDeal, creditcard of Stuvia-tegoed voor de samenvatting. Zonder lidmaatschap.

Focus op de essentie

Focus op de essentie

Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper femkestokkink. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €4,99. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 75759 samenvattingen verkocht

Opgericht in 2010, al 14 jaar dé plek om samenvattingen te kopen

Start met verkopen
€4,99
  • (0)
  Kopen