Summary and Notes: Business Statistics
A quality summary of all lectures of the Business Statistics course. For every distribution, the summary gives an example and the corresponding R command alongside all the information. Everything is clearly marked with colours and bold words. I scored an 8 on this exam myself.

  • November 28, 2021
  • 20
  • 2020/2021
Business statistics notes

Lecture 1

• Data
  • Numerical (quantitative)
    • Continuous (e.g. 14.271…)
    • Discrete (e.g. 1, 2, …)
  • Categorical (qualitative; can be coded numerically)
    • Coded
    • Verbal label

• Measurement level




• Data matrix/frame
• Columns: variables
• Rows: subjects/cases
• Cells: observations

• Warnings
• Coding has no impact on the variable type: a categorical variable coded with numbers is still categorical
• Add metadata to the dataset: a vocabulary with all variable descriptions and coding schemes
• Missing data is often coded (NA), deleted, or imputed (guessed)
• Outliers are observations that behave substantially differently from the bulk of the data; they heavily influence the mean and other outcomes
  • Check whether your results change with versus without the outliers, and report and interpret this
  • Delete
  • Censor at the 99th percentile or a similar cutoff value
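
The outlier advice above can be tried out with a minimal sketch. The data, the helper names `percentile` and `censor_upper`, and the linear-interpolation percentile rule are all assumptions made for illustration; other percentile conventions give slightly different caps.

```python
import statistics

def percentile(sorted_values, p):
    """Linear-interpolation percentile (p between 0 and 100) of a sorted list."""
    k = (len(sorted_values) - 1) * p / 100
    lo = int(k)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (k - lo) * (sorted_values[hi] - sorted_values[lo])

def censor_upper(values, p=99):
    """Replace every value above the p-th percentile by that percentile."""
    cap = percentile(sorted(values), p)
    return [min(v, cap) for v in values]

# Hypothetical data: 99 ordinary observations and one extreme one.
data = [10.0] * 99 + [1000.0]

mean_with = statistics.mean(data)                    # mean including the outlier
mean_without = statistics.mean(data[:-1])            # mean after deleting it
mean_censored = statistics.mean(censor_upper(data))  # mean after censoring at 99%
median_with = statistics.median(data)                # the median barely reacts

print(mean_with, mean_without, mean_censored, median_with)
```

Comparing the three means shows how much a single observation drives the result, which is the comparison the notes ask you to report and interpret.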

• Measures of centre
• The mean: $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$
Excel function: AVERAGE
• The median
• The minimum and maximum observation
• The mid-range: $(x_{\min} + x_{\max})/2$, the average of the minimum and maximum observation (a measure that is sensitive to outliers)
• The 25th and 75th (and other) percentiles: 𝑝th percentile for us will be the

• Geometric mean: $G = \sqrt[n]{x_1 x_2 \cdots x_n}$
Excel function: GEOMEAN
• 𝒌% trimmed mean: the sample mean after discarding the 𝑘% highest and 𝑘% lowest observations (this protects against outliers but uses more information than the median)
* We may prefer to rely on the median, because it is a more easily understood and recognized measure of central tendency. Use the geometric mean only in the context of growth rates.
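
As a rough illustration of the measures of centre listed above, the following sketch uses Python's standard `statistics` module. The sample, the growth factors and the helper names `mid_range` and `trimmed_mean` are made up for this example, and the trimming rule (drop 𝑘% of the observations at each end, rounded down) is one possible convention.

```python
import statistics

def mid_range(values):
    """Average of the smallest and largest observation."""
    return (min(values) + max(values)) / 2

def trimmed_mean(values, k):
    """Mean after dropping the k% lowest and k% highest observations."""
    ordered = sorted(values)
    drop = int(len(ordered) * k / 100)
    return statistics.mean(ordered[drop:len(ordered) - drop])

data = [2, 3, 3, 4, 5, 5, 6, 7, 8, 57]   # hypothetical sample with one outlier

mean = statistics.mean(data)
median = statistics.median(data)
midrange = mid_range(data)
trimmed = trimmed_mean(data, 10)

# Growth factors for +25% followed by -20%: the net growth is zero,
# which the geometric mean reports and the arithmetic mean does not.
growth = [1.25, 0.80]
geo = statistics.geometric_mean(growth)
arith = statistics.mean(growth)

print(mean, median, midrange, trimmed, geo, arith)
```

The outlier pulls the mean and especially the mid-range upward, while the median and the trimmed mean stay close to the bulk of the data.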

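• Measures of variability or spread
• The sample variance $s^2 = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2$ (Excel function: VAR; for a population use VARP) and the standard deviation $s = \sqrt{s^2}$ (Excel function: STDEV)
• Range: $x_{\max} - x_{\min}$
• Interquartile range: $Q_3 - Q_1$
• Mean Absolute Deviation: $\frac{1}{n}\sum_{i=1}^{n} |x_i - \bar{x}|$
• Frequency: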
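
The spread measures above can be sketched with the standard library as follows. The sample is hypothetical, and the quartiles use the default rule of `statistics.quantiles`; other software may use a different quartile convention and report a slightly different interquartile range.

```python
import statistics
from collections import Counter

data = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical sample

n = len(data)
mean = statistics.mean(data)

sample_var = statistics.variance(data)   # divides by n - 1
sample_sd = statistics.stdev(data)       # square root of the sample variance
pop_var = statistics.pvariance(data)     # divides by n

value_range = max(data) - min(data)

q1, q2, q3 = statistics.quantiles(data, n=4)   # quartiles, default 'exclusive' method
iqr = q3 - q1

mad = sum(abs(x - mean) for x in data) / n     # mean absolute deviation around the mean

frequency = Counter(data)                      # how often each value occurs

print(sample_var, sample_sd, pop_var, value_range, iqr, mad, frequency)
```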

• Skewness: a measure of asymmetry

• Kurtosis: a measure of how flat or fat the tails are
A large kurtosis means a higher chance of outliers or huge outcomes.
It is mainly used as a benchmark for normality: Kurt ≈ 3 for a normal distribution.
NB: Excess Kurtosis = Kurtosis − 3, so 𝐾𝑢𝑟𝑡 ≈ 3 is the same as 𝐸𝑥𝑐𝑒𝑠𝑠𝐾𝑢𝑟𝑡 ≈ 0.
Beware: R gives kurtosis, while Excel gives excess kurtosis.
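
To make the difference between kurtosis and excess kurtosis concrete, here is a minimal sketch. It assumes the simple moment-based definition (fourth central moment divided by the squared second central moment), which may differ from the formula used in the lectures; Excel's KURT additionally applies a small-sample correction, so its output will not match this version exactly.

```python
def kurtosis(values):
    """Moment-based kurtosis m4 / m2**2 (about 3 for normally distributed data)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n   # second central moment
    m4 = sum((x - mean) ** 4 for x in values) / n   # fourth central moment
    return m4 / m2 ** 2

def excess_kurtosis(values):
    """Kurtosis measured relative to the normal benchmark of 3."""
    return kurtosis(values) - 3

data = [1, 2, 3, 4, 5]   # hypothetical, evenly spread sample with thin tails

kurt = kurtosis(data)
excess = excess_kurtosis(data)
print(kurt, excess)
```

A negative excess kurtosis, as for this evenly spread sample, points to thinner tails than the normal distribution.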

Lecture 2

• Probability P(A)
- 0 ≤ P(A) ≤ 1: a value below 0 or above 1 is never a valid probability
- A is an event; A’ denotes its complement (A does not occur)

- Odds for A: 𝑃(𝐴) / (1 − 𝑃(𝐴))
Odds against A: (1 − 𝑃(𝐴)) / 𝑃(𝐴)
- 𝑃(𝐴 ∪ 𝐵): the probability of either A or B or both happening
𝑃(𝐴 ∩ 𝐵): the probability of both A and B happening jointly
- General law of addition: 𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) − 𝑃(𝐴 ∩ 𝐵)
- Conditional probability: 𝑃(𝐴|𝐵) = 𝑃(𝐴 ∩ 𝐵)/𝑃(𝐵). The order matters: 𝑃(𝐴|𝐵) is generally not equal to 𝑃(𝐵|𝐴).
- General law of multiplication: 𝑃(𝐴 ∩ 𝐵) = 𝑃(𝐴|𝐵)𝑃(𝐵) = 𝑃(𝐵|𝐴)𝑃(𝐴)
- Disjoint addition: 𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵)
Disjoint: events 𝐴 and 𝐵 in the sample space that have no overlap (𝐴 ∩ 𝐵 is empty)
- Marginal probabilities 𝑃(𝐴) and 𝑃(𝐵): 𝑃(𝐴) = 𝑃(𝐴 ∩ 𝐵) + 𝑃(𝐴 ∩ 𝐵’), and likewise 𝑃(𝐵) = 𝑃(𝐴 ∩ 𝐵) + 𝑃(𝐴’ ∩ 𝐵).
Both are computed from the joint probabilities.
- Bayes' theorem: 𝑃(𝐵|𝐴) = 𝑃(𝐴|𝐵)𝑃(𝐵) / 𝑃(𝐴)

Independence condition 1: 𝑃(𝐴|𝐵) = 𝑃(𝐴)
Independence condition 2: 𝑃(𝐴 ∩ 𝐵) = 𝑃(𝐴)𝑃(𝐵)
*(independent events can each happen on their own;
one event has no connection to the other event's chance of happening)
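
The probability laws above can be checked numerically on a small joint probability table. The four joint probabilities below are invented for this sketch; everything else follows from the laws of addition and multiplication, Bayes' theorem and the product condition for independence.

```python
import math

# Hypothetical joint probabilities for two events A and B (they sum to 1).
p_a_and_b = 0.12
p_a_and_not_b = 0.18
p_not_a_and_b = 0.28
p_not_a_and_not_b = 0.42

# Marginal probabilities are computed from the joint probabilities.
p_a = p_a_and_b + p_a_and_not_b
p_b = p_a_and_b + p_not_a_and_b

# General law of addition.
p_a_or_b = p_a + p_b - p_a_and_b

# Conditional probability and Bayes' theorem.
p_a_given_b = p_a_and_b / p_b
p_b_given_a = p_a_given_b * p_b / p_a

# Independence check: P(A and B) equals P(A) * P(B).
independent = math.isclose(p_a_and_b, p_a * p_b)

# Odds for and against A.
odds_for_a = p_a / (1 - p_a)
odds_against_a = (1 - p_a) / p_a

print(p_a, p_b, p_a_or_b, p_a_given_b, p_b_given_a, independent)
```

In this table the events happen to be independent, so conditioning on B leaves the probability of A unchanged.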


Permutation: $_nP_r = \frac{n!}{(n-r)!}$
Combination: $_nC_r = \frac{n!}{r!\,(n-r)!}$
Factorial: $n! = n(n-1)(n-2)\cdots 1$

- Mutually exclusive events: two events that cannot occur at the same time. If, on the other hand, each event is unaffected by the other events, they are called independent events.
*Independent events cannot be mutually exclusive, and the other way around (as long as both events have a positive probability)
- False positive: 𝑃(𝐴|𝑊’), alarm but no weapon
- False negative: 𝑃(𝐴’|𝑊), no alarm but weapon
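
The counting rules and the two error rates above can be sketched with the `math` module. The values of `n` and `r` and the screening counts are hypothetical, chosen only to show how a false positive rate 𝑃(𝐴|𝑊’) and a false negative rate 𝑃(𝐴’|𝑊) are read off a table of counts.

```python
import math

n, r = 5, 2
factorial = math.factorial(n)     # n!
permutations = math.perm(n, r)    # ordered selections of r out of n
combinations = math.comb(n, r)    # unordered selections of r out of n

# Hypothetical screening counts: A = alarm, W = weapon present.
alarm_no_weapon = 30
no_alarm_no_weapon = 970
alarm_weapon = 19
no_alarm_weapon = 1

# False positive rate: alarm given that there is no weapon.
false_positive = alarm_no_weapon / (alarm_no_weapon + no_alarm_no_weapon)
# False negative rate: no alarm given that there is a weapon.
false_negative = no_alarm_weapon / (no_alarm_weapon + alarm_weapon)

print(factorial, permutations, combinations, false_positive, false_negative)
```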

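• Sample correlation coefficient: $r = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sqrt{\sum_{i=1}^{n}(x_i-\bar{x})^2}\,\sqrt{\sum_{i=1}^{n}(y_i-\bar{y})^2}}$

Its range is −1 ≤ r ≤ +1.
The relation can be described as positive/negative, strong/weak and linear/exponential.
Excel function: CORREL
* The correlation for (𝑋, 𝑌) does not change if we replace the data by (𝑎𝑋, 𝑏𝑌) with 𝑎 > 0 and 𝑏 > 0.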
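
The scaling property above is easy to verify numerically. The sketch below assumes the usual Pearson definition of the sample correlation; the helper name `pearson_r`, the data and the scaling constants are made up for illustration.

```python
import math

def pearson_r(xs, ys):
    """Sample (Pearson) correlation coefficient of two equally long lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

x = [1, 2, 3, 4, 5]   # hypothetical data
y = [2, 4, 5, 4, 5]

r = pearson_r(x, y)

# Rescaling with positive constants a = 3 and b = 0.5 leaves r unchanged.
r_scaled = pearson_r([3 * v for v in x], [0.5 * v for v in y])

# A negative constant flips the sign, which is why a > 0 and b > 0 are required.
r_flipped = pearson_r([-v for v in x], y)

print(r, r_scaled, r_flipped)
```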

Seller: melissabakker2 (via Stuvia)
