Samenvatting

Samenvatting Research Training II Tutorials (BPN1104)

Name: Samenvatting Research Training II Tutorials (BPN1104)
SKU: doc_1960722
Rating: 5.00 (1 reviews)
Author: jodiesilvius

1 beoordeling

2 keer verkocht

Instelling
Erasmus Universiteit Rotterdam (EUR)

Alle codes van R studio die zijn uitgelegd in het vak Onderzoeksvaardheden II samengevat in een overzicht. All codes of R Studio that are used and explained in the course of Research Methods II.

[Meer zien]

Voorbeeld 3 van de 16 pagina's

Bekijk voorbeeld

Geupload op 14 september 2022
Aantal pagina's 16
Geschreven in 2021/2022
Type Samenvatting

1 beoordeling

Door: gabrielanwillems • 1 jaar geleden

Volgen

jodiesilvius Lid sinds 2 jaar 10 documenten verkocht

€10,39

Ook beschikbaar in voordeelbundel v.a. €17,99

In winkelwagen

Op verlanglijstje

100% tevredenheidsgarantie
Direct beschikbaar na je betaling
Lees online óf als PDF
Geen vaste maandelijkse kosten

Ook beschikbaar in voordeelbundel (1)

Gebundelde samenvatting Research Training 2 theorie en codes

€ 22,38 € 17,99

5x verkocht

2 items

1. Samenvatting - Samenvatting research training ii tutorials (bpn1104)
2. Samenvatting - Samenvatting research training ii theorie (bpn1104)
Meer zien

Onderzoeksvaardigheden II – Tutorials

Module 1

#Package “gmodels” for tables

Library(gmodels)
Library(ggplot2)
Library(stargazer)
Library(psych)

dir <- "~/Documents/ERASMUS
RSM/Onderzoeksvaardigheden/Tutorials/Data" /"

dirData <- paste0(dir, "Data/")
dirProg <- paste0(dir, "Programs/")
dirRslt <- paste0(dir, "Results/")

Ontbrekende waarden / missing values:

colSums(is.na(dsTitanic))

DATA FACTORS

dsLiving$fLiving <- factor(dsLiving$cLiving, levels=c(1:5), labels=c(“Student housing”,
“Private rent”, “Parents”, “Own house”, “Other”))
levels(dsLiving$fLiving)

when you turn it into a factor then you have to specify the number of outcomes that’s where
the “levels” is for. R understands that this is categorical data, which is necessary when later
on making use of graphics or other analyses.

FREQUENCY TABLES AND VISUALISATION
Visualization of the information between two qualitative variables

#Frequency table
Use of function tables
Table(dsLiving$cLiving)
Table(dsLiving$fFraternity)
Table(‘living situation’ = dsLiving$fLiving, Membership = dsLiving$dFraternity)

#Use of function xtabs
It generates the same table, but operates different. Structured and layout.
Tbl <- xtabs( ~ cLiving + dFraternity, data = dsLiving)

#make a table with margins totals (as in the slides of Module 1, totals of rows and columns)
Addmargins(tbl)

,#GGplot for these tables bij sommige data neemt de x-as niet optie 1 of 2 maar een
numerieke maat, dit is fout.
Ggplot(dsLiving, aes(x = fliving)) + geom_bar(fill = “orange”, col= “black”) + xlab(“Living
conditions (cliving)”)
Ggsave(paste0(dirRslt, “Tutorial01.pdf”), width = 8, height = 6)

#make grouped tables/bar charts with GGplot

Ggplot(dsLiving, aes(x=fFraternity, fill=fliving)) + geombar(position= “dodge”) +
ylab(“Frequentie”) + xlab(“Lidmaatschap”) + scale_fill_brewer(“Woonsituatie”, palette=
“Set1”)
Ggsave function

The dodge function makes the table put the data next to each other instead of stacked on
top of each other, which makes it easier to interpret data. Woonsituatie is stating the legend
and the colours the bars will have.

ANALYSIS OF STATISTICAL INDEPENDENCE between 2 qualitative variables (categorical data)
#Chi square test

Step 1 make a frequency table
Tbl <- table(dsLiving$cliving, dsLiving$dFraternity)

Step 2 the chi square test
Chisq.test(tbl) 1st value is observed value, degrees of freedom & p-value. The approximation
of the Chisquare test is better the larger the sample size (observed values). (rule of 5!)
However, this cannot be checked by the frequency tables because these are observed and
not the expected frequencies.

Step 3 extract information from object. To check the expected frequencies
RsltChisq <- chisq.test(tbl)
Str(rsltChisq)

With this you can already see the observed and expected values

rsltChisq$statistic  observed value of the statistic
rsltChisq$parameter  degrees of freedom
rsltChisq$p.value  p-value

round(cbind(rsltChisq$observed, rsltChisq$expected, 3)

Step 4 find cells with expected frequencies below 5 OPTION 1

Which(rsltChisq$expected < 5)
Which(rsltChisq$expected < 5, arr.ind = TRUE) arr.ind makes it more clear which row and
column the value below 5 is in

, Step 5 remove rows with sparse outcomes reanalyse the relationship between the variables.

dsLiving.tmp <- dsLiving[!(dsLiving$cLiving ==5), c(“cLiving”, “dFraternity”)]

Step 6 re-make the frequency table
Tbl <- table(dsLiving.tmp$cLiving, dsLiving$dFraternity)

Step 7 Find the chi-square test results
Chisq.test(tbl)
The warning message will not show anymore.

OPTION 2 – combining rows with sparse outcomes (to leave out <5)
Step 1 – copy data to temporary data frame
DsLiving.tmp <- dsLiving[c(“cLiving”, “dFraternity”)]

Step 2- adjust the value
dsLiving.tmp$cLiving[dsLiving.tmp$cLiving==5] <-4 combining outcome 5 with outcome 4

Step 3 – remake the frequency table
Tbl <- table(dsLiving.tmp$cLiving, dsLiving.tmp$dFraternity)

Step 4 – Find the chisquare test results
Chisq.test(tbl)

Also no warning message.

#if the expected frequencies are falling short of the rule of 5 then we cannot use chisq and
you cannot combine rows/columns with a 2x2 table and all expected values are above 5
 Yates continuity correction - contingency analysis

Step 1 make a frequency table
Tbl <- table(dsLiving$dSports, dsLiving$dFraternity)

Step 2 Find the chisq test results
Chisq.test(tbl)
Tmp <- chisq.test(tbl)

Step 3 use phi see slides for uitleg
Phi <- sqrt(tmp$statistic/sum(tbl))

R automatically applies this correction of ½ in the formula.

# 2x2 table but the expected values are still below 5
Fisher’s exact test
Step 1 make a frequency table
Tbl <- table(dsLiving$dSports, dsLiving$dFraternity)

Dit zijn jouw voordelen als je samenvattingen koopt bij Stuvia:

Bewezen kwaliteit door reviews

Studenten hebben al meer dan 850.000 samenvattingen beoordeeld. Zo weet jij zeker dat je de beste keuze maakt!

In een paar klikken geregeld

Geen gedoe — betaal gewoon eenmalig met iDeal, creditcard of je Stuvia-tegoed en je bent klaar. Geen abonnement nodig.

Direct to-the-point

Studenten maken samenvattingen voor studenten. Dat betekent: actuele inhoud waar jij écht wat aan hebt. Geen overbodige details!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper jodiesilvius. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €10,39. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 69411 samenvattingen verkocht

Opgericht in 2010, al 15 jaar dé plek om samenvattingen te kopen

Begin nu gratis

Samenvatting

Samenvatting Research Training II Tutorials (BPN1104)

Document informatie

Onderwerpen

Geschreven voor

1 beoordeling

Verkoper

Ontvangen beoordelingen

Voorbeeld van de inhoud

Dit zijn jouw voordelen als je samenvattingen koopt bij Stuvia:

Bewezen kwaliteit door reviews

In een paar klikken geregeld

Direct to-the-point

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?