100% tevredenheidsgarantie Direct beschikbaar na je betaling Lees online óf als PDF Geen vaste maandelijkse kosten 4.2 TrustPilot
logo-home
College aantekeningen

Statistics and Data Analysis (in Collab) final exam notes

Beoordeling
-
Verkocht
1
Pagina's
28
Geüpload op
23-02-2023
Geschreven in
2020/2021

Statistics uses mathematical tools to organize and summarize data obtained from the real world and draw conclusions from a correct interpretation of these data. In the business world, statistics can help assess the attractiveness of a business opportunity, increase customer satisfaction, choose between different investment possibilities, analyze and improve production processes, etc. Students following this course learn how to define the data required in different situations characterized by uncertainty, collect and summarize these data, and make decisions based on data analysis. This notes also provide the theoretical and practical bases for other systems in the degree.

Meer zien Lees minder
Instelling
Vak










Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Geschreven voor

Instelling
Studie
Vak

Documentinformatie

Geüpload op
23 februari 2023
Aantal pagina's
28
Geschreven in
2020/2021
Type
College aantekeningen
Docent(en)
Giovanna lamastra pacheco
Bevat
Alle colleges

Onderwerpen

Voorbeeld van de inhoud

APUNTES STATS FINAL

1. Types of Variables – Python




2. Python Libraries


The most important libraries we will use during this course are:

• numpy (np): for high-level mathematical functions/numerical analysis
• scipy.stats (ss): for probability distributions
• pandas (pd): for data structuring and manipulation
• matplotlib.pyplot (plt): for plots

3. Pandas

Some useful functions to always remember:

• describe(): summary statistics for each column of the dataset
• head(): print the first 5 rows of the data set
• tail(): print the last 5 rows of the dataset
• dtypes: type of variable in each column
• shape: number of rows and columns

, 4. Manipulation with Pandas

Filter rows (slicing), example: assume we want to select only students with age
between 30 and 33 included.




Create a new column, example: assume we want to add a new column, recording the
gender of students. Here you can see the gender of the 3 students: male, female, male




We want to assign a letter grade to students, with the following rule:

• gpa >= 8 letter grade A

• 6 <= gpa<8 letter grade B

• gpa < 6 letter C

, Sort columns, example: assume we want to sort the data frame with respect to the value
of a given column. We want to get the data frame sorted with respect to the gpa, in
decreasing order:




5. Graphs

We can describe categorical variables using frequency distribution tables and graphs
such as bar charts, pie charts and histograms.



FREQUENCY DISTRIBUTION TABLE

A frequency distribution is a table used to organize data. The left column (called
classes) includes all possible responses to a variable being studied. The right column is
a list of the frequencies (number of observations of each class);

A cumulative frequency distribution contains the total number of observations whose
values are less than the upper limit for each class. It is used to determine the number of
observations that lie above (or below) a particular value;

A relative/percentage frequency distribution is obtained by dividing each frequency
by the total number of observations (n). It can be expressed as a percentage;

A relative/percentage cumulative frequency distribution is the quotient between the
cumulative frequency of a particular value and the total number of observations (n). It
can be expressed as a percentage.
$12.69
Krijg toegang tot het volledige document:

100% tevredenheidsgarantie
Direct beschikbaar na je betaling
Lees online óf als PDF
Geen vaste maandelijkse kosten

Maak kennis met de verkoper
Seller avatar
martaescrivderomancebrin

Maak kennis met de verkoper

Seller avatar
martaescrivderomancebrin IE University
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
3
Lid sinds
2 jaar
Aantal volgers
0
Documenten
10
Laatst verkocht
1 jaar geleden

0.0

0 beoordelingen

5
0
4
0
3
0
2
0
1
0

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen