Statistics and Data Analysis (in Collab) final exam notes

Statistics uses mathematical tools to organize and summarize data obtained from the real world and draw conclusions from a correct interpretation of these data. In the business world, statistics can help assess the attractiveness of a business opportunity, increase customer satisfaction, choose bet...


  • February 23, 2023
  • 28
  • 2020/2021
  • Class notes
  • Giovanna lamastra pacheco
  • All classes
STATS FINAL NOTES

1. Types of Variables – Python




2. Python Libraries


The most important libraries we will use during this course are:

• numpy (np): for high-level mathematical functions/numerical analysis
• scipy.stats (ss): for probability distributions
• pandas (pd): for data structuring and manipulation
• matplotlib.pyplot (plt): for plots
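
As a minimal sketch, the aliases in parentheses above correspond to the following import lines. The sample values and variable names below are made up for illustration and are not part of the course material.

```python
import numpy as np
import scipy.stats as ss
import pandas as pd
import matplotlib.pyplot as plt  # imported here only to show the alias

# Illustrative values, not course data
values = np.array([1.0, 2.0, 3.0, 4.0])

mean_value = np.mean(values)        # numpy: numerical functions
prob_below_zero = ss.norm.cdf(0)    # scipy.stats: P(Z <= 0) for a standard normal
series = pd.Series(values, name="x")  # pandas: labelled data structure
```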

3. Pandas

Some useful functions to always remember:

• describe(): summary statistics for each numeric column of the dataset
• head(): the first 5 rows of the dataset (by default)
• tail(): the last 5 rows of the dataset (by default)
• dtypes: data type of each column (an attribute, so no parentheses)
• shape: number of rows and columns, as a tuple (also an attribute)
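
The items above can be tried on a small data frame. This is only a sketch: the `students` table, its column names and its values are hypothetical, invented for illustration.

```python
import pandas as pd

# Hypothetical data, invented for illustration
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta", "Pablo", "Sara", "Juan"],
    "age": [29, 31, 33, 35, 30, 32],
    "gpa": [8.5, 6.2, 5.4, 7.9, 9.1, 6.0],
})

summary = students.describe()     # count, mean, std, min, quartiles, max (numeric columns)
first_rows = students.head()      # first 5 rows by default
last_rows = students.tail()       # last 5 rows by default
column_types = students.dtypes    # attribute: no parentheses
n_rows, n_cols = students.shape   # attribute: (rows, columns)
```

Passing a number, for example `students.head(3)`, changes how many rows are returned.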

4. Manipulation with Pandas

Filter rows (slicing), example: assume we want to select only the students whose age
is between 30 and 33, both included.
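
The original figure is missing here. One possible way to write this filter, assuming a hypothetical `students` data frame with an `age` column, is a boolean mask:

```python
import pandas as pd

# Hypothetical data, invented for illustration
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta", "Pablo"],
    "age": [29, 30, 33, 35],
})

# Each condition needs its own parentheses; & combines them element-wise
in_range = students[(students["age"] >= 30) & (students["age"] <= 33)]

# Equivalent: Series.between includes both endpoints by default
in_range_alt = students[students["age"].between(30, 33)]
```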




Create a new column, example: assume we want to add a new column recording the
gender of the students. The genders of the 3 students are: male, female, male.
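
The original figure is missing here. A minimal sketch, assuming a hypothetical three-row `students` data frame (names and ages are invented; only the gender values come from the notes):

```python
import pandas as pd

# Hypothetical data, invented for illustration
students = pd.DataFrame({
    "name": ["Luis", "Marta", "Pablo"],
    "age": [30, 33, 35],
})

# Assigning a list to a new column name creates the column;
# the list must have one value per row, in row order
students["gender"] = ["male", "female", "male"]
```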




We want to assign a letter grade to students, with the following rule:

• gpa >= 8: letter grade A

• 6 <= gpa < 8: letter grade B

• gpa < 6: letter grade C
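
One way to apply this rule is `np.select`, which checks the conditions in order and takes the first one that holds. This is a sketch, not necessarily the method used in class; the `students` data and the column name `letter` are assumptions.

```python
import numpy as np
import pandas as pd

# Hypothetical data, invented for illustration
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta", "Pablo", "Sara"],
    "gpa": [8.5, 6.0, 5.4, 8.0, 7.9],
})

# Conditions are evaluated in order, so the second one only applies when gpa < 8
conditions = [students["gpa"] >= 8, students["gpa"] >= 6]
choices = ["A", "B"]

# Rows matching no condition (gpa < 6) receive the default "C"
students["letter"] = np.select(conditions, choices, default="C")
```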

Sort rows by a column, example: assume we want to sort the data frame by the values
of a given column. Here we sort the data frame by gpa, in
decreasing order:
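
The original figure is missing here. A minimal sketch using `sort_values`, with a hypothetical `students` data frame:

```python
import pandas as pd

# Hypothetical data, invented for illustration
students = pd.DataFrame({
    "name": ["Luis", "Marta", "Pablo"],
    "gpa": [6.2, 5.4, 8.5],
})

# ascending=False gives decreasing order; the result is a new data frame
sorted_students = students.sort_values(by="gpa", ascending=False)
```

The original row labels are kept in the sorted result; `reset_index(drop=True)` renumbers them if needed.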




5. Graphs

We can describe categorical variables using frequency distribution tables and graphs
such as bar charts and pie charts. Histograms play the same role for numerical variables.
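
As an illustration, a bar chart and a pie chart of a categorical variable can be drawn with matplotlib. This is a sketch with invented data; the `Agg` backend line is an assumption that lets the script run without a display and is not needed in a notebook, where `plt.show()` would display the figure.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: non-interactive backend, no display needed
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical categorical data, invented for illustration
gender = pd.Series(["male", "female", "male", "female", "female"])
counts = gender.value_counts()  # frequency of each category

fig, (ax_bar, ax_pie) = plt.subplots(1, 2, figsize=(8, 4))

# Bar chart: one bar per category, height = frequency
ax_bar.bar(list(counts.index), counts.values)
ax_bar.set_ylabel("Frequency")
ax_bar.set_title("Bar chart")

# Pie chart: one slice per category, size = share of the total
ax_pie.pie(counts.values, labels=list(counts.index), autopct="%1.0f%%")
ax_pie.set_title("Pie chart")

fig.tight_layout()
```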



FREQUENCY DISTRIBUTION TABLE

A frequency distribution is a table used to organize data. The left column (called
classes) includes all possible responses to a variable being studied. The right column is
a list of the frequencies (number of observations of each class).

A cumulative frequency distribution contains the total number of observations whose
values are less than the upper limit for each class. It is used to determine the number of
observations that lie above (or below) a particular value.

A relative/percentage frequency distribution is obtained by dividing each frequency
by the total number of observations (n). It can be expressed as a percentage.

A relative/percentage cumulative frequency distribution is the quotient between the
cumulative frequency of a particular value and the total number of observations (n). It
can be expressed as a percentage.
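
The four kinds of table described above can be built in pandas as follows. This is a sketch with invented data; the column names are assumptions, and the cumulative columns only make sense when the classes have a natural order (here, letter grades).

```python
import pandas as pd

# Hypothetical responses, invented for illustration
answers = pd.Series(["B", "A", "C", "B", "B", "A", "C", "B", "A", "B"])

# Frequency distribution: classes in the index, counts in the column
freq = answers.value_counts().sort_index()
table = pd.DataFrame({"frequency": freq})

# Cumulative frequency: running total of the frequencies
table["cumulative"] = table["frequency"].cumsum()

# Relative frequencies: divide by the total number of observations n
n = table["frequency"].sum()
table["relative"] = table["frequency"] / n
table["cumulative_relative"] = table["cumulative"] / n
```

Multiplying the two relative columns by 100 expresses them as percentages.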
