INTRODUCTION TO STATISTICS
CHAPTER 1 – Introduction to Statistics
1. Statistics, science, observations
1.1 definitions
statistics consist of facts and figures
- informative and time-saving
- condense large quantities of information into simple figures
- the term “statistics” refers to a general field of mathematics
- statistics is a shortened version of “statistical procedures”
- research involves gathering information
statistics serve two general purposes:
1) to organize and summarize information, so that the researcher can
see what happened in the research study and can communicate the
results to others
2) to help the researcher answer the questions that initiated the
research by determining exactly what general conclusions are justified
based on the specific results that were obtained
statistics – refer to a set of mathematical procedures for organizing,
summarizing and interpreting information
statistical procedures help ensure that the information is/observations
are presented and interpreted in an accurate and informative way
statistics provide researchers with a set of standardized techniques
- they are recognized throughout the scientific community
- each separate researcher is familiar with the used statistical methods
for any analysis
1.2 populations and samples
research begins with a general question about a specific group of
individuals
the group of interest is called a population
population – the set of all individuals of interest in a particular study
populations are defined by the investigators and their size depends on
what problem the researcher is looking into
the population does not necessarily consist of people – it could be
animals, inanimate objects, corporations
populations are typically very large
the population being studied should always be identified by the
researcher
researchers usually select a more manageable group from the
population and limit their studies to these selected individuals –
sampling
sample – a set of individuals selected from a population, usually
intended to represent the population in a research study
- it is better for it to be representative
- a sample should always be identified in terms of the population from
which it was selected
when a sample has been examined, the goal is to generalize the results
back to the entire population in order to answer the general question
about the population
1.3 variables and data
, researchers are usually interested in specific characteristics of the
individuals in the population/sample or they are interested in outside
factors that may influence the individuals
variable – a characteristic/ condition that changes or has different
values for different individuals
ex. height, weight, gender, personality, environmental conditions
(temperature, time of the day)
in order to demonstrate changes in the variables, it is necessary to
make measurements of the variables being examined
datum/ score/ raw score – the obtained single measurement/
observation for each individual
data set/ data – the complete set of scores
- data – measurements or observations
- data set – a collection of measurements/observations
population/samples can refer to both individuals and scores
- research involves measuring each individual to obtain a score,
therefore every population/ sample of individuals produces a
corresponding population/sample of scores
1.4 parameters and statistics
it is necessary to distinguish whether the data came from a population
or from a sample
parameter – a value (usually numerical)/ characteristic, that describes
a population
- usually derived from measurements of the individuals in the
population
ex. the average score for a population
statistic – a value (usually numerical)/ characteristic, that describes a
sample
- usually derived from measurements of the individuals in the sample
ex. the average score for a sample
research begins with a question about a population parameter
actual data come from a sample and are used to compute sample
statistics
every population parameter has a corresponding sample statistic
statistics from samples are the basis for answering questions about
population parameters
1.5 descriptive and inferential statistical methods
descriptive statistics – statistical procedures used to summarize,
organize and simplify data
- take raw scores and organize/summarize them in a more manageable
form
- the scores are organized in a table/ graph which makes it possible to
see the entire set of scores
- a common technique is to summarize a set of scores by computing an
average
! the average provides a single descriptive value for the entire set
(even if the data set has hundreds of scores)
inferential statistics – consist of techniques that allow us to study
samples and then make generalizations about the populations from
which they were selected
, - use sample data to make general statements about a population
sample statistics are used as the basis for drawing conclusions about
population parameters
samples are selected to be representative
samples provide only limited information about the population
- a sample is not expected to give a perfectly accurate picture of the
whole population
sampling error – the naturally occurring discrepancy/ error (by
chance), that exists between a sample statistic and the corresponding
population parameter
- creates the fundamental problem inferential statistics must always
address
statistics vary from one sample to another because of the specific
people’s characteristics in each sample
it is unlikely that the statistics obtained for a sample will be identical to
the parameters for the entire population
the basic concept of sampling error: sample statistics vary from
one sample to another and typically are different from the
corresponding population parameters
example of sampling error is the error associated with a sample
proportion
ex. a political poll’s results with a +/- 4% (plus-or-minus 4 percentage
points) margin of error
- the margin of error is the sampling error
- the percentages that are reported have been obtained from a sample
and are being generalized to the whole population
- there will always be some margin of error when sample statistics are
used to represent population parameters
differences are not always systematic – they might be the result of
random factors such as chance
- unsystematic differences that exist from one sample to another are
an example of sampling error
1.6 statistics in the context of research
descriptive and inferential statistics are used to organize and interpret
data
sampling error can affect the interpretation of experimental results
- inferential statistical methods are needed to deal with this problem
general stages of a research study:
1) experiment (comparison of the groups)
2) data collection
3) descriptive statistics are used to organize and simplify the data
- provide a simplified, organized description of the scores in a graph/
with an average sample score
4) inferential statistics are used to interpret the results/ the outcome
- the goal of inferential statistics is to help researchers decide on what
interpretation fits the data (whether the results are due to chance or
whether they reflect an actual difference)
QUESTION:
, 1. In general, descriptive statistics are used to summarize the data from a
research study while inferential statistics are used to determine what
conclusions are justified by the results.
2. Data structures, research methods, statistics
2.1 individual variables: descriptive research
some studies describe individual variables as they exist naturally
ex. a college official conducting a survey to describe the eating and
study habits of college students
non-numerical scores are typically described by computing the
proportion/percentage in each category
ex. a newspaper article which states “34.9% of Americans are obese”
which is roughly 35 pounds over healthy weight
2.2 relationships between variables
most research aims to examine relationships between variables
to establish the existence of a relationship, researchers need to make
observations/ measurements of the variables
- the measurements can be classified into two distinct data structures
that also help classify different research methods and different
statistical techniques
data structures:
1) correlational method – one group with two variables measured for
each individual
- observation of the two variables as they exist naturally in order to
examine the relationship = simple measurement of the two variables
for each individual
- consistent patterns in data provide evidence for a relationship
- consistent patterns in the data are easier to see if the scores are
presented in a graph/ scatterplot
QUESTION:
1. In a correlational study how many variables are measured for each
individual and how many groups of scores are obtained?
ANSWER: 2 variables and 1 group
- in a scatterplot each individual is represented by a point/ dot
2) experimental and non-experimental methods – comparing two
or more groups of scores
- the relationship between variables is examined by using one of the
variables to define the groups (independent = manipulated variable)
and then measuring the second variable to obtain scores for each
group (dependent = measured variable)
- systematic differences between groups provide evidence for an
existing relationship
correlational method = correlational research strategy – two
different variables are observed to determine whether there is a
relationship between them
2.3 statistics for the correlational method
The benefits of buying summaries with Stuvia:
Guaranteed quality through customer reviews
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller anniedeshkova. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $8.16. You're not tied to anything after your purchase.