data warehouses correct answers vast digital repositories where data are recorded and stored
electronically
Big Data correct answers data sets so large that traditional methods of storage and analysis are
inadequate
transactional data correct answers data collected for recording the company's transactions
data mining or predictive analytics correct answers the process of using transactional data to
make other decisions and predictions
business analytics correct answers describes any use of statistical analysis to drive business
decisions from data
cases correct answers rows of a data table that correspond to individuals about whom we
record some characteristics
respondents correct answers individuals who answer a survey
subjects or participants correct answers people in an experiment
experimental units correct answers animals, plants, websites, or inanimate objects in an experiment
variables correct answers characteristics recorded about each individual or case; they are the
columns of the data table and identify WHAT has been measured
metadata correct answers typically contains info. about how, when, and where (and maybe why)
the data were collected; who each case represents; and the definitions of all the variables
spreadsheet correct answers where data are typically saved; rows represent cases; columns
represent variables
relational database correct answers when two or more separate data tables are linked together so
that information can be merged across them
relation correct answers each data table in a relational database because it is about a specific set
of cases with info. about each of these cases for all the variables
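To illustrate how two relations can be linked so that information is merged across them, here is a minimal sketch using Python's built-in sqlite3 module. The table names, column names, and rows (customers, orders, customer_id) are made up for illustration and do not come from the study set.

```python
import sqlite3

# Hypothetical data: two relations (tables) that share the key customer_id.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ana"), (2, "Ben")])
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.5)],
)

# Merge information across the two relations through the shared identifier.
merged = conn.execute(
    "SELECT o.order_id, c.name, o.amount "
    "FROM orders AS o JOIN customers AS c ON o.customer_id = c.customer_id "
    "ORDER BY o.order_id"
).fetchall()
conn.close()

for row in merged:
    print(row)
```

Each row of the result combines an order with the name of the customer who placed it, which neither table holds on its own.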
categorical/qualitative variable correct answers when a variable names categories and answers
questions about how cases fall into those categories
quantitative variable correct answers when a variable has measured numerical values with units
and tells us about the quantity of what is measured
identifier variable correct answers a unique identifier assigned to each individual or item in a
group
nominal variables correct answers categorical variables used only to name categories (that don't
have order)
ordinal correct answers when data values can be ordered
time series correct answers variables that are measured at regular intervals over time
cross-sectional data correct answers when several variables are all measured at the same time
point
frequency table correct answers organizes data by recording totals and category names as in the
table below
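As a rough sketch of how a frequency table might be built, the Python snippet below counts how often each category appears and prints the category names with their counts and the total. The region values are hypothetical example data.

```python
from collections import Counter

# Hypothetical categorical data: one value per case.
regions = ["East", "West", "East", "South", "West", "East"]

# Count how many cases fall into each category.
counts = Counter(regions)

# Print category names alongside their counts, then the overall total.
print(f"{'Category':<10}{'Count':>6}")
for category, count in sorted(counts.items()):
    print(f"{category:<10}{count:>6}")
print(f"{'Total':<10}{sum(counts.values()):>6}")
```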
3 rules of data analysis correct answers make a picture (all three rules), because a picture will: 1. reveal things that can't be seen in a table of numbers
2. show important features and patterns in the data
3. provide an excellent means for reporting findings to others
area principle correct answers the area occupied by a part of the graph should correspond to the
magnitude of the value it represents
bar chart correct answers displays the distribution of a categorical variable, showing the counts
for each category next to each other for easy comparison
relative frequency bar chart correct answers a bar chart in which the counts are replaced with
percentages; looks the same as the bar chart, but shows the proportion of cases in each category
rather than counts
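One way to see the difference between counts and relative frequencies is to compute both from the same data. The sketch below uses hypothetical region values and draws a crude text bar for each category. It is an illustration only, not a plotting recipe.

```python
from collections import Counter

# Hypothetical categorical data: one value per case.
regions = ["East", "West", "East", "South", "West", "East"]

counts = Counter(regions)
total = sum(counts.values())

# Replace each count with its percentage of all cases.
relative = {category: 100 * count / total for category, count in counts.items()}

# Text bars: the bar lengths follow the counts, so the shape is the same
# whether the label shows a count or a percentage.
for category in sorted(counts):
    bar = "#" * counts[category]
    print(f"{category:<6} {bar:<4} count={counts[category]} percent={relative[category]:.1f}")
```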
pie charts correct answers show the whole group of cases as a circle sliced into pieces with sizes
proportional to the fraction of the whole in each category
categorical data condition correct answers the data are counts or percentages of individuals in
categories; be sure the categories don't overlap; consider what you are attempting to
communicate about the data
contingency table correct answers shows how individuals are distributed along each variable
depending on the value of the other variable
marginal distribution correct answers the distribution of one variable alone in a contingency
table, given by the row or column totals (the total count for each value of that variable)
cell correct answers any intersection of a row and column of a contingency table that gives the
count for a combination of values of the two variables
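The contingency table terms above can be sketched in a few lines of Python. The two variables (region and plan) and the cases are hypothetical. Each cell counts one combination of values, and the row and column totals give the marginal distributions.

```python
from collections import Counter

# Hypothetical cases, each recording two categorical variables: (region, plan).
cases = [
    ("East", "Basic"), ("East", "Premium"), ("West", "Basic"),
    ("East", "Basic"), ("West", "Premium"), ("West", "Premium"),
]

# Cells: count for each combination of values of the two variables.
cells = Counter(cases)

# Marginal distributions: totals for each variable on its own.
row_totals = Counter(region for region, _ in cases)
col_totals = Counter(plan for _, plan in cases)

plans = sorted(col_totals)
print(f"{'':<8}" + "".join(f"{p:>9}" for p in plans) + f"{'Total':>9}")
for region in sorted(row_totals):
    row = "".join(f"{cells[(region, p)]:>9}" for p in plans)
    print(f"{region:<8}{row}{row_totals[region]:>9}")
print(f"{'Total':<8}" + "".join(f"{col_totals[p]:>9}" for p in plans) + f"{len(cases):>9}")
```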