My D204 WGU Study Set (kencollier3)
Data quality is measured in terms of this: - correct answer-Uniqueness and relevance
For example businesses may be able to use a Twitter API to pull in Twitter data in what kind
of format?
What level of structure is the data? - correct answer-JSON
Semi-structured data
ggplot2, tidyverse, caret are essential libraries of which tool? - correct answer-R
Google Trends is an example of what kind of data? - correct answer-Open Data --> Social
Media
How and where should the safety margin (halfway amount between average time completed
and slowest possible time completed) be added? - correct answer-Spread throughout the
critical path
How is law confined? - correct answer-It's confined to the territory or the place that created it,
not the technology.
If an analyst wants to help create an online store that intelligently recommends certain
products for customers to buy, what type of analysis would they be focusing on? - correct
answer-Predictive (because they are predicting FUTURE habits)
If you're needing to crash a project (speed up the project to get it done on schedule), what
are three ways to do it? - correct answer-Money up
Quality down
Overlapping tasks
In what phase does the analyst deal with the following:
Central Tendency/ Measures of center (e.g., mean, median, mode), variability (e.g., standard
deviations and quartiles) and distributions (e.g., normal, skewed, etc)
Identify basic correlations between variables
Pattern discovery - correct answer-Data exploration/Exploratory Data
Analysis(EDA)/Descriptive Statistics
In what phase does the analyst deal with the following:
Creating training and testing datasets to build models from
Identify/detect patterns
Determine if groups (clusters) exist in data
Classify data into groups
, Create models that "learn" and improve (e.g., machine/deep learning, AI, etc) - correct
answer-Data Mining/Machine Learning/AI/Supervised, Unsupervised Models
In what phase does the analyst deal with the following:
Estimate/project future values or likelihood of an event.
Extend correlations found in EDA to mathematical models
Predict/determine output values based on input values
Cross-validation of predictive models to ensure accuracy. - correct answer-Predictive
Modeling/Data Modeling/Correlation based models/Regression models/Time Series
In what phase does the analyst deal with the following:
Fixing improperly formatted values
Dealing with duplicates, missing data, and outliers
Data reduction - correct answer-Data cleaning/wrangling/scrubbing/munging
In what phase does the analyst deal with the following:
Gather/collect data from a variety of sources
Provide structure to data accessible via relational databases (SQL)
Build data pipeline (ETL)
Use of API to download data from an external source - correct answer-Data acquisition
In what phase does the analyst deal with the following:
Tell a story with data
Provide a summary of analytic analysis
Provide insights to stakeholders
Create insightful graphs that showcase trends and forecasts - correct answer-Reporting and
visualization/Dashboards
In what phase does the analyst identify the stake holders and research questions? - correct
answer-Business Understanding/Planning/Discovery
Read this IRAC story snippet. What step in the IRAC process is being performed?
"Privacy laws inhibit the use of personal information for corporate use" - correct answer-Rule
- state the relevant case laws and statutes
Read this IRAC story snippet. What step in the IRAC process is being performed?
"Since it was based solely on shopping patterns and not private information, there was no
legal breach or violation" - correct answer-Application - Apply relevant rules/laws to the facts
that created the issue
Read this IRAC story snippet. What step in the IRAC process is being performed?
Data quality is measured in terms of this: - correct answer-Uniqueness and relevance
For example businesses may be able to use a Twitter API to pull in Twitter data in what kind
of format?
What level of structure is the data? - correct answer-JSON
Semi-structured data
ggplot2, tidyverse, caret are essential libraries of which tool? - correct answer-R
Google Trends is an example of what kind of data? - correct answer-Open Data --> Social
Media
How and where should the safety margin (halfway amount between average time completed
and slowest possible time completed) be added? - correct answer-Spread throughout the
critical path
How is law confined? - correct answer-It's confined to the territory or the place that created it,
not the technology.
If an analyst wants to help create an online store that intelligently recommends certain
products for customers to buy, what type of analysis would they be focusing on? - correct
answer-Predictive (because they are predicting FUTURE habits)
If you're needing to crash a project (speed up the project to get it done on schedule), what
are three ways to do it? - correct answer-Money up
Quality down
Overlapping tasks
In what phase does the analyst deal with the following:
Central Tendency/ Measures of center (e.g., mean, median, mode), variability (e.g., standard
deviations and quartiles) and distributions (e.g., normal, skewed, etc)
Identify basic correlations between variables
Pattern discovery - correct answer-Data exploration/Exploratory Data
Analysis(EDA)/Descriptive Statistics
In what phase does the analyst deal with the following:
Creating training and testing datasets to build models from
Identify/detect patterns
Determine if groups (clusters) exist in data
Classify data into groups
, Create models that "learn" and improve (e.g., machine/deep learning, AI, etc) - correct
answer-Data Mining/Machine Learning/AI/Supervised, Unsupervised Models
In what phase does the analyst deal with the following:
Estimate/project future values or likelihood of an event.
Extend correlations found in EDA to mathematical models
Predict/determine output values based on input values
Cross-validation of predictive models to ensure accuracy. - correct answer-Predictive
Modeling/Data Modeling/Correlation based models/Regression models/Time Series
In what phase does the analyst deal with the following:
Fixing improperly formatted values
Dealing with duplicates, missing data, and outliers
Data reduction - correct answer-Data cleaning/wrangling/scrubbing/munging
In what phase does the analyst deal with the following:
Gather/collect data from a variety of sources
Provide structure to data accessible via relational databases (SQL)
Build data pipeline (ETL)
Use of API to download data from an external source - correct answer-Data acquisition
In what phase does the analyst deal with the following:
Tell a story with data
Provide a summary of analytic analysis
Provide insights to stakeholders
Create insightful graphs that showcase trends and forecasts - correct answer-Reporting and
visualization/Dashboards
In what phase does the analyst identify the stake holders and research questions? - correct
answer-Business Understanding/Planning/Discovery
Read this IRAC story snippet. What step in the IRAC process is being performed?
"Privacy laws inhibit the use of personal information for corporate use" - correct answer-Rule
- state the relevant case laws and statutes
Read this IRAC story snippet. What step in the IRAC process is being performed?
"Since it was based solely on shopping patterns and not private information, there was no
legal breach or violation" - correct answer-Application - Apply relevant rules/laws to the facts
that created the issue
Read this IRAC story snippet. What step in the IRAC process is being performed?