OMSA Midterm 2 Exam Questions And Accurate
Answers 2024-2025
Overfitting - Number of factors too close to, or larger than number of data points -- fitting
to both real effects and random effects. Includes too many variables in solution!
Avoiding overfitting - Need number of factors same order of magnitude as the number of
points
Need enough factors to get good fit from real effects and random effects
Simplicity - Answer Simple models are better than complex. When fewer factors exist,
less data collection is required -- less chance for including factors that are not
significant. Another reason for variable selection!
DOE - Answer systematic method to determine the relationship between factors
affecting a process and the output of that process.
must make sure either:
1) 2 data sets have same mix
2) break down data into smaller tests that test all factors, not just one.
Forward selection - Solution go step by step either narrowing or building a model --
begin with no factors.
Only add new factors with p-value ≤ 0.1 removing any factors above 0.05.
means: at each step it does the one thing that looks best without taking future options
into consideration
scaled variable selection models - Answer lasso, elastic net, (and ridge even though not
variable selection)
, backward selection - Answer start with model includes all factors and at each step find
worst factor and remove it from the model.
stop when there's no factor bad enough to remove and model no longer contains factors
we'd like to keep.
step-wise regression - Response combination of forward and backward elimination:
start with either all factors, or no factors
after each addition of new factor (and at end): immediately remove any factors that no
longer appear any good. Do not consider future options.
can use AIC or BIC, or model's R squared to decide which factor to add/remove in fwd,
bckwd, or stepwise
Lasso method - Answer adds penalty τ to std reg model. selects coefficients that
minimize the sum of squared errors.
right value of τ depends on two things :
1) how many variables you want
2) quality of model for allowing more variables
Correlation-- chooses only one to have nonzero coefficient. The other is left out.
SAG 19
elastic net-- Solution constrain a combination of absolute value and coefficients and
their squares--a combination of ridge and lasso.
must scale and choose appropriate value of τ+λ
ridge regression--Solution similar to elastic net but without absolute value term (lasso
part).
adds quadratic term that shrinks coefficient values- it pushes the reg coeff to 0 or
regularizes them. Adds bias but shrinks variance as cannot shrink to 0 as easily as
quadratic term
The benefits of buying summaries with Stuvia:
Guaranteed quality through customer reviews
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller Easton. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $11.99. You're not tied to anything after your purchase.