100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached
logo-home
VU Master Health Sciences: Advanced Statistics - Summary lectures and literature: How to analyse clustered and longitudinal data and how to interpret the results? $7.92
Add to cart

Summary

VU Master Health Sciences: Advanced Statistics - Summary lectures and literature: How to analyse clustered and longitudinal data and how to interpret the results?

 92 views  10 purchases
  • Course
  • Institution
  • Book

All the practical information on clustered data and longitudinal data in a very compact version. How to apply the appropriate analysis step by step? How to interpret the results? - General information about Standard regression analysis (linear and logistic), Clustered data and Longitudinal data. Ba...

[Show more]

Preview 6 out of 30  pages

  • No
  • 1 t/m 9
  • December 3, 2021
  • 30
  • 2021/2022
  • Summary
avatar-seller
Standard regression analysis (SRA)
Linear regression analysis
- Describes relationship by fitting line to observed data.
- Uses straight line (logistic/non-linear models use a curved line).
- Estimating how dependent variable changes as independent variable(s) change.


- Y = predicted value dependent for any given value of the independent variable
- B0 = intercept
Predicted value Y when X = 0
- B1 = regression coefficient
How much Y changes as X increases
Denotes magnitude of change in Y
- X = independent variable
- E = error of the estimate
The amount of variation in the estimate of the regression coefficient

OBTAINING REGRESSION LINE
- Least square means = finding best fit for data set points by minimizing the sum of residuals of
points from the plotted curve.

3 steps:
1. Square each residual
2. Sum all squared residuals
3. Minimize the total of the squared values

ASSUMPTIONS
1. Linear relationship between X and Y
2. Normal distribution residuals
3. Homoscedasticity residuals
4. Independent observations




- Independent-samples T-test = to compare means between 2 unrelated groups on the same
continuous Y.
- One-way ANOVA = to compare means between >3 unrelated groups on the same continuous Y.

LINEAR REGRESSION ANALYSIS OUTCOME

,Logistic regression analysis
- Describes relationship between binary Y and >1 other covariates (X).
- To fit a line between observations Y is transformed into the logarithm of the odds (ln(odds)).
- E-power of b1 = odds ratio (Exp(B)).


- Ln(odds of …) = the natural log of the odds of the outcome
- B0 = intercept
The natural log of the odds of the outcome when X = 0
- B1 = regression coefficient
How much ln(odds of …) changes when X changes with 1 unit
 taking the E-power of b1 gives the odds ratio (more easy to interpret)

LOGISTIC REGRESSION ANALYSIS OUTCOME




Confounding
- Confounding = a distortion that modifies an relationship between exposure and outcome, because
the factor is associated with both exposure and outcome.
- Re-assess b1 after adding the potential confounder (b2) into the model.
 compare crude b1 and adjusted b1 and calculate the percentage difference in the b1

Crude b1 Adjusted b1
Logistic -0.461 -0.388
Linear 2.149 2.212

Calculation logistic
-0.% = -0.00461
1% = -0.00461
-0.388 / -0.00461 = 84
100 – 84 = 16%  >10% there is confounding by
sex

Calculation linear
2.% = 0.02149
1% = 0.02149
2..02149 = 102.9
100 – 102.9 = -2.9%  <10% there is no confounding by sex

,Effect modification
- Effect modification = when magnitude of effect exposure (X) on Y differs between the level of the
third variable.
 exposure having different effects
- Assess p-value of interaction term (third variable).
- If significant, results of effect should be reported separately for the different subgroups.

,Basic principles linear mixed model
analysis
- Linear MMA extended version of linear regression analysis.
- Clustering is present when a set of objects group in such a way that objects in the same group (=
cluster) are more similar to each other than to those in other clusters.
- Observations within clusters are correlated with each other.
- You have to take this into account in your analysis with MMA.




Similar approach in regression models with clustering:
1. Intercept (u)
2. Slope (uk)
3. Intercept and slope (ukj)

General idea MMA – 3 steps
1. Estimate intercepts(/slopes) for different groups
2. Draw a normal distribution over the different intercepts(/slopes)
3. Estimate the variance of that normal distribution

Covariance between random slope & random
intercept
- Also known as the covariance (interaction) between the random slope and
random intercept.

1. Negative covariance
Indicates inverse relationship
For levels with relatively high intercept, a relatively low slope is observed
2. Positive covariance
Indicates same relationship
4. For levels with relatively high intercept, a relatively high slope is observed

Intraclass Correlation Coefficient (ICC) – ICC as
indicator
- ICC = indication average correlation of observations of subjects living in the same cluster.
- Indicates how strong units in the same group resemble each other (correlation).
- When calculating ICC with a model that includes an X variable, remaining variance is lower.
- Pure ICC calculated with intercept-only model (model without X).

,Variance used as explanation (specific application MMA)
- Using random effects for explanation differences.
- % of the difference in Y between the levels of the cluster is explained by X.
- Calculate with random intercept of the intercept-only model and the random intercept of the
model with X.

, Example linear MM
Explained with cross-sectional cohort study investigating the relationship between X (physical activity = PA)
and Y (health).
- Two-level structure
Subject = lowest level
Neighbourhood (NBH) = highest level
- Linear regression analysis should adjust for NBH by MMA.

1. Naïve linear MMA
- Without an adjustment for NBH




General information
- MMA without adjustment
- Log likelihood
- Number of observations
Fixed part
- Activity = b1, with standard error (S.E.), z-value, corresponding p-value and 95% CI estimated
around the b1.
Difference in health when there is 1 unit difference in PA
- _cons = intercept
Value of health when PA equals 0
Random part
- Var(Residual) = residual variance (the error variance/unexplained variance).
Because it’s a naïve model, random part only contains variance of the residual.

2. Add random intercept to the model
- Adding a random intercept on cluster level to the model.
- To adjust for NBH level.

The benefits of buying summaries with Stuvia:

Guaranteed quality through customer reviews

Guaranteed quality through customer reviews

Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.

Quick and easy check-out

Quick and easy check-out

You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.

Focus on what matters

Focus on what matters

Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller Student1064. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $7.92. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews)

53068 documents were sold in the last 30 days

Founded in 2010, the go-to place to buy study notes for 14 years now

Start selling
$7.92  10x  sold
  • (0)
Add to cart
Added