AE 07: Practicing with probabilities
Goal
Understand and calculate basic probabilities using a real-world dataset.
Data
A cohort study on coffee consumption and mortality from:
Did not die |
Died | |
---|---|---|
Does not drink coffee | 5438 | 1039 |
Drinks coffee occasionally | 29712 | 4440 |
Drinks coffee regularly | 24934 | 3601 |
Source: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5788283/
Definitions:
Event A: The person died.
Event B: The person is a non-coffee drinker.
Two important rules
Suppose we have events
- Complement Rule:
- Inclusion-Exclusion:
Exercises
Calculate the following probabilities for a randomly selected person in the cohort:
Calculate the total number of people in the cohort:
Add response here.
Calculate
: Probability that the person died
Add response here.
Calculate
: Probability that the person is a non-coffee drinker
Add response here.
Calculate
: Probability that the person died and is a non-coffee drinker
Add response here.
Calculate
using the formula for the union of two events:
Add response here.
Calculate
: Probability that the person died or is a non-coffee drinker
Add response here.
Note:
Discussion Questions:
What do these probabilities tell us about the relationship between coffee consumption and mortality in this cohort?
Are there any limitations to interpreting these probabilities as causal effects?
How might additional factors or confounders influence these probabilities?