Exploring Normal Distribution & Central Limit Theorem.
Watch extreme earners physically enter the population, dragging the Mean away from the Median.
Skewness = Σ(X - μ)³ / (N · σ³)
Watch the "shoulders" collapse as mass pushes into the peak and tails (Red Zones).
Kurtosis = Σ(X - μ)⁴ / (N · σ⁴)
*Excess
Kurtosis = Kurtosis - 3
Want to see how your own data holds up? Upload a CSV or TXT file containing numerical data (comma or space separated) to instantly calculate its Skewness and Kurtosis!
The ultimate statistical benchmark. We "morph" any complex data into a simplified Mean=0, SD=1 scale for fair comparison.
Comparing heights in cm (μ=170) vs inches (μ=67) is hard. Standardization "morphs" both into Z-scores so you can compare them apples-to-apples.
Once standardized, any value becomes a Z-score. This allows us to calculate probability (area under the curve) using a standard table.
Z = (X - μ) / σ
Where X is raw value, μ is mean, σ is SD.
The foundation of any statistical analysis starts with understanding your data types.
Describes qualities or characteristics. Values are labels or names.
Measured quantities. Values are expressed as numbers.
Characterized by equal intervals between successive values but lacking a true zero point.
Possesses all properties of an interval scale with the addition of an Absolute Zero point.
| Main Category | Variable Type | Scientific Plot Types |
|---|---|---|
| Qualitative | Nominal / Ordinal | |
| Associations | ||
| Quantitative | Discrete / Continuous | |
| Associations | ||
| Clustering |
Scientific Data Insight
A framework for disease control and health preservation.
Actions taken prior to the onset of disease to remove the possibility that it will ever occur. Target: Healthy Population.
Immunization (COVID-19/Polio), use of seatbelts, nutritional supplementation (Folic acid in pregnancy).
Actions which halt the progress of a disease at its incipient stage and prevent complications. Target: Early Disease.
Mammography for breast cancer screening, Pap smears, BP screening, early treatment of hypertension.
Measures available in the late stages to mitigate impact, limit disability, and rehabilitate. Target: Established Disease.
Physiotherapy for stroke patients, cardiac rehabilitation after MI, speech therapy after neurological insult.
Actions taken to identify patients at risk of over-medicalization and protect from unnecessary interventions. Target: At-risk of iatrogenesis.
Avoiding unwarranted prostate biopsy in elderly, depumping patients from polypharmacy, ethical end-of-life care.
| Disease Stage ↓ | 🛡️ Primary | 🔍 Secondary | 🏥 Tertiary | ⚖️ Quaternary |
|---|---|---|---|---|
| Well / Susceptible |
Vaccination Health education |
— | — |
Avoid over-screening Prevent incidentalomas |
| Pre-clinical / Latent |
Chemoprophylaxis e.g., Statins in high-risk |
Screening Mammography, Pap smear |
— | Right test, right patient |
| Clinical Disease | — |
Early treatment Reduce severity |
Definitive treatment Surgery, chemo, ICU |
Avoid polypharmacy |
| Disability / Sequelae | — | — |
Rehabilitation Physio, prosthetics |
Ethical end-of-life Palliative care dignity |
█ Active intervention zone — Not applicable at this stage Cells color-intensity indicates relevance strength.
Understanding the outcome of hypothesis testing: Being right, or being wrong.
Slide the parameters below to see how alpha, sample size, and effect size impact our ability to detect a true difference (Power).
Rejecting H₀ when it is actually true. Saying a drug works when it doesn't.
Failing to reject H₀ when Ha is true. Missing a drug that actually works.
| H₀ True (No real effect) |
H₀ False (Real effect exists) |
|
|---|---|---|
| Reject H₀ |
❌ Type I Error (α) False Positive |
✅ Correct (Power = 1−β) True Positive |
| Fail to Reject H₀ |
✅ Correct (1−α) True Negative |
⚠️ Type II Error (β) False Negative |
A pharma company tests Drug X (new) vs Placebo for reducing systolic BP.
H₀: Drug X = Placebo (no difference in SBP reduction).
Ha: Drug X ≠ Placebo (Drug X reduces SBP more).
Drug Controlling Authority approves Drug X even though it has no real benefit. Patients take an ineffective drug with possible side effects. Resources wasted.
Study concludes Drug X doesn't work, so it's shelved. Patients miss out on an effective treatment that could have saved lives.
⚡ Key Insight: Type I and Type II errors have an inverse relationship. Making α stricter (reducing false positives) increases β (more false negatives), and vice versa. The only way to reduce both simultaneously is to increase sample size.
The true average value of the entire population.
μ = ΣX / N
Average distance of population points from the mean.
σ = √[ Σ(X-μ)² / N ]
The average value calculated from a specific sample size (n).
x̄ = Σx / n
Estimated spread of the population using (n-1) degrees of freedom.
s = √[ Σ(x-x̄)² / (n-1) ]
Quantifies the precision of the sample mean. Used for calculating Confidence Intervals and conducting Hypothesis Testing.
SE = σ / √n
A precision range that captures the true population mean with a specific level of certainty. The interval width is determined by the Standard Error (SE).
CI = x̄ ± (1.96 * SE)
Sample means follow a Normal Distribution regardless of population shape, if n is large enough.
Allows making inferential predictions about a population without knowing its true distribution shape.