Back to Dashboard

Understanding Basic Statistical Testing

Learn the foundations of biostatistics, upload your dataset, and perform automated hypothesis tests.

๐Ÿšฆ Statistics for Beginners: The Bare Minimum
1. The Baseline Assumption
(The "Null Hypothesis")

Assume nothing interesting is happening.

  • The drug does nothing.
  • The patient is perfectly healthy.
  • There is no difference between groups.
2. The P-Value
(The Surprise Score)

If the Baseline Assumption is true, how likely was this data to occur by chance only?

A P-value of 0.03 means we would only see this data by chance 3% of the time. We are very surprised!
3. The Verdict
(The 0.05 Rule)
If p ≤ 0.05:
We found something real! Reject the Baseline Assumption.
(Difference is Statistically Significant)
If p > 0.05:
It's probably just noise. Keep the Baseline Assumption.
(Difference is Not Statistically Significant)

Why do we NEVER say "We proved the Baseline Assumption is true or Null Hypothesis is Accepted"?

The Lesson

Imagine a patient arriving at the emergency room with severe abdominal pain. The doctor suspects appendicitis and orders a CT scan or ultrasound to look for evidence of inflammation.

Now, if the scan clearly shows an inflamed appendix, the diagnosis is confirmed. But if the scan does not show convincing signs of appendicitis, it does not prove that the patient is perfectly healthy; it simply means that, based on the current evidence, there is not enough proof to confirm the disease.

This is exactly how a p-value works in hypothesis testing. The baseline assumption (null hypothesis) assumes "no disease" or "no effect." A small p-value suggests that the observed findings would be very unlikely if there were truly no disease, so we reject the null hypothesis. A large p-value, however, does not prove that there is no disease or no effect; it only indicates that the evidence we observed is not strong enough to confidently rule out chance.

In essence, a p-value measures the strength of evidence against the null hypothesisโ€”it does not prove that the null hypothesis is true.

Baseline (Nothing Happening) Most results land safely in the middle.
p = 0.03
Null Hypothesis Rejected (There is a statistically significant difference)
One-Tailed vs Two-Tailed Test

Imagine you are conducting a clinical trial on a new antihypertensive drug. The standard drug lowers systolic blood pressure by 10 mmHg on average.

Two-Tailed Test: If your question is: โ€œDoes the new drug change blood pressure compared to the standard?โ€ โ€” you are open to either possibility: it might lower BP more, or it might unexpectedly increase it. This is like a hospital safety committee that investigates both unusually high and unusually low oxygen levels in ICU patients. You are guarding against deviations in both directions.
One-Tailed Test: Now imagine your drug has shown evidence that it can only lower BP. Your question becomes: โ€œDoes the new drug reduce blood pressure more than the standard?โ€ This is like a vaccination campaign where the only meaningful question is whether the vaccine reduces infection rates. Your statistical attention is focused in one direction.

*One-tailed tests should only be chosen when there is strong theoretical or clinical justification before data analysis.

What is "Degree of Freedom"?
The Anemia Patient Analogy: Suppose you are analyzing hemoglobin levels of 5 patients. Once you calculate the mean, it acts like a constraint. The first four patients can vary freely, but the fifth patient's level must balance the others to keep the average fixed. Although you have 5 data points, only 4 are truly "free to vary."
df = n - 1 (5 - 1 = 4)
The Hospital Budget Analogy: Imagine distributing a fixed budget of โ‚น10 lakhs across five departments. You can allocate funds freely to the first four, but the fifth automatically receives whatever remains to keep the total at โ‚น10 lakhs. That last allocation has no independent freedom.

The Core Concept:
Degree of freedom represents the number of independent pieces of information available to estimate variability in a study.

Step 2: Upload Dataset (CSV)
๐Ÿ’ก How to prepare your data:

Your CSV must have headers in the first row. Each column represents one variable, and each row represents one patient. Example format:
Group, Age, BloodPressure, Outcome Treatment, 45, 120.5, Recovered Control, 52, 135.2, Not Recovered Treatment, 38, 118.0, Recovered Control, 61, 142.8, Not Recovered Treatment, 41, 122.1, Not Recovered Control, 55, 138.5, Recovered Treatment, 33, 115.4, Recovered Control, 48, 129.0, Recovered

Waiting for raw data...
๐Ÿ“Š Available Tests in EpiSense
Comparison of Means
  • Independent T-test (2 groups)
  • Paired T-test (Before/After)
  • One-Way ANOVA (3+ groups)
  • Mann-Whitney / Kruskal-Wallis (Non-parametric)
Relationships & Trends
  • Pearson's r (Linear Correlation)
  • Spearman's Rho (Rank Correlation)
  • Scatter Plot Visualizations
Categorical Data
  • Pearson Chi-Square (Groups/Outcomes)
  • Fisher's Exact Test (Small Samples)
  • 2x2 Contingency Tables

Step 3: Choose Testing Method

๐Ÿงฎ Fast Summary Statistics Calculator

Bypass raw data uploads. If you already have the summary statistics (Means, Standard Deviations, Sample Sizes, or Contingency Tables) from a paper or previous analysis, select the test below to calculate the P-Value instantly.