Hypothesis Testing & P-Values

The detective work of data science: making decisions under uncertainty.

The Core Idea: What is Hypothesis Testing?

Think of hypothesis testing as being a data detective. You start with a default assumption, the Null Hypothesis (H₀), which states there is no effect or no difference (e.g., "a new drug has no effect"). Then, you gather evidence (your sample data) to see if you have enough proof to reject that default assumption in favor of an alternative, the Alternative Hypothesis (H₁) (e.g., "the new drug has an effect").

The p-value is the crucial piece of evidence. It's the probability of observing your data (or something even more extreme) if the null hypothesis were actually true. A small p-value (typically < 0.05) suggests that your observed data is very unlikely under the null hypothesis, giving you a reason to reject it.

The Two Paths: Parametric vs. Non-Parametric

The type of data you have determines the statistical test you can use. The main fork in the road is between parametric and non-parametric tests.

👨‍🍳 Parametric Tests

The Professional Chef: Assumes ingredients (data) meet certain standards (e.g., normal distribution). Precise and powerful when assumptions are met.

🏕️ Non-Parametric Tests

The Campfire Cook: Makes no strict assumptions about ingredients. More flexible and robust, especially with unusual, ranked, or non-normal data.

Hypothesis Testing & P-Values

The Two Paths: Parametric vs. Non-Parametric

T-Test

Z-Test

ANOVA

F-Test

Pearson Correlation

Chi-Squared Test

Mann-Whitney U Test

Kruskal-Wallis Test

Wilcoxon Signed-Rank Test

Spearman's Rank Correlation

Friedman Test

Kolmogorov-Smirnov (K-S) Test

Hypothesis Testing & P-Values

The Two Paths: Parametric vs. Non-Parametric

T-Test

Z-Test

ANOVA

F-Test

Pearson Correlation

Chi-Squared Test

Mann-Whitney U Test

Kruskal-Wallis Test

Wilcoxon Signed-Rank Test

Spearman's Rank Correlation

Friedman Test

Kolmogorov-Smirnov (K-S) Test