Interactive Guide to the Kolmogorov-Smirnov (K-S) Test

A powerful test to determine if your data follows a specific distribution, like the normal distribution.

Core Concepts

Purpose & Analogy

The K-S test acts like a "goodness-of-fit" ruler. It measures the maximum distance between the shape of your sample data (the Empirical CDF) and the shape of a theoretical distribution (the Theoretical CDF). If the distance is too large, you conclude your data doesn't fit that theoretical shape.

When to Use It

The most common use is to test for **normality**. Before you use a parametric test like a T-Test or ANOVA, you should check if your data is normally distributed. The K-S test is a formal way to do this. It can also be used to check if two different samples come from the same distribution.

Visualizing Goodness-of-Fit

This chart plots the cumulative distribution of your sample data against the ideal cumulative distribution of a perfect normal curve. The closer the two lines are, the better the fit.

Example: We generate a sample of data and plot its Empirical Cumulative Distribution Function (ECDF). We then overlay the theoretical Cumulative Distribution Function (CDF) of a normal distribution. Toggle between a normal sample and a uniform sample to see how the ECDF's fit changes.