All Topics

Browse every topic available on QuantPrep. Use the search to find exactly what you're looking for.

QuantLab

Interactive tools for hands-on probability and statistics analysis.

Linear Algebra

Vectors, matrices, and eigenvalues. The language of data.

Advanced Statistics

The science of collecting, analyzing, and interpreting data.

Probability for Quants

Master random variables, distributions, and stochastic processes.

Time Series Analysis

ARIMA, GARCH, and forecasting market movements.

Machine Learning

Building predictive models for financial markets.

The Two Views of a Vector

Vectors as geometric arrows vs. vectors as ordered lists of numbers (the data science view).

Vector Operations

Addition, subtraction (tip-to-tail rule), and scalar multiplication (stretching/shrinking).

The Dot Product, Norms, and Angles

The dot product as a measure of 'projection' or 'agreement.' L1 and L2 norms as measures of length/magnitude. Cosine similarity as a practical application.

Orthogonality

The concept of perpendicular vectors (dot product = 0) and its meaning: independence.

The Two Views of a Matrix

A matrix as a container for data (a collection of vectors) vs. a matrix as a linear transformation that moves, rotates, and scales space.

Matrix Operations

Addition, scalar multiplication, and the transpose.

Matrix Multiplication

Taught not just as a rule, but as the composition of linear transformations. This explains why AB ≠ BA.

Special Matrices

Identity matrix (the 'do nothing' operation), inverse matrix (the 'undo' operation), diagonal, triangular, and symmetric matrices.

Linear Combinations and Span

What can you build with a set of vectors?

Linear Independence

Identifying and removing redundant vectors.

Basis and Dimension

The minimal set of vectors needed to define a space and the concept of its dimension.

Vector Spaces and Subspaces

Formalizing these concepts. A subspace as a 'plane' or 'line' within a higher-dimensional space that passes through the origin.

Framing the Problem: Ax=b

Understanding Ax=b from the row picture (intersection of planes) and the column picture (linear combination of columns).

Gaussian Elimination

The core algorithm for solving linear systems. Row operations, row echelon form (REF).

The Solutions to Ax=b

Identifying if a system has a unique solution, no solution, or infinitely many solutions from its REF.

Reduced Row Echelon Form (RREF)

The ultimate, unique 'answer sheet' for a linear system, removing the need for back-substitution.

LU Decomposition

The 'matrix version' of Gaussian Elimination. Solving Ax=b becomes a fast, two-step process of forward and back substitution.

Column Space & Rank

The space of all possible outputs of A. The concept of rank as the "true dimension" of the output space.

The Null Space

The space of all inputs that map to the zero vector. Its connection to multicollinearity in data.

Row Space & Left Null Space

Completing the picture of the four fundamental subspaces.

The Fundamental Theorem of Linear Algebra

How the four subspaces relate to each other and partition the input and output spaces.

The Geometric Meaning of the Determinant

The determinant as the scaling factor of area/volume.

Calculation and Properties

Cofactor expansion and the properties of determinants. A determinant of zero means the matrix squishes space into a lower dimension (i.e., it's not invertible).

Eigenvalues & Eigenvectors

Finding the 'special' vectors that are only scaled by a transformation, not rotated off their span (Ax = λx).

The Characteristic Equation

The calculation behind eigenvalues: solving det(A - λI) = 0.

Diagonalization (PDP⁻¹)

Decomposing a matrix into its core components: 'changing to the eigenbasis, scaling, and changing back.'

Applications of Eigen-analysis

Using eigenvalues for tasks like calculating matrix powers (e.g., for Markov chains).

The Spectral Theorem

For symmetric matrices (like covariance matrices), the eigendecomposition is especially beautiful and stable (A = QDQᵀ). This is the theoretical foundation of PCA.

The Cholesky Decomposition (LLᵀ)

A highly efficient specialization for symmetric, positive-definite matrices, often used in optimization and financial modeling.

The Inexact Problem: Why Ax=b Often Has No Solution

Introducing the goal of minimizing the error ||Ax - b||.

The Geometry of "Best Fit": Projections

Finding the closest point in a subspace (the Column Space) to an external vector.

The Algebraic Solution: The Normal Equations

Deriving AᵀAx̂ = Aᵀb from the projection geometry. This is the engine of Linear Regression.

The Problem with the Normal Equations

Understanding why AᵀA can be ill-conditioned and lead to numerical errors.

The Stable Solution: The Gram-Schmidt Process

An algorithm for creating a "nice" orthonormal basis from any starting basis.

The QR Decomposition

Using Gram-Schmidt to factor A=QR. Show how this makes solving the least squares problem trivial (R = Qᵀb) and numerically robust.

The Singular Value Decomposition (SVD)

The ultimate decomposition (A = UΣVᵀ) that works for any matrix and finds orthonormal bases for all four fundamental subspaces simultaneously.

Principal Component Analysis (PCA)

A direct, powerful application of SVD on the data matrix for dimensionality reduction.

Advanced SVD Applications

Low-rank approximation for noise reduction, and the core ideas behind recommendation systems.

The Covariance Matrix & Portfolio Risk

Deriving portfolio variance from first principles using linear algebra.

Portfolio Optimization & The Efficient Frontier

Using linear algebra to construct optimal portfolios.

The Capital Asset Pricing Model (CAPM)

Understanding the relationship between risk and expected return.

Arbitrage & The Fundamental Theorem of Asset Pricing

Connecting "no free lunch" to the geometry of vector spaces.

Markov Chains for State Transitions

Modeling dynamic systems like credit ratings with transition matrices.

Fixed Income (Bond) Mathematics

Duration and convexity as linear algebraic concepts.

Hypothesis Testing Guide

A comprehensive guide to choosing the right statistical test.

An Introduction to Hypothesis Testing

A practical guide to deciding if your results are a real breakthrough or just random noise.

T-Test

Compares the means of two groups, assuming normal distribution.

Z-Test

Compares means of large samples (n>30) with known population variance.

ANOVA

Compares the averages of three or more groups.

F-Test

Compares the variances (spread) of two or more groups.

Pearson Correlation

Measures the linear relationship between two continuous variables.

Chi-Squared Test

Analyzes categorical data to find significant relationships.

Mann-Whitney U Test

Alternative to the T-Test when data is not normally distributed.

Kruskal-Wallis Test

Alternative to ANOVA for comparing three or more groups.

Wilcoxon Signed-Rank Test

Alternative to the paired T-Test for repeated measurements.

Spearman's Rank Correlation

Measures the monotonic relationship between two ranked variables.

Friedman Test

The non-parametric alternative to a repeated-measures ANOVA.

Kolmogorov-Smirnov (K-S) Test

Tests if a sample is drawn from a specific distribution.

Hypothesis Testing & P-Values

The detective work of data science.

Descriptive Statistics Explorer

Interactive guide to mean, median, skewness, and kurtosis.

The Central Limit Theorem (CLT)

Discover how order emerges from chaos.

Confidence Intervals

Understanding the range where a true value likely lies.

Z-Table Calculator

Calculate probabilities from Z-scores and vice-versa.

The Normal Distribution

The ubiquitous "bell curve."

Monte Carlo Simulation

Using random simulation to solve complex problems.

Time Series Decomposition

Breaking down a time series into its core components.

Autocorrelation (ACF & PACF)

Measuring how a time series correlates with its past values.

Volatility & Standard Deviation (GARCH)

Modeling the changing volatility of financial returns.

Efficient Frontier & Sharpe Ratio

Finding the optimal portfolio for a given level of risk.

Kalman Filters

Dynamically estimating the state of a system from noisy data.

Stochastic Calculus & Ito's Lemma

The calculus of random walks, essential for derivatives pricing.

Set Theory, Sample Spaces, and Events

Understanding the building blocks of probability.

Axioms of Probability (Kolmogorov)

The three fundamental rules that govern all of probability.

Conditional Probability and Independence

How the occurrence of one event affects another.

Law of Total Probability and Bayes' Theorem

Updating your beliefs in the face of new evidence.

Probability Mass Functions (PMF) and Cumulative Distribution Functions (CDF)

Describing the probabilities of discrete outcomes.

Expected Value E[X], Variance Var[X], and Standard Deviation

Calculating the center and spread of a random variable.

Common Discrete Distributions (Binomial, Poisson, Geometric)

Exploring key models for discrete random events.

Moment Generating Functions (MGFs) for Discrete R.V.s

A powerful tool for analyzing distributions.

Probability Density Functions (PDF) and CDF

Describing the probabilities of continuous outcomes.

Expected Value and Variance via Integration

Applying calculus to find the moments of continuous variables.

Common Continuous Distributions (Uniform, Exponential, Gamma)

Exploring key models for continuous random events.

MGFs for Continuous R.V.s

Extending moment generating functions to continuous cases.

Joint PMFs and Joint PDFs

Modeling the behavior of multiple random variables at once.

Marginal and Conditional Distributions

Isolating one variable's behavior from a joint distribution.

Covariance Cov(X, Y) and Correlation ρ

Measuring how two random variables move together.

Independence of Random Variables

Defining when two variables have no influence on each other.

Properties of the Normal Distribution and the Z-Score

Mastering the bell curve and standardization.

Linear Combinations of Independent Normal Random Variables

Understanding how normal variables combine.

Multivariate Normal Distribution

The cornerstone of modern portfolio theory.

Marginal and Conditional Distributions of Multivariate Normal

Dissecting multi-asset models.

Applications in Portfolio Theory and Financial Modeling

Putting the multivariate normal to practical use.

The t-Distribution (Student's t)

The essential tool for inference with small samples.

The χ² (Chi-Squared) Distribution

The basis for tests of variance and goodness-of-fit.

The F-Distribution (Fisher–Snedecor)

The key to comparing variances between two groups (ANOVA).

Convergence in Probability and the Weak Law of Large Numbers (WLLN)

Why casino averages are so stable.

Convergence in Distribution and the Central Limit Theorem (CLT)

Why the normal distribution is everywhere.

Slutsky's Theorem and the Delta Method

Tools for approximating the distribution of functions of random variables.

Definition of a Statistic and an Estimator

Distinguishing between a function of data and a guess for a parameter.

Unbiasedness, Bias, and Asymptotic Unbiasedness

Evaluating the accuracy of estimators.

Efficiency and the Cramér-Rao Lower Bound (CRLB)

Finding the "best" possible unbiased estimator.

Consistency and Sufficiency

Properties of estimators that improve with more data.

Method of Moments (MoM) Estimation

A straightforward technique for finding estimators.

Maximum Likelihood Estimation (MLE)

The most important method for parameter estimation in finance.

Finding MLE Estimates via Optimization

The practical side of implementing MLE.

General Construction of Confidence Intervals (CIs)

A framework for creating intervals for any parameter.

Deriving CIs for Mean and Variance

Using t, χ², and Z pivotal quantities to build intervals.

Null vs. Alternative Hypotheses, Type I and II Errors

The fundamental setup of all hypothesis tests.

Neyman-Pearson Lemma for Optimal Tests

Finding the most powerful test for a given significance level.

Likelihood Ratio Tests (LRT) and Wilks' Theorem

A general method for comparing nested models.

Testing with p-values and Critical Regions

The two equivalent approaches to making a statistical decision.

Simple Linear Regression (SLR)

Modeling the relationship between two variables.

Derivation of the OLS Estimators

The calculus behind finding the "best fit" line.

Properties of the Fitted Model (R-Squared, Residuals)

Assessing how well your linear model fits the data.

Multiple Linear Regression (MLR) in Matrix Form

Extending SLR to multiple predictors using linear algebra.

Derivation of the MLR OLS Estimator

The matrix algebra for solving a multiple regression problem.

Gauss-Markov Theorem and the BLUE Property

The theoretical justification for using OLS.

t-tests for Individual Coefficients

Testing the significance of a single predictor.

F-tests for Joint Hypotheses and Overall Model Significance

Testing the significance of a group of predictors or the entire model.

Model Assumptions (Linearity, Exogeneity, Homoskedasticity)

The critical assumptions that must hold for OLS to be valid.

Multicollinearity and Variance Inflation Factor (VIF)

Diagnosing when predictors are too correlated with each other.

Heteroskedasticity: Detection and Robust Standard Errors

Handling non-constant variance in the error terms.

Autocorrelation: Durbin-Watson Test

Detecting patterns in the error terms over time.

Characteristics of Time Series: Trend, Seasonality, Cycles

Decomposing the components of a time series.

Strict vs. Weak Stationarity

The most important property for modeling time series data.

Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF)

The key tools for identifying the structure of a time series.

ARIMA Models

A class of models for forecasting time series data.

GARCH Models for Volatility

Modeling the changing volatility of financial returns.

Monte Carlo Simulation for Pricing and Risk

Using random simulation to solve complex problems.

Bootstrapping for Estimating Standard Errors

A powerful resampling method for inference.

Jackknife Resampling Techniques

A related method for bias and variance estimation.

Introduction to Random Walks and Martingales

The mathematical foundation of efficient markets.

Geometric Brownian Motion (GBM)

The standard model for stock price paths.

Generalized Linear Models (GLMs)

Extending linear models to non-normal data.

Logistic Regression for Binary Outcomes

Modeling probabilities, such as the probability of default.

Poisson Regression for Count Data

Modeling the frequency of events.

Ridge Regression (L2 Penalty)

A technique to handle multicollinearity and prevent overfitting.

LASSO Regression (L1 Penalty) for Feature Selection

A powerful method for automatically selecting important variables.

Cross-Validation for Hyperparameter Tuning

The gold standard for selecting model parameters.

Bayesian Inference: Priors, Likelihood, and Posteriors

An alternative framework for statistical inference.

Markov Chain Monte Carlo (MCMC)

The computational engine behind modern Bayesian analysis.

Numerical Optimization: Newton-Raphson & Gradient Descent

The algorithms that power MLE and machine learning.

Implementing OLS and MLE in Python/R

Practical coding examples of core statistical techniques.

Bayes' Theorem

A framework for updating beliefs with new evidence.

Bernoulli Distribution

Modeling a single trial with two outcomes.

Binomial Distribution

Modeling a series of success/fail trials.

Poisson Distribution

Modeling the frequency of rare events.

Geometric Distribution

Modeling trials until the first success.

Hypergeometric Distribution

Modeling sampling without replacement.

Negative Binomial Distribution

Modeling trials until a set number of successes.

Discrete Uniform Distribution

Modeling where all outcomes are equally likely.

Multinomial Distribution

Generalizing the Binomial for multiple outcomes.

Gamma Distribution

Modeling waiting times and skewed data.

Beta Distribution

Modeling probabilities, percentages, and proportions.

Exponential Distribution

Modeling the time between events in a Poisson process.

Cauchy Distribution

Modeling extreme events and 'fat-tailed' phenomena.

Laplace Distribution

Modeling with a sharp peak and 'fat tails'.

F-Distribution

Comparing variances between two or more samples.

Student's t-Distribution

The backbone of hypothesis testing with small sample sizes.

Weibull Distribution

Modeling time-to-failure and event durations.

Logistic Distribution

A key distribution in machine learning and growth modeling.

Chi-Squared (χ²) Distribution

The distribution of the sum of squared standard normal deviates.

The Basics: Sample Spaces & Events

Understanding the building blocks of probability.

Combinatorics: The Art of Counting

Techniques for counting outcomes and possibilities.

Conditional Probability & Independence

How the occurrence of one event affects another.

Bayes' Theorem

Updating beliefs in the face of new evidence.

Random Variables (Discrete & Continuous)

Mapping outcomes of a random process to numbers.

Expectation, Variance & Moments

Calculating the center, spread, and shape of a distribution.

Common Discrete Distributions

Exploring Bernoulli, Binomial, and Poisson distributions.

Common Continuous Distributions

Exploring Uniform, Normal, and Exponential distributions.

Joint, Marginal & Conditional Distributions

Modeling the behavior of multiple random variables at once.

Covariance & Correlation

Measuring how two random variables move together.

The Law of Large Numbers (LLN)

Why casino averages are so stable.

The Central Limit Theorem (CLT)

Why the normal distribution is everywhere.

Transformations of Random Variables

Finding the distribution of a function of a random variable.

Moment Generating Functions (MGFs)

A powerful tool for analyzing distributions.

Introduction to Information Theory

Quantifying information with Entropy and KL Divergence.

Introduction to Stochastic Processes & Stationarity

Understanding random phenomena that evolve over time.

Discrete-Time Markov Chains

Modeling memoryless state transitions.

The Poisson Process

Modeling the timing of random events.

Random Walks & Brownian Motion

The mathematical foundation of stock price movements.

Sigma-Algebras & Probability Measures

The rigorous foundation of modern probability.

The Lebesgue Integral & Rigorous Expectation

A more powerful theory of integration.

Martingales

The formal model of a fair game.

Introduction to Itô Calculus

The calculus of random walks, essential for derivatives pricing.

The Language of ML: Data, Features & Labels

The Three Flavors of Learning (Supervised, Unsupervised, Reinforcement)

Your First Models: An Intuitive Look (KNN & Simple Linear Regression)

The Golden Rule: How to Split Your Data (Train, Validate, Test)

How Do We Score a Model? (Accuracy, Confusion Matrix, MSE)

Mental Math for Interviews

Sharpen your calculation speed and accuracy for interviews.

The Financial ML Landscape (Alpha, Risk, Execution)

Feature Engineering for Financial Data (Price, Volume, Order Books)

Core Predictive Models (Trees, Boosting, Regularization)

Backtesting & Model Validation (Walk-Forward, Sharpe Ratio)

Classical Time-Series Models (ARIMA, GARCH)

Deep Learning for Sequences (LSTMs, Transformers)

Stationarity & Memory in Markets (ADF Test, Frac. Diff.)

Building Trading Signals from Predictions (Meta-Labeling)

Credit Default Prediction & Scoring

Anomaly & Financial Fraud Detection (Isolation Forests, Autoencoders)

Modeling Volatility & Value-at-Risk (VaR)

Model Explainability & Interpretability (SHAP, LIME)

Financial Sentiment Analysis (News, Earnings Reports, Tweets)

Information Extraction (NER, Topic Modeling)

Advanced Text Representation (Word2Vec, Transformers - BERT)

Integrating NLP Signals into Trading Models

Reinforcement Learning for Optimal Trading

Portfolio Optimization with ML (Covariance Estimation)

Leveraging Alternative Data (Satellite Imagery, Web Data)

AI Ethics & Regulation in Finance