In our last lesson, we successfully used the Normal Equations, , as a powerful recipe to find the line of best fit. It felt clean, simple, and definitive. It is the textbook method and the foundation of our theoretical understanding of least squares.
However, when we move from small, neat textbook examples to the large, messy datasets of the real world, a dark side of the Normal Equations can emerge. Direct computation of can sometimes lead to serious numerical problems.
Today, we will play the role of a numerical analyst and investigate the primary weakness of this method: the problem of ill-conditioning.