Machine Learning

Principal Component Analysis

A dimensionality reduction technique that transforms data into a new coordinate system where the first axis captures the most variance, the second axis the next most, and so on.

Why It Matters

PCA reduces data complexity while preserving the most important information. It is essential for visualization, noise reduction, and speeding up downstream models.

Example

Reducing a dataset with 100 features down to 10 principal components that capture 95% of the total variance, making it much easier to visualize and model.

Think of it like...

Like summarizing a semester of lecture notes into a study guide — you identify the most important themes and discard the redundant details.

Related Terms