Understanding K-Nearest Neighbors Algorithm

Sep 15, 2024

K-Nearest Neighbors (KNN)

Overview

  • KNN is a simple, non-parametric supervised machine learning algorithm.
  • It is applicable to both classification and regression problems.

Example Explanation

  • Consider a two-dimensional example to build intuition.
  • Objective: classify a given point into one of three groups.

Steps to Implement KNN

  1. Calculate Distances:

    • Measure the distance between the query point and every point in the training data.
    • Common distance function: Euclidean distance, d(p, q) = sqrt(sum_i (p_i - q_i)^2). (Steps 1-3 and 5 are sketched in code after this list.)
  2. Sort Neighbors:

    • Sort the neighbors by distance in increasing order.
  3. Classification:

    • Classify the point by majority vote among its k nearest neighbors, i.e., assign it the most common class among those neighbors.
  4. Choosing the Value of k:

    • k controls the balance between overfitting and underfitting.
    • An optimal k can be determined using cross-validation and learning curves (see the cross-validation sketch after this list).
    • Trade-offs:
      • Small k: low bias, high variance (the decision boundary follows noise, risking overfitting).
      • Large k: high bias, low variance (the decision boundary oversmooths, risking underfitting).
      • The goal is a k that balances the two.
  5. Regression:

    • For regression, return the average of the k nearest neighbors' target values as the prediction.
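
The numbered steps map almost line-for-line onto a short from-scratch implementation. Below is a minimal NumPy sketch, not the lecture's own code; the function names (euclidean_distances, knn_classify, knn_regress) are illustrative.

```python
import numpy as np
from collections import Counter

def euclidean_distances(X_train, query):
    # Step 1: Euclidean distance from the query point to every training point
    return np.sqrt(((X_train - query) ** 2).sum(axis=1))

def knn_classify(X_train, y_train, query, k=3):
    # Step 2: sort neighbors by distance and keep the k nearest
    nearest = np.argsort(euclidean_distances(X_train, query))[:k]
    # Step 3: majority vote among the k nearest labels
    return Counter(y_train[nearest]).most_common(1)[0][0]

def knn_regress(X_train, y_train, query, k=3):
    # Step 5: for regression, average the k nearest target values
    nearest = np.argsort(euclidean_distances(X_train, query))[:k]
    return y_train[nearest].mean()

# Toy usage: three points belong to class 0, two to class 1
X = np.array([[1.0, 2.0], [2.0, 3.0], [3.0, 1.0], [8.0, 8.0], [9.0, 9.0]])
y = np.array([0, 0, 0, 1, 1])
print(knn_classify(X, y, np.array([2.0, 2.0]), k=3))  # prints 0
```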
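
For step 4, one standard way to pick k is k-fold cross-validation. Here is a sketch using scikit-learn's cross_val_score; the Iris data is assumed here only because the lecture's code example uses it, and the range of candidate k values is arbitrary.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Score each candidate k with 5-fold cross-validation
scores = {
    k: cross_val_score(KNeighborsClassifier(n_neighbors=k), X, y, cv=5).mean()
    for k in range(1, 31)
}

best_k = max(scores, key=scores.get)
print(f"best k = {best_k}, mean accuracy = {scores[best_k]:.3f}")
```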

Code Example

  • Uses the Iris dataset with only its first two features for demonstration.
  • The KNN algorithm is implemented using scikit-learn, whose API is largely self-explanatory (a sketch follows this list).
  • Viewers are encouraged to experiment with different parameters.
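
The lecture's exact code is not reproduced in these notes, but a sketch along the lines described (Iris's first two features, a KNeighborsClassifier, and side-by-side decision boundaries for k = 15 and k = 3) might look like this:

```python
import matplotlib.pyplot as plt
import numpy as np
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

iris = load_iris()
X, y = iris.data[:, :2], iris.target  # first two features only

fig, axes = plt.subplots(1, 2, figsize=(12, 5))
for ax, k in zip(axes, (15, 3)):  # left plot: k=15, right plot: k=3
    clf = KNeighborsClassifier(n_neighbors=k).fit(X, y)

    # Evaluate the classifier on a dense grid to draw the decision boundary
    xx, yy = np.meshgrid(
        np.linspace(X[:, 0].min() - 1, X[:, 0].max() + 1, 300),
        np.linspace(X[:, 1].min() - 1, X[:, 1].max() + 1, 300),
    )
    Z = clf.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)

    ax.contourf(xx, yy, Z, alpha=0.3)
    ax.scatter(X[:, 0], X[:, 1], c=y, edgecolor="k")
    ax.set_title(f"k = {k}")
    ax.set_xlabel(iris.feature_names[0])
    ax.set_ylabel(iris.feature_names[1])

plt.show()
```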

Visualization

  • Two plots from the code example:
    • Left Plot: Classification decision boundary with k=15.
    • Right Plot: Classification decision boundary with k=3.

Conclusion

  • This lecture is part of the bite-sized ML concepts series from Intuitive Machine Learning.
  • Viewers are encouraged to comment, like, and subscribe for more learning content.