🖼️

Understanding Feature Extraction in Images

Oct 12, 2024

Feature Extraction from Images

Introduction

Feature extraction is crucial in image data processing.
It involves understanding how a computer interprets image data, often stored as a matrix.

Simple Approach to Feature Extraction

Black and White Images

Images are interpreted as matrices.
Example: A 22x16 pixel image is stored as a 352 pixel matrix.
Pixel values range from 0 (black) to 255 (white).
Features can be represented by flattening the matrix into a 1x352 vector.

Color Images

Stored as three matrices (channels) for blue, green, and red.
Similar approach by calculating mean pixel values across channels.

Limitations

Simple approach is prone to noise (e.g., background, color differences, expressions).
Computationally expensive and retains unnecessary information.

Advanced Feature Extraction: HOG (Histogram of Oriented Gradients)

Concept

HOG computes pixel-wise gradients and orientations.
Represents images in a noise-reduced form.

Steps to Build HOG Representation

1. Image Pre-processing

Standardize image size, e.g., resizing to 32x64 pixels.
Divide resized image into smaller cells, e.g., 4x4 pixels.

2. Calculating Gradients and Orientations

Compute gradients in x (gx) and y (gy) directions.
Calculate gradient magnitude and orientation.
Use Pythagoras theorem for magnitude and inverse tangent for orientation.

3. Plotting Histograms

Orientations are binned (e.g., 20-degree bins, 9 bins total).
Distribute gradient magnitudes into bins based on orientation.
Results in a 1x9 feature matrix per cell.

4. Normalization

Form blocks of cells for normalization to reduce sensitivity to lighting variations.
Calculate block features and normalize using root sum of squares.
Shift blocks across the image to compute normalized vectors for all blocks.

Feature Matrix

HOG results in a 1x3780 dimensional image descriptor.

Dimensionality Reduction

The high dimensionality of features can lead to computational expense and overfitting.
PCA (Principal Component Analysis) is used to reduce dimensions.

Conclusion

HOG is an effective feature descriptor for minimizing noise and focusing on crucial information in images.

Full transcript