computer-vision-notes

CV basics

low-pass filtering for image smoothing and noise reduction
- spatial averaging
- Gaussian filter (non-uniform low-pass filter)
  - Gaussian filter note1
  - Gaussian filter note2
- why use a Gaussian filter for image processing
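
To make the difference between spatial averaging and a Gaussian filter concrete, here is a minimal pure-Python sketch. The helper names (`box_kernel`, `gaussian_kernel`, `convolve`), the 3x3 kernel size, sigma = 1.0, and the replicated-border handling are illustrative assumptions, not taken from any particular library.

```python
import math

def box_kernel(size):
    """Uniform (spatial averaging) kernel: every weight is 1 / size^2."""
    w = 1.0 / (size * size)
    return [[w] * size for _ in range(size)]

def gaussian_kernel(size, sigma):
    """Normalized 2-D Gaussian kernel (size must be odd).

    Weights fall off with distance from the center, hence "non-uniform".
    """
    half = size // 2
    k = [[math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
          for x in range(-half, half + 1)]
         for y in range(-half, half + 1)]
    total = sum(sum(row) for row in k)
    return [[v / total for v in row] for row in k]

def convolve(image, kernel):
    """Filter a 2-D list image with a square kernel, replicating border pixels.

    Both kernels here are symmetric, so correlation and convolution coincide.
    """
    h, w = len(image), len(image[0])
    half = len(kernel) // 2
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            acc = 0.0
            for dy in range(-half, half + 1):
                for dx in range(-half, half + 1):
                    rr = min(max(r + dy, 0), h - 1)
                    cc = min(max(c + dx, 0), w - 1)
                    acc += image[rr][cc] * kernel[dy + half][dx + half]
            out[r][c] = acc
    return out

# A single bright pixel (impulse noise) on a dark 5x5 background.
image = [[0.0] * 5 for _ in range(5)]
image[2][2] = 9.0

box_out = convolve(image, box_kernel(3))
gauss_out = convolve(image, gaussian_kernel(3, sigma=1.0))
```

On this impulse, the box filter spreads the value equally over the whole 3x3 neighborhood, while the Gaussian keeps the most weight at the center, less at the edge neighbors, and least at the corners. That smooth, distance-dependent falloff is one common reason given for preferring the Gaussian.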
high-pass filtering for image enhancement and edge detection
- Laplacian of Gaussian (LoG)
- Difference of Gaussian (DoG)
  - DoG is a close approximation to the scale-normalized LoG.
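
The DoG/LoG relationship can be checked numerically. The sketch below assumes the standard 2-D Gaussian G(x, y, sigma) and compares G(k*sigma) - G(sigma) against (k - 1) * sigma^2 * Laplacian(G), which is the first-order form of the approximation. The function names, the sampling grid, and the choice k = 1.01 are illustrative assumptions; the match is tightest when k is close to 1.

```python
import math

def gaussian(x, y, sigma):
    """2-D isotropic Gaussian with unit integral."""
    return math.exp(-(x * x + y * y) / (2.0 * sigma * sigma)) / (2.0 * math.pi * sigma * sigma)

def scale_normalized_log(x, y, sigma):
    """sigma^2 * Laplacian of the Gaussian, written in closed form."""
    r2 = x * x + y * y
    return (r2 - 2.0 * sigma * sigma) / (sigma * sigma) * gaussian(x, y, sigma)

def dog(x, y, sigma, k):
    """Difference of two Gaussians at scales k*sigma and sigma."""
    return gaussian(x, y, k * sigma) - gaussian(x, y, sigma)

sigma, k = 1.0, 1.01
# Sample a square region of +/- 3 sigma around the origin.
points = [(i * 0.25, j * 0.25) for i in range(-12, 13) for j in range(-12, 13)]

approx = [(k - 1.0) * scale_normalized_log(x, y, sigma) for x, y in points]
exact = [dog(x, y, sigma, k) for x, y in points]

max_err = max(abs(e - a) for e, a in zip(exact, approx))
max_mag = max(abs(a) for a in approx)
rel_err = max_err / max_mag  # worst-case error relative to the peak response
```

With these settings the worst-case error is a small fraction of the peak response. With a larger k the two curves differ more in magnitude while keeping the same overall center-surround shape.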

SIFT (Scale-Invariant Feature Transform) has two parts:

Part I: detect SIFT keypoints
1. Scale-space extrema detection: search over multiple scales and image locations.
2. Keypoint localization: fit a model to determine location and scale.
3. Orientation assignment: compute the best orientation(s) for each keypoint region.

Part II: compute the feature descriptor
4. Keypoint description: use local image gradients at the selected scale and rotation to describe each keypoint region.
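
Step 1 can be sketched as follows: blur the image at several scales, subtract adjacent scales to get a DoG stack, and keep pixels that are strictly larger or smaller than all 26 neighbors in space and scale. This is a simplified, pure-Python illustration. The function names, the parameters (`sigma0`, `k`, `num_scales`, `contrast`), the single octave, and the synthetic blob image are all assumptions made for the example, and sub-pixel localization and edge-response rejection (step 2) are left out.

```python
import math

def gaussian_kernel_1d(sigma):
    """Normalized 1-D Gaussian kernel truncated at about 3 sigma."""
    radius = max(1, int(math.ceil(3.0 * sigma)))
    k = [math.exp(-(i * i) / (2.0 * sigma * sigma)) for i in range(-radius, radius + 1)]
    total = sum(k)
    return [v / total for v in k]

def blur(image, sigma):
    """Separable Gaussian blur (rows, then columns) with replicated borders."""
    kernel = gaussian_kernel_1d(sigma)
    radius = len(kernel) // 2
    h, w = len(image), len(image[0])
    tmp = [[sum(kernel[j + radius] * row[min(max(c + j, 0), w - 1)]
                for j in range(-radius, radius + 1))
            for c in range(w)] for row in image]
    return [[sum(kernel[j + radius] * tmp[min(max(r + j, 0), h - 1)][c]
                 for j in range(-radius, radius + 1))
             for c in range(w)] for r in range(h)]

def dog_extrema(image, sigma0=1.0, k=1.6, num_scales=4, contrast=0.03):
    """Return (row, col, sigma) for strict 26-neighbor extrema of the DoG stack."""
    sigmas = [sigma0 * k ** i for i in range(num_scales)]
    blurred = [blur(image, s) for s in sigmas]
    h, w = len(image), len(image[0])
    dogs = [[[blurred[i + 1][r][c] - blurred[i][r][c] for c in range(w)]
             for r in range(h)] for i in range(num_scales - 1)]
    keypoints = []
    # Only interior DoG layers have a scale neighbor on both sides.
    for s in range(1, len(dogs) - 1):
        for r in range(1, h - 1):
            for c in range(1, w - 1):
                v = dogs[s][r][c]
                if abs(v) < contrast:  # skip low-contrast responses
                    continue
                neighbors = [dogs[s + ds][r + dr][c + dc]
                             for ds in (-1, 0, 1)
                             for dr in (-1, 0, 1)
                             for dc in (-1, 0, 1)
                             if (ds, dr, dc) != (0, 0, 0)]
                if v > max(neighbors) or v < min(neighbors):
                    keypoints.append((r, c, sigmas[s]))
    return keypoints

# Synthetic test image: one bright Gaussian blob (std 2 px) centered at (10, 10).
size = 21
image = [[math.exp(-((r - 10) ** 2 + (c - 10) ** 2) / (2.0 * 2.0 ** 2))
          for c in range(size)] for r in range(size)]
keypoints = dog_extrema(image)
```

For this blob, the center pixel is a minimum of the DoG stack at the middle scale, so it is reported as a keypoint. A bright blob gives a negative DoG response, which is why both maxima and minima are kept.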

SIFT computes the descriptor by taking a 16x16 block of pixels centered at a keypoint, dividing it into a 4x4 grid of cells (each 4x4 pixels), and computing an 8-bin histogram of gradient orientations within each cell. In the usual arrow diagram, the length of each arrow corresponds to the sum of the gradient magnitudes near that direction within the cell, weighted by a Gaussian weighting function. This results in a 4 x 4 x 8 = 128-element vector, which is the descriptor.
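
The 4 x 4 x 8 layout can be sketched in a few lines. This is a simplified illustration, not a full SIFT implementation: the function name `sift_like_descriptor` is hypothetical, gradients use clamped central differences, the Gaussian weight uses sigma equal to half the window width (an assumption here), and the rotation to the keypoint orientation, the interpolation between neighboring bins, and the clamping of large values are omitted.

```python
import math

def sift_like_descriptor(patch):
    """Build a 128-element descriptor from a 16x16 grayscale patch (list of lists)."""
    size, cell_size, cells_per_side, bins = 16, 4, 4, 8
    assert len(patch) == size and all(len(row) == size for row in patch)
    sigma = size / 2.0           # assumed Gaussian window: half the patch width
    center = (size - 1) / 2.0
    hist = [0.0] * (cells_per_side * cells_per_side * bins)

    for r in range(size):
        for c in range(size):
            # Central differences, clamped at the patch border.
            dx = patch[r][min(c + 1, size - 1)] - patch[r][max(c - 1, 0)]
            dy = patch[min(r + 1, size - 1)][c] - patch[max(r - 1, 0)][c]
            magnitude = math.hypot(dx, dy)
            angle = math.atan2(dy, dx) % (2.0 * math.pi)
            b = int(angle / (2.0 * math.pi) * bins) % bins

            # Gaussian weight: pixels near the keypoint count more.
            d2 = (r - center) ** 2 + (c - center) ** 2
            weight = math.exp(-d2 / (2.0 * sigma * sigma))

            cell = (r // cell_size) * cells_per_side + (c // cell_size)
            hist[cell * bins + b] += magnitude * weight

    # Normalize to unit length so the descriptor is less sensitive to contrast.
    norm = math.sqrt(sum(v * v for v in hist))
    return [v / norm for v in hist] if norm > 0 else hist

# Usage: a horizontal intensity ramp has gradients pointing along +x only,
# so all the energy lands in orientation bin 0 of every cell.
ramp = [[float(c) for c in range(16)] for _ in range(16)]
desc = sift_like_descriptor(ramp)
```

Each of the 16 cells contributes 8 consecutive entries, so entry `cell * 8 + bin` holds the weighted gradient magnitude for one orientation bin of one cell.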

Great notes: