lvchen727 / computer-vision-notes

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

computer-vision-notes

CV basics

  1. low-pass filtering for image smoothing and noise reduction

  2. high-pass filtering for enhancing and edge detection

Feature descriptors

There are two parts for SIFT:

  • Part I: locate SIFT detectors(keypoints)

    1. Scale-space extrema detection: arch over multiple scales and image locations
    2. Keypoint localization: a model to detrmine location and scale.
    3. Orientation assignment: mpute best orientation(s) for each keypoint region.
  • Part II: compute feature descriptor 4. Keypoint description: e local image gradients at selected scale and rotation to describe each keypoint region.

    SIFT computes the descriptor by taking a 16x16 block of pixels centered at a key point, dividing it into 4x4 cells, and computing an 8-bin histogram of gradient orientations within each cell(the length of each arrow corresponding to the sum of the gradient magnitudes near that direction within the region using "guassian weighted function"). This results in a 448=128-element vector, which is the descriptor.

Great notes:

Histogram of Oriented Gradients(HOG)

CNN

Feature encoders

BOVW

FV

References

About