bhaney / viam-visual-odometry


Monocular Visual Odometry

Viam module for monocular visual odometry implemented as a movement sensor.

https://github.com/Rob1in/viam_visual_odometry/blob/main/img/trajectory.gif

Getting started

All you need is a calibrated camera.

This module implements two methods of the movement sensor API:

  • GetLinearVelocity()
  • GetAngularVelocity()

Please note that GetLinearVelocity() returns an estimate of the instantaneous linear velocity up to an unknown scale factor, a fundamental limitation of monocular odometry. The units therefore cannot be trusted, and GetLinearVelocity() should be used only as an estimate of the direction of motion.
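Because the returned linear velocity is only defined up to scale, a caller would typically normalize the vector and use the direction alone. A minimal sketch in plain Python with numpy (direction_of_motion is a made-up helper name, and the reading values are invented for illustration):

```python
import numpy as np

def direction_of_motion(linear_velocity):
    """Normalize an up-to-scale velocity vector to a unit direction.

    Returns None if the vector is too small to give a reliable direction.
    """
    v = np.asarray(linear_velocity, dtype=float)
    norm = np.linalg.norm(v)
    if norm < 1e-9:
        return None
    return v / norm

# Hypothetical GetLinearVelocity() output: only the direction is meaningful.
reading = [0.12, -0.03, 0.47]
print(direction_of_motion(reading))
```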

Installation

Option 1: poetry

cd viam-visual-odometry
poetry install

In run.sh, uncomment the line:

#exec poetry run python -m src.main $@

and remove the line:

exec python3 -m src.main $@

Option 2: pip install

pip install -r requirements.txt

Config

Example config

{
  "modules": [
    {
      "name": "my-odometry",
      "executable_path": "/path/to/run.sh", 
      "type" : "local"
    }
  ],
  "components": [
    {
      "name": "cam",
      "type": "camera",
      "model": "webcam",
      "attributes": {
        "height_px": 720,
        "width_px": 1280,
        "intrinsic_parameters": {
          "ppx": 446,
          "ppy": 585,
          "fx": 1055,
          "fy": 1209
        },
        "distortion_parameters": {
          "rk3": -0.03443,
          "tp1": 0.01364798,
          "tp2": -0.0107569,
          "rk1": -0.1621,
          "rk2": 0.13632
        }
      },
      "depends_on": []
    },
    {
      "namespace": "rdk",
      "model": "viam:visual_odometry:opencv_orb",
      "attributes": {
        "camera_name": "cam", 
        "time_between_frames_s": 0.2, 
        "lowe_ratio_threshold": 0.75
      },
      "depends_on": [],
      "name": "visual_odometry1",
      "type": "movement_sensor"
    }
  ]
}

The camera needs to have intrinsic parameters. You can follow these instructions to calibrate your camera. If you want to grab the module from the registry, change the modules field to:

  "modules": [
    {
      "version": "^0.0.6",
      "module_id": "viam:monocular-visual-odometry",
      "name": "my-odometry",
      "type": "registry"
    }
  ]

Attributes description

The following attributes are available to configure your Visual odometry module:

| Name | Type | Inclusion | Default | Description |
| ---- | ---- | --------- | ------- | ----------- |
| camera_name | string | Required | | Name of the camera used to infer the motion. |
| time_between_frames_s | float | Optional | 0.1 | Target time between two successive frames, in seconds. Based on the inference time and the time needed to get an image, the sleep time after each inference is auto-tuned to reach this target. If the time between two successive frames becomes 5x larger than time_between_frames_s, another frame is requested. The right value depends on the speed of your system. |
| orb_n_features | int | Optional | 10000 | Maximum number of features to retain. |
| orb_edge_threshold | int | Optional | 31 | Size of the border where features are not detected. It should roughly match the orb_patch_size attribute. |
| orb_patch_size | int | Optional | 31 | Size of the patch used by the oriented BRIEF descriptor. |
| orb_n_levels | int | Optional | 8 | Number of pyramid levels. |
| orb_first_level | int | Optional | 0 | Pyramid level to put the source image to. |
| orb_fast_threshold | int | Optional | 20 | FAST threshold. |
| orb_scale_factor | float | Optional | 1.2 | Pyramid decimation ratio, greater than 1. |
| orb_WTA_K | int | Optional | 2 | Number of points that produce each element of the oriented BRIEF descriptor. |
| matcher | string | Optional | "flann" | Either "flann" for the FLANN-based matcher or "BF" for the brute-force matcher. The FLANN matcher looks for the two best matches using the KNN method so that Lowe's ratio test can be performed afterward. The brute-force matcher uses the Hamming norm. |
| lowe_ratio_threshold | float | Optional | 0.8 | Threshold used to check whether the best match is significantly better than the second-best match. This value is not used if the brute-force matcher is chosen. |
| ransac_prob | float | Optional | 0.99 | Probability of finding a subset without outliers in it. This defines the number of RANSAC iterations, roughly given by $k = \frac{\log(1-p)}{\log(1-w^n)}$, where $n$ is the number of points in a sample and $w$ is the ratio of inliers to total points. |
| ransac_threshold_px | float | Optional | 0.5 | Maximum error, in pixels, for a point to be classified as an inlier. |

See the ORB OpenCV documentation for more details.
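The iteration-count formula behind ransac_prob can be evaluated directly. A quick sketch using the standard library (the inlier ratio w = 0.5 is an assumed value for illustration, and n = 5 corresponds to the five-point essential-matrix sample):

```python
import math

def ransac_iterations(p, w, n):
    """Number of RANSAC iterations k needed to draw, with probability p,
    at least one sample of n points that are all inliers, given inlier ratio w."""
    return math.ceil(math.log(1 - p) / math.log(1 - w ** n))

# With the module's default ransac_prob = 0.99, an assumed inlier ratio of 0.5,
# and the 5 points of a minimal essential-matrix sample:
print(ransac_iterations(0.99, 0.5, 5))  # → 146
```

As the sketch shows, the iteration count grows quickly as the inlier ratio drops, which is why the default success probability of 0.99 remains cheap only when matches are reasonably clean.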

Deeper dive

The module works as follows:

  1. Having a previous image with keypoints, request a new image.
  2. Detect ORB keypoints in the new image.
  3. Find matching keypoints in the two images using the KNN or brute-force matcher.
  4. Filter matches (with Lowe's and RANSAC) and compute essential matrix.
  5. Decompose essential matrix using chirality constraint.
  6. Retrieve linear and angular velocities from the previous decomposition.

Matcher

Filtering operations

Coordinate system

The coordinate system used is as follows. X is pointing to the right of the camera, Y down, and Z forward.

Angular velocity calculation from rotation matrix

From the rotation matrix, we first compute the Euler angles as the series of intrinsic rotations (i.e. attached to the moving body) $ZXZ$, giving the angles $\phi$, $\theta$ and $\psi$.

Angular velocities about the final $x,y,z$ coordinate system are given by:

$\omega_x = \dot{\phi} \sin\theta \sin\psi + \dot{\theta} \cos\psi$
$\omega_y = \dot{\phi} \sin\theta \cos\psi - \dot{\theta} \sin\psi$
$\omega_z = \dot{\phi} \cos\theta + \dot{\psi}$
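These relations can be checked numerically by comparing them against the body-frame angular velocity extracted from $R^\top \dot{R}$, where $R = R_z(\phi)\,R_x(\theta)\,R_z(\psi)$. A verification sketch with numpy (the angle and rate values are arbitrary):

```python
import numpy as np

def Rz(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0], [s, c, 0], [0, 0, 1]])

def Rx(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[1, 0, 0], [0, c, -s], [0, s, c]])

def R_zxz(phi, theta, psi):
    # Intrinsic ZXZ rotation: successive rotations about the moving body's axes.
    return Rz(phi) @ Rx(theta) @ Rz(psi)

def omega_from_euler_rates(phi, theta, psi, dphi, dtheta, dpsi):
    # The closed-form expressions above.
    wx = dphi * np.sin(theta) * np.sin(psi) + dtheta * np.cos(psi)
    wy = dphi * np.sin(theta) * np.cos(psi) - dtheta * np.sin(psi)
    wz = dphi * np.cos(theta) + dpsi
    return np.array([wx, wy, wz])

# Numerical check: [omega]_x = R^T dR/dt, with angles varying linearly in time.
phi, theta, psi = 0.3, 0.7, -0.2        # arbitrary test angles
dphi, dtheta, dpsi = 0.11, -0.05, 0.23  # arbitrary angle rates
h = 1e-6
Rdot = (R_zxz(phi + h * dphi, theta + h * dtheta, psi + h * dpsi)
        - R_zxz(phi - h * dphi, theta - h * dtheta, psi - h * dpsi)) / (2 * h)
S = R_zxz(phi, theta, psi).T @ Rdot     # skew-symmetric [omega]_x
omega_numeric = np.array([S[2, 1], S[0, 2], S[1, 0]])
omega_formula = omega_from_euler_rates(phi, theta, psi, dphi, dtheta, dpsi)
print(np.allclose(omega_numeric, omega_formula, atol=1e-6))  # → True
```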

For more details see here.

References

Troubleshooting

About

License: Apache License 2.0


Languages

Python 96.5%, Shell 3.2%, Makefile 0.3%