Depth anything in Python

Question

VladimirYugay opened this issue 3 months ago

Amazing demo for Depth Anything!

I'd like to produce a similar point cloud in Python, and I'm wondering what the logic behind your JS implementation is.

Specifically:

How do you set up the intrinsic matrix and backproject the depth map and colors into 3D space?
What is the difference between Xenova/depth-anything-small-hf and LiheYoung/depth-anything-small-hf?

Joshua Lochner · Answer 1 · Wed Mar 20 2024 00:21:52 GMT+0800 (China Standard Time)

Fortunately, three.js handles all the complicated 3D scene management for me. All of the logic is in the setupScene function here. If you want to achieve something similar in Python, you can use the Gradio 3D model component. You will need to convert the depth image to a GLB mesh with trimesh (see example code here).
As stated in the README of Xenova/depth-anything-small-hf, it is the same model as LiheYoung/depth-anything-small-hf, but with ONNX weights so that it is compatible with transformers.js (and can therefore run in the browser).
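For the intrinsics and backprojection part of the question, here is a minimal NumPy sketch of the standard pinhole-camera approach. It is not taken from the JS demo: the function names `make_intrinsics` and `backproject` are hypothetical, and the 60-degree horizontal field of view is an assumed placeholder, since a single image does not tell you the true focal length. It also assumes `depth` already holds a per-pixel depth along the camera z-axis; a monocular model's relative output would likely need to be rescaled or converted before it behaves like that.

```python
import numpy as np

def make_intrinsics(width, height, fov_deg=60.0):
    """Pinhole intrinsic matrix from an ASSUMED horizontal field of view."""
    fx = 0.5 * width / np.tan(0.5 * np.deg2rad(fov_deg))
    fy = fx                                  # assumption: square pixels
    cx, cy = width / 2.0, height / 2.0       # assumption: principal point at image centre
    return np.array([[fx, 0.0, cx],
                     [0.0, fy, cy],
                     [0.0, 0.0, 1.0]])

def backproject(depth, rgb, K):
    """Lift an (H, W) depth map and (H, W, 3) colour image to N x 3 points and colours."""
    h, w = depth.shape
    # u indexes columns (x direction), v indexes rows (y direction)
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth.astype(np.float64)
    x = (u - K[0, 2]) * z / K[0, 0]
    y = (v - K[1, 2]) * z / K[1, 1]
    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    colors = rgb.reshape(-1, 3)
    return points, colors

# Toy usage: a flat plane one unit in front of the camera
depth = np.ones((4, 6))
rgb = np.zeros((4, 6, 3), dtype=np.uint8)
K = make_intrinsics(width=6, height=4)
points, colors = backproject(depth, rgb, K)
```

The resulting `points` and `colors` arrays can then be handed to whatever viewer or mesh library you prefer. Changing `fov_deg` stretches or flattens the cloud, so it is worth tuning by eye if the real camera parameters are unknown.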