fabiopoiesi's starred repositories
Stirling-PDF
Locally hosted web application that allows you to perform various operations on PDF files
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
SegmentAnything3D
[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes
PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
inbox_cleaner
A Python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.
cuda-bundle-adjustment
A CUDA implementation of Bundle Adjustment
Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
Multimodal-datasets
This repository is built in association with our position paper, "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As part of this release, we share information about recent multimodal datasets that are available for research purposes. We found that although 100+ multimodal language resources are available in the literature for various NLP tasks, publicly available multimodal datasets remain under-explored for reuse in subsequent problem domains.
concept-fusion
Code release for ConceptFusion [RSS 2023]
libsoftwaresync
Wireless software synchronization of multiple distributed smartphone cameras.
mathematical_robotics
Optimization for Robotics
finding_berries
PyTorch implementation of the paper "Finding Berries: Segmentation and Counting of Cranberries using Point Supervision and Shape Priors". Peri Akiva, Kristin Dana, Peter Oudemans, Michael Mars. CVPRW2020.
VSR_test_set
WildVSR