tfgbestneal's repositories
agentsflow
Drag & drop UI to build and run a flow of autogen AI agents
Anything-3D
Segment-Anything + 3D. Let's lift anything to 3D.
Auto-Annotation-Using-YOLOv8-and-SAm
Auto-annotation for generating segmentation datasets using YOLOv8 & SAM
ARKit-CoreLocation
Combines the high accuracy of AR with the scale of GPS data.
clip-distillation
Zero-label image classification via OpenCLIP knowledge distillation
Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
drive-any-robot
Official code and checkpoint release for "GNM: A General Navigation Model to Drive Any Robot".
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
extreme-parkour
Train your parkour robot in less than 20 hours.
ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
MiniGPT-5
Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
open_vins
An open source platform for visual-inertial navigation research.
OpenCDA
A generalized framework for prototyping full-stack cooperative driving automation applications under CARLA+SUMO.
openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
ORB_SLAM2
Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities
ORB_SLAM3
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
parkour
[CoRL 2023] Robot Parkour Learning
reachy_2023
Reachy 2023 workspace
SAM_gDINO_AutoLabeling
Auto Segmentation label generation with SAM (Segment Anything) + Grounding DINO
segment-anything-eo
Earth observation tools for Meta AI Segment Anything
segment-geospatial
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
SegmentAnyRGBD
Segment Any RGBD
Video-LLaMA
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
visualnav-transformer
Official code and checkpoint release for "ViNT: A Foundation Model for Visual Navigation".
X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
yolo8-tracking-counting-speed_estimation
Tracking, counting, and speed estimation using YOLOv8