ZhenghaoFei's starred repositories

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23948 | Issues: 317 | Issues: 388

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14366 | Issues: 115 | Issues: 376

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language: Python | License: MIT | Stargazers: 13410 | Issues: 128 | Issues: 307

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9388 | Issues: 76 | Issues: 455

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 8469 | Issues: 60 | Issues: 1440

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Language: Jupyter Notebook | License: MIT | Stargazers: 3699 | Issues: 72 | Issues: 15

drake

Model-based design and verification for robotics.

Language: C++ | License: NOASSERTION | Stargazers: 3201 | Issues: 174 | Issues: 6190

UniAD

[CVPR'23 Best Paper Award] Planning-oriented Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 3176 | Issues: 34 | Issues: 171

act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Language: Python | License: MIT | Stargazers: 2875 | Issues: 44 | Issues: 52

recognize-anything

Open-source and strong foundation image recognition models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2638 | Issues: 26 | Issues: 153

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Language: Python | License: MIT | Stargazers: 2481 | Issues: 37 | Issues: 68

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

GLIP

Grounded Language-Image Pre-training

Language: Python | License: MIT | Stargazers: 2103 | Issues: 45 | Issues: 169

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language: Python | License: Apache-2.0 | Stargazers: 1741 | Issues: 19 | Issues: 93
Language: Python | License: MIT | Stargazers: 1331 | Issues: 34 | Issues: 24

diffusion_policy

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Language: Python | License: MIT | Stargazers: 1177 | Issues: 14 | Issues: 82

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

Awesome_Prompting_Papers_in_Computer_Vision

A curated list of prompt-based papers in computer vision and vision-language learning.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 683 | Issues: 18 | Issues: 66
Language: Python | License: MIT | Stargazers: 571 | Issues: 18 | Issues: 29

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language: Python | License: MIT | Stargazers: 536 | Issues: 12 | Issues: 114

apriltag_ros

A ROS wrapper of the AprilTag 3 visual fiducial detector

Language: C++ | License: NOASSERTION | Stargazers: 352 | Issues: 13 | Issues: 103

pybboxes

Lightweight toolkit for bounding boxes providing conversion between bounding box types and simple computations.

Language: Python | License: MIT | Stargazers: 142 | Issues: 3 | Issues: 11

terrasentia-dataset

This dataset is intended for the evaluation of visual-based localization and mapping systems in agriculture.

tesse-core

Core components of TESSE to use as a submodule in a Unity project

Language: C# | License: GPL-2.0 | Stargazers: 12 | Issues: 5 | Issues: 2

strawberry-pp-w-r-dataset

This repo contains the dataset of strawberry picking point, ripeness, and weight annotations.

AutoVRL

AutoVRL is an open-source, high-fidelity simulator for simulation-to-real-world autonomous ground vehicle deep reinforcement learning research and development.

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 3 | Issues: 0
Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 4 | Issues: 1 | Issues: 1