zhoukang's starred repositories

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1472 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 47023 | Issues: 0

ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 680 | Issues: 0

HOV-SG

[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

Language: Python | License: MIT | Stargazers: 171 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 19633 | Issues: 0

ai-comic-factory

Generate comic panels using an LLM + SDXL. Powered by Hugging Face 🤗

Language: TypeScript | License: Apache-2.0 | Stargazers: 1003 | Issues: 0

Story-to-comic-AI

Create any comic page from your own ideas using state-of-the-art text-to-image and large language models.

Language: Python | Stargazers: 6 | Issues: 0

MJOLNIR

Python implementation of the paper Learning hierarchical relationships for object-goal navigation

Language: Python | Stargazers: 40 | Issues: 0

zson

ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings. NeurIPS 2022

Language: Python | Stargazers: 61 | Issues: 0

PSL-InstanceNav

Official implementation for the ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"

Language: Python | Stargazers: 13 | Issues: 0

Pixel-Navigator

Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024

Language: Python | Stargazers: 64 | Issues: 0

Layout-based-sTDE

Layout-based Causal Inference for Object Navigation (CVPR 2023)

Language: Python | License: MIT | Stargazers: 25 | Issues: 0

3DAwareNav

[CVPR 2023] We propose a framework for the challenging 3D-aware ObjectNav task based on two straightforward sub-policies. The two sub-policies, a corner-guided exploration policy and a category-aware identification policy, run simultaneously, using online-fused 3D points as observations.

Language: Python | License: MIT | Stargazers: 56 | Issues: 0

AKGVP

Aligning Knowledge Graph with Visual Perception for Object-goal Navigation (ICRA 2024)

Language: Python | License: MIT | Stargazers: 22 | Issues: 0

GaussNav

PyTorch implementation of the paper "GaussNav: Gaussian Splatting for Visual Navigation"

Language: Python | Stargazers: 48 | Issues: 0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python | License: MIT | Stargazers: 2028 | Issues: 0

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 4077 | Issues: 0

ClipCap-Chinese

An image captioning model based on ClipCap

Language: Python | Stargazers: 277 | Issues: 0

SGM

Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)

Language: Python | License: MIT | Stargazers: 16 | Issues: 0

torch-cam

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Language: Python | License: Apache-2.0 | Stargazers: 2000 | Issues: 0

MatterSim_BEVBert_Docker

A Docker image that contains both MatterSim and BEVBert.

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

Matterport3DSimulator

AI Research Platform for Reinforcement Learning from Real Panoramic Images.

Language: C++ | License: NOASSERTION | Stargazers: 491 | Issues: 0

Demand-driven-navigation

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation

Language: Python | Stargazers: 43 | Issues: 0

visualnav-transformer

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Language: Python | License: MIT | Stargazers: 545 | Issues: 0

BEV-Scene-Graph

[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation

Language: Python | Stargazers: 116 | Issues: 0

csg-os

Commonsense Scene Graph-based Target Localization for Object Search

Language: Python | Stargazers: 9 | Issues: 0

spatial_attention

Visual Navigation with Spatial Attention

Language: Python | License: Apache-2.0 | Stargazers: 33 | Issues: 0