zhoukang's starred repositories
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ai-comic-factory
Generate comic panels using an LLM + SDXL. Powered by Hugging Face 🤗
Story-to-comic-AI
Create any comic page using state-of-the-art text-to-image and large language models with your limitless imagination
PSL-InstanceNav
Official implementation of the ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"
Pixel-Navigator
Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024
Layout-based-sTDE
Layout-based Causal Inference for Object Navigation (CVPR 2023)
3DAwareNav
[CVPR 2023] We propose a framework for the challenging 3D-aware ObjectNav task based on two straightforward sub-policies. The two sub-policies, a corner-guided exploration policy and a category-aware identification policy, run simultaneously using online fused 3D points as observations.
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
VisualGLM-6B
Chinese and English multimodal conversational language model
ClipCap-Chinese
An image captioning model based on ClipCap (Chinese)
MatterSim_BEVBert_Docker
A Docker image containing both MatterSim and BEVBert.
Matterport3DSimulator
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
Demand-driven-navigation
"Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation"
visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
BEV-Scene-Graph
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
spatial_attention
Visual Navigation with Spatial Attention