Sleepy_chord's starred repositories
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
TeleVision
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
ExpertLLaMA
An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.
RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
Visual-LLaMA
Open LLaMA Eyes to See the World
ChatGPT-streamlit
Easy to use UI built with Streamlit for using ChatGPT, Claude, Stable Diffusion and beyond