wxywb's starred repositories
localGPT-Vision
Chat with your documents using Vision Language Models. This repo implements an end-to-end RAG pipeline with both local and proprietary VLMs.
Awesome-Game-Analysis
A comprehensive collection of video game tech analysis resources.
minimal-diffusion
A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)
tiny-diffusion
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
micro_diffusion
Code for our research paper on micro-budget training of large-scale diffusion models.
MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
BLINK_Benchmark
This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12390 [ECCV 2024]
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
milvus-model
The embedding/reranking model zoo helps users convert their unstructured data into embeddings.
Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Retriever-for-GPTs
An external retriever for GPTs implemented with Zilliz Cloud Pipelines, a more flexible and economical alternative to the default GPTs knowledge base.