jzyztzn's starred repositories
StableDiffusionOnDevice
本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。
CLIP_benchmark
CLIP-like model evaluation
clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Text2Image-Retrieval
计算机视觉课程设计-基于Chinese-CLIP的图文检索系统
clip-image-search
A simple image search engine using CLIP feature.
CLIP-ImageSearch-NCNN
CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android
clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
CLIP-Chinese
中文CLIP预训练模型
ollama-app
A modern and easy-to-use client for Ollama
OllamaDroid
A Ollama client for Android!
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
DiffSynth-Studio
Enjoy the magic of Diffusion models!
mnn-segment-anything
segment-anything based mnn
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
all-seeing
[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.