jzyztzn's starred repositories

StableDiffusionOnDevice

本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。

Language:C++License:MITStargazers:68Issues:0Issues:0

UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language:PythonLicense:MITStargazers:335Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:MITStargazers:29710Issues:0Issues:0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:823Issues:0Issues:0

re2

RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.

Language:C++License:BSD-3-ClauseStargazers:8808Issues:0Issues:0

CLIP_benchmark

CLIP-like model evaluation

Language:Jupyter NotebookLicense:MITStargazers:547Issues:0Issues:0

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language:PythonLicense:MITStargazers:2138Issues:0Issues:0

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonLicense:NOASSERTIONStargazers:12311Issues:0Issues:0

Text2Image-Retrieval

计算机视觉课程设计-基于Chinese-CLIP的图文检索系统

Language:PythonStargazers:31Issues:0Issues:0

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:PythonLicense:NOASSERTIONStargazers:622Issues:0Issues:0

LocalLM

Android app for running transformers locally using LLama.cpp & Whisper.cpp

Language:KotlinLicense:GPL-3.0Stargazers:13Issues:0Issues:0

clip-image-search

A simple image search engine using CLIP feature.

Language:PythonLicense:MITStargazers:47Issues:0Issues:0

CLIP-ImageSearch-NCNN

CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android

Language:C++Stargazers:200Issues:0Issues:0

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookLicense:MITStargazers:2288Issues:0Issues:0

maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Language:DartLicense:MITStargazers:1080Issues:0Issues:0

CLIP-Chinese

中文CLIP预训练模型

Language:PythonStargazers:371Issues:0Issues:0

ollama-app

A modern and easy-to-use client for Ollama

Language:DartLicense:Apache-2.0Stargazers:301Issues:0Issues:0

OllamaDroid

A Ollama client for Android!

Language:JavaStargazers:69Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:27869Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:6011Issues:0Issues:0

mnn-segment-anything

segment-anything based mnn

Language:C++Stargazers:31Issues:0Issues:0
Language:C++Stargazers:21Issues:0Issues:0

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Language:SvelteLicense:MITStargazers:33144Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3109Issues:0Issues:0

all-seeing

[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"

Language:PythonStargazers:429Issues:0Issues:0

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language:PythonLicense:MITStargazers:2899Issues:0Issues:0

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1156Issues:0Issues:0

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonLicense:MITStargazers:2427Issues:0Issues:0

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2069Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11022Issues:0Issues:0