Yao Zhou's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:52353Issues:938Issues:1080

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49473Issues:562Issues:209

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:24305Issues:256Issues:305

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:18924Issues:113Issues:1255

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:16893Issues:77Issues:229

rembg

Rembg is a tool to remove images background

Language:PythonLicense:MITStargazers:16528Issues:148Issues:503

surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:10070Issues:86Issues:134

magika

Detect file content types with deep learning

Language:RustLicense:Apache-2.0Stargazers:7770Issues:37Issues:420

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5950Issues:40Issues:86

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:5697Issues:52Issues:567

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5263Issues:39Issues:37

torchtune

PyTorch native finetuning library

Language:PythonLicense:BSD-3-ClauseStargazers:4103Issues:46Issues:593

VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:4054Issues:115Issues:81

Queryable

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Language:SwiftLicense:MITStargazers:2626Issues:16Issues:36

gemma

Open weights LLM from Google DeepMind.

Language:PythonLicense:Apache-2.0Stargazers:2422Issues:32Issues:32

mpire

A Python package for easy multiprocessing, but faster than multiprocessing

Language:PythonLicense:MITStargazers:2004Issues:15Issues:83

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonLicense:Apache-2.0Stargazers:1182Issues:18Issues:63

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonLicense:MITStargazers:638Issues:7Issues:81

text-dedup

All-in-one text de-duplication

Language:PythonLicense:Apache-2.0Stargazers:594Issues:4Issues:67

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:562Issues:8Issues:23

llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Language:PythonLicense:Apache-2.0Stargazers:548Issues:12Issues:62

KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

Language:CLicense:MITStargazers:195Issues:2Issues:7

Full-Segment-Anything

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-processing: removing duplicated or small regions and holes, under flexible input image size

Language:PythonLicense:MITStargazers:134Issues:2Issues:9

the-stack-v2

Code for the curation of The Stack v2 and StarCoder2 training data

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:89Issues:5Issues:5

GeoReasoner

GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Mode