Yao Zhou's starred repositories

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:51706Issues:935Issues:1077

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49223Issues:561Issues:202

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23606Issues:252Issues:288

rembg

Rembg is a tool to remove images background

Language:PythonLicense:MITStargazers:15683Issues:141Issues:494

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:12873Issues:86Issues:833

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:10882Issues:98Issues:352

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:9378Issues:79Issues:107

magika

Detect file content types with deep learning

Language:RustLicense:Apache-2.0Stargazers:7612Issues:36Issues:387
Language:PythonLicense:Apache-2.0Stargazers:7039Issues:67Issues:69

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5832Issues:38Issues:77

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5790Issues:47Issues:75

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5191Issues:38Issues:37

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:4513Issues:45Issues:390

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3895Issues:114Issues:73

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3698Issues:43Issues:386

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonLicense:NOASSERTIONStargazers:2571Issues:37Issues:50

gemma

Open weights LLM from Google DeepMind.

Language:PythonLicense:Apache-2.0Stargazers:2276Issues:32Issues:27

mpire

A Python package for easy multiprocessing, but faster than multiprocessing

Language:PythonLicense:MITStargazers:1975Issues:15Issues:83

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language:PythonLicense:Apache-2.0Stargazers:1965Issues:30Issues:227

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonLicense:Apache-2.0Stargazers:1135Issues:19Issues:62

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonLicense:MITStargazers:593Issues:6Issues:76

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:585Issues:7Issues:90

text-dedup

All-in-one text de-duplication

Language:PythonLicense:Apache-2.0Stargazers:558Issues:4Issues:57

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:517Issues:8Issues:18

the-stack-v2

Code for the curation of The Stack v2 and StarCoder2 training data

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:79Issues:5Issues:5

GeoReasoner

GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Mode

Stargazers:17Issues:0Issues:0