LuoweiZhou

Luowei Zhou's starred repositories

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonMIT43661 898 627

Fooocus

Focus on prompting and generating

Language:PythonGPL-3.040230 308 1497

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language:ShellApache-2.025217 305 254

mojo

The Mojo Programming Language

Language:MojoNOASSERTION22908 266 2044

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT20634 203 372

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.016245 132 125

triton

Development repository for the Triton language and compiler

Language:C++MIT12763 189 1412

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonApache-2.011961 101 517

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION11663 147 816

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.010846 64 244

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language:Python10313 167 655

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Language:PythonMIT7562 81 151

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language:Jupyter NotebookApache-2.07523 86 93

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6548 63 80

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.05882 65 421

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonBSD-3-Clause5525 63 98

pinokio

AI Browser

Language:JavaScriptMIT3290 50 223

LLM-As-Chatbot

LLM as a Chatbot Service

Language:PythonApache-2.03281 53 66

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3168 127 18

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python2893 33 132

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonApache-2.02122 29 138

Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.

Language:PythonMIT1721 22 31