Luowei Zhou (LuoweiZhou)

Company: Google

Home Page: https://luoweizhou.github.io

Organizations
deepvision-class
MichiganCOG

Luowei Zhou's starred repositories

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language: Python | License: MIT | Stargazers: 43661 | Issues: 898 | Issues: 627

Fooocus

Focus on prompting and generating

Language: Python | License: GPL-3.0 | Stargazers: 40230 | Issues: 308 | Issues: 1497

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language: Shell | License: Apache-2.0 | Stargazers: 25217 | Issues: 305 | Issues: 254

mojo

The Mojo Programming Language

Language: Mojo | License: NOASSERTION | Stargazers: 22908 | Issues: 266 | Issues: 2044

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20634 | Issues: 203 | Issues: 372

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

triton

Development repository for the Triton language and compiler

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 11961 | Issues: 101 | Issues: 517

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 11663 | Issues: 147 | Issues: 816

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10846 | Issues: 64 | Issues: 244

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Language: Python | License: MIT | Stargazers: 7562 | Issues: 81 | Issues: 151

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7523 | Issues: 86 | Issues: 93

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6548 | Issues: 63 | Issues: 80

CogVLM

A state-of-the-art open visual language model | Multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5882 | Issues: 65 | Issues: 421

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5525 | Issues: 63 | Issues: 98

pinokio

AI Browser

Language: JavaScript | License: MIT | Stargazers: 3290 | Issues: 50 | Issues: 223

LLM-As-Chatbot

LLM as a Chatbot Service

Language: Python | License: Apache-2.0 | Stargazers: 3281 | Issues: 53 | Issues: 66

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

VGen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language: Python | License: Apache-2.0 | Stargazers: 2122 | Issues: 29 | Issues: 138

Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.

Language: Python | License: MIT | Stargazers: 1721 | Issues: 22 | Issues: 31

basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

Language: Python | License: MIT | Stargazers: 1291 | Issues: 22 | Issues: 59

HeyGenClone

A simple and open-source analogue of the HeyGen system

AgentSims

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Language: Python | License: MIT | Stargazers: 745 | Issues: 4 | Issues: 25

universal_manipulation_interface

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Language: Python | License: MIT | Stargazers: 608 | Issues: 19 | Issues: 57

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

clippinator

AI programming assistant

Spider2-V

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 100 | Issues: 3 | Issues: 1

RandBox

[ICCV 2023] PyTorch implementation of RandBox