HuaZheLei's starred repositories

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:34592Issues:258Issues:1290

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language:TypeScriptLicense:GPL-3.0Stargazers:29655Issues:283Issues:451

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:27770Issues:212Issues:512

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:22436Issues:182Issues:3470

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:15976Issues:152Issues:1241

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:9714Issues:98Issues:17

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:4945Issues:63Issues:360

agents

An Open-source Framework for Autonomous Language Agents

Language:PythonLicense:Apache-2.0Stargazers:4495Issues:58Issues:66

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3856Issues:55Issues:289

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonLicense:BSD-3-ClauseStargazers:2842Issues:59Issues:84

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2060Issues:21Issues:19

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language:PythonLicense:Apache-2.0Stargazers:1841Issues:27Issues:149

ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Language:PythonLicense:NOASSERTIONStargazers:1731Issues:32Issues:0

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

ControlNet-for-Diffusers

Transfer the ControlNet with any basemodel in diffusers🔥

Language:PythonLicense:MITStargazers:758Issues:15Issues:45

AgentSims

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Language:PythonLicense:MITStargazers:678Issues:3Issues:21

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonLicense:Apache-2.0Stargazers:617Issues:10Issues:22

tarsier

Vision utilities for web interaction agents 👀

Language:Jupyter NotebookLicense:MITStargazers:486Issues:2Issues:9

ScaleCrafter

[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Language:PythonLicense:Apache-2.0Stargazers:389Issues:17Issues:26

prompt-pretraining

Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"

Language:PythonLicense:Apache-2.0Stargazers:245Issues:5Issues:13

COMM

Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

ai-town-rwkv-proxy

Run a large AI town, locally, via RWKV !

PALI3

Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"

Language:PythonLicense:MITStargazers:115Issues:4Issues:2
Language:PythonLicense:Apache-2.0Stargazers:58Issues:4Issues:6

AMBER

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Language:PythonLicense:Apache-2.0Stargazers:55Issues:2Issues:0

Lion

Lion: Kindling Vision Intelligence within Large Language Models

Q-Bench

An archived version of Q-Bench. We will make updates in https://github.com/q-future/Q-Bench in the future.

Language:Jupyter NotebookStargazers:10Issues:0Issues:0