geekwish's repositories
agentscope
Start building LLM-empowered multi-agent applications in an easier way.
agentUniverse
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications. Furthermore, through the community, they can exchange and share practices of patterns across different domains.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
awesome-ai-agents
A list of AI autonomous agents
backgroundremover
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.
CharacterGen
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
ComfyUI-IC-Light
Using IC-LIght models in ComfyUI
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
dust3r
DUSt3R: Geometric 3D Vision Made Easy
EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Kolors
Kolors Team
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
outfit-anyone
Outfit Anyone(最新修复版): Ultra-high quality virtual try-on for Any Clothing and Any Person
Paints-UNDO
Understand Human Behavior to Align True Needs
persuasive_jailbreaker
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
PhotoMaker
PhotoMaker [CVPR 2024]
Rope
GUI-focused roop
snap_wtf_macos
WTF Snapshot fuzzing of macOS targets