bageyalu's starred repositories

CatVTON

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

Language:PythonLicense:NOASSERTIONStargazers:552Issues:0Issues:0

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language:PythonLicense:Apache-2.0Stargazers:3630Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:39Issues:0Issues:0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonLicense:Apache-2.0Stargazers:1009Issues:0Issues:0

Stable-Hair

Stable-Hair: Real-World Hair Transfer via Diffusion Model

License:Apache-2.0Stargazers:299Issues:0Issues:0

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Language:PythonLicense:Apache-2.0Stargazers:1531Issues:0Issues:0

neuralgcm

Hybrid ML + physics model of the Earth's atmosphere

Language:PythonLicense:Apache-2.0Stargazers:546Issues:0Issues:0

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Language:PythonLicense:Apache-2.0Stargazers:872Issues:0Issues:0

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonLicense:NOASSERTIONStargazers:15686Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:14728Issues:0Issues:0

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language:PythonLicense:GPL-3.0Stargazers:3377Issues:0Issues:0

Make-Your-Video

[IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance

Language:PythonLicense:NOASSERTIONStargazers:177Issues:0Issues:0

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language:PythonStargazers:164Issues:0Issues:0

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language:PythonLicense:NOASSERTIONStargazers:651Issues:0Issues:0

pyecharts-gallery

Just use pyecharts to imitate Echarts official example.

Language:HTMLLicense:MITStargazers:1159Issues:0Issues:0

HanLP

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Language:PythonLicense:Apache-2.0Stargazers:33288Issues:0Issues:0

LexiLaw

LexiLaw - 中文法律大模型

Language:PythonLicense:MITStargazers:646Issues:0Issues:0

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Language:PythonLicense:Apache-2.0Stargazers:2128Issues:0Issues:0

DeepLabCut

Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

Language:PythonLicense:LGPL-3.0Stargazers:4495Issues:0Issues:0

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:4638Issues:0Issues:0

vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Language:PythonLicense:MITStargazers:10342Issues:0Issues:0

Table-LLaVA

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.

Language:PythonLicense:Apache-2.0Stargazers:105Issues:0Issues:0

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:11048Issues:0Issues:0

MaxKB

🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。

Language:PythonLicense:GPL-3.0Stargazers:8845Issues:0Issues:0

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language:PythonLicense:Apache-2.0Stargazers:3018Issues:0Issues:0

MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Language:PythonLicense:NOASSERTIONStargazers:543Issues:0Issues:0

MeshXL

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation

Language:PythonStargazers:175Issues:0Issues:0

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonLicense:MITStargazers:3536Issues:0Issues:0

unitable

UniTable: Towards a Unified Table Foundation Model

Language:Jupyter NotebookLicense:MITStargazers:318Issues:0Issues:0

DocProj

Document Rectification and Illumination Correction using a Patch-based CNN

Language:PythonLicense:MITStargazers:331Issues:0Issues:0