Beast code in Giters

bageyalu's starred repositories

CatVTON

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

Language:PythonNOASSERTION55200

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language:PythonApache-2.0363000

tesp

Language:PythonNOASSERTION3900

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonApache-2.0100900

Stable-Hair

Stable-Hair: Real-World Hair Transfer via Diffusion Model

Apache-2.029900

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Language:PythonApache-2.0153100

neuralgcm

Hybrid ML + physics model of the Earth's atmosphere

Language:PythonApache-2.054600

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Language:PythonApache-2.087200

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonNOASSERTION1568600

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonMIT1472800

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language:PythonGPL-3.0337700

Make-Your-Video

[IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance

Language:PythonNOASSERTION17700

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language:Python16400

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language:PythonNOASSERTION65100

pyecharts-gallery

Just use pyecharts to imitate Echarts official example.

Language:HTMLMIT115900

HanLP

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Language:PythonApache-2.03328800

LexiLaw

LexiLaw - 中文法律大模型

Language:PythonMIT64600

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Language:PythonApache-2.0212800

DeepLabCut

Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

Language:PythonLGPL-3.0449500

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Language:PythonApache-2.0463800

vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Language:PythonMIT1034200

Table-LLaVA

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.

Language:PythonApache-2.010500