Beast code in Giters

Yman's starred repositories

ComfyUI-Workflows-ZHO

My ComfyUI workflows collection

GPL-3.0 · 4231 · 0 · 0

ISAT_with_segment_anything

Interactive semi-automatic image labeling tool built on SAM (Segment Anything Model); supports SAM, sam-hq, MobileSAM, EdgeSAM, etc.

Language: Python · NOASSERTION · 1103 · 0 · 0

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language: Jupyter Notebook · CC-BY-4.0 · 28810 · 0 · 0

APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Language: Python · GPL-3.0 · 776 · 0 · 0

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language: Python · MIT · 17702 · 0 · 0

ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs (mesh, UV texture, etc.) using cutting-edge algorithms (3DGS, NeRF, etc.)

Language: Python · MIT · 1857 · 0 · 0

Rope

GUI-focused roop

Language: Python · GPL-3.0 · 4070 · 0 · 0

TwitterMediaHarvest

Download Twitter media with just one click.

Language: TypeScript · MIT · 383 · 0 · 0

XHS-Spider

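Xiaohongshu data collection tool with batch download of site images and video resources, and a polished interface (batch download, video extraction, images, watermark removal, etc.). Telegram: https://t.me/+ZtLSwuIKTo44MDY1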

GPL-3.0 · 662 · 0 · 0

TikTokDownloader

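Data collection tool for TikTok profiles, mixes, livestreams, videos, image sets, and original sounds, and for Douyin profiles, videos, image sets, favorites, livestreams, original sounds, collections, comments, accounts, search, and trending lists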

Language: Python · GPL-3.0 · 6781 · 0 · 0

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

5372 · 0 · 0

Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Language: Python · Apache-2.0 · 993 · 0 · 0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python · NOASSERTION · 2639 · 0 · 0

roop

one-click deepfake (face swap)

Language: Python · GPL-3.0 · 5 · 0 · 0

Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Language: Python · Apache-2.0 · 1485 · 0 · 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

10879 · 0 · 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python · Apache-2.0 · 6172 · 0 · 0

roop

one-click face swap

Language: Python · GPL-3.0 · 25823 · 0 · 0

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language: Python · MIT · 865 · 0 · 0

MockingBird

🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real time

Language: Python · NOASSERTION · 34633 · 0 · 0

Realtime-Voice-Clone-Chinese

🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real time

NOASSERTION · 46 · 0 · 0

Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

GPL-3.0 · 8642 · 0 · 0

YmanChris

Wav2Lip-GFPGAN

xuniren

SimSwap

MiniGPT-4

Ask-Anything

DragGAN

LiveSpeechPortraits

ChatGLM-Tuning