gsyzycww's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:27349Issues:162Issues:361

onnx

Open standard for machine learning interoperability

Language:PythonLicense:Apache-2.0Stargazers:17242Issues:435Issues:2737

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14961Issues:103Issues:954

Scrapegraph-ai

Python scraper based on AI

Language:PythonLicense:MITStargazers:12888Issues:86Issues:164

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language:PythonLicense:MITStargazers:8591Issues:56Issues:3189

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonLicense:AGPL-3.0Stargazers:8136Issues:40Issues:282

llama-cpp-python

Python bindings for llama.cpp

Language:PythonLicense:MITStargazers:7150Issues:66Issues:974

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:6844Issues:40Issues:65

automatic

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

Language:PythonLicense:AGPL-3.0Stargazers:5297Issues:58Issues:2003

ToonCrafter

a research paper for generative cartoon interpolation

Language:PythonLicense:Apache-2.0Stargazers:4768Issues:52Issues:39

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language:PythonLicense:MITStargazers:2247Issues:28Issues:46

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonLicense:BSD-2-ClauseStargazers:2172Issues:20Issues:41

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language:PythonLicense:NOASSERTIONStargazers:1866Issues:38Issues:54

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language:PythonLicense:Apache-2.0Stargazers:1474Issues:27Issues:114

FinRobot

FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1179Issues:18Issues:15

AutoRAG

RAG AutoML Tool - Find optimal RAG pipeline for your own data.

Language:PythonLicense:Apache-2.0Stargazers:1129Issues:14Issues:291

HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents.

Language:PythonLicense:MITStargazers:903Issues:12Issues:12

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonLicense:MITStargazers:900Issues:8Issues:40

AnimateAnyone

Unofficial Implementation of Animate Anyone by Novita AI

Language:PythonLicense:Apache-2.0Stargazers:703Issues:11Issues:9

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:599Issues:11Issues:28

LookOnceToHear

A novel human-interaction method for real-time speech extraction on headphones.

Language:PythonLicense:NOASSERTIONStargazers:502Issues:10Issues:1
Language:PythonLicense:AGPL-3.0Stargazers:443Issues:15Issues:25

streamv2v

Official Pytorch implementation of StreamV2V.

Language:PythonLicense:NOASSERTIONStargazers:387Issues:8Issues:5

Vista

A Generalizable World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:374Issues:18Issues:16

Deblur-GS

[I3D 2024] Deblur-GS: 3D Gaussian Splatting from Camera Motion Blurred Images

Language:PythonLicense:NOASSERTIONStargazers:303Issues:6Issues:10

stream-wav2lip

优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs

Language:PythonLicense:Apache-2.0Stargazers:17Issues:0Issues:0

RTMPAddRes

RTMP video add audio or picture or text online and output to user client

Language:PythonStargazers:1Issues:0Issues:0