Beast code in Giters

gsyzycww's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonNOASSERTION27349 162 361

onnx

Open standard for machine learning interoperability

Language:PythonApache-2.017242 435 2737

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.014961 103 954

Scrapegraph-ai

Python scraper based on AI

Language:PythonMIT12888 86 164

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language:PythonMIT8591 56 3189

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonAGPL-3.08136 40 282

llama-cpp-python

Python bindings for llama.cpp

Language:PythonMIT7150 66 974

Omost

Your image is almost there!

Language:PythonApache-2.06844 40 65

automatic

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

Language:PythonAGPL-3.05297 58 2003

ToonCrafter

a research paper for generative cartoon interpolation

Language:PythonApache-2.04768 52 39

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language:PythonMIT2247 28 46

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonBSD-2-Clause2172 20 41

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:Python2040 39 42

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language:PythonNOASSERTION1866 38 54

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language:PythonApache-2.01474 27 114

FinRobot

FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀

Language:Jupyter NotebookApache-2.01179 18 15

AutoRAG

RAG AutoML Tool - Find optimal RAG pipeline for your own data.

Language:PythonApache-2.01129 14 291

LLaVA-NeXT

Language:Python1119 21 86

HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents.

Language:PythonMIT903 12 12

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonMIT900 8 40

AnimateAnyone

Unofficial Implementation of Animate Anyone by Novita AI

Language:PythonApache-2.0703 11 9

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonApache-2.0599 11 28

LookOnceToHear

A novel human-interaction method for real-time speech extraction on headphones.

Language:PythonNOASSERTION502 10 1

Era3D

Language:PythonAGPL-3.0443 15 25

TeleSpeech-ASR

Language:Python393 10 35

streamv2v

Official Pytorch implementation of StreamV2V.

Language:PythonNOASSERTION387 8 5

Vista

A Generalizable World Model for Autonomous Driving

Language:PythonApache-2.0374 18 16

Deblur-GS

[I3D 2024] Deblur-GS: 3D Gaussian Splatting from Camera Motion Blurred Images

Language:PythonNOASSERTION303 6 10

stream-wav2lip

优化wav2lip的执行步骤，将头脸分离、嘴型替换、回补背景三个步骤分离，添加gfpgan强化面部功能，实现提前解帧，流式循环处理，对接obs

Language:PythonApache-2.01700

RTMPAddRes

RTMP video add audio or picture or text online and output to user client

Language:Python100