xxyyboy

xxyyboy

Geek Repo

Location:ShangHai

Github PK Tool:Github PK Tool

xxyyboy's repositories

Easy-Wav2Lip

Colab for making Wav2Lip high quality and easy to use

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

imgutils

A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

AI-Vtuber

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。

License:GPL-3.0Stargazers:0Issues:0Issues:0

bark-gui

🔊 Text-Prompted Generative Audio Model with Gradio

License:MITStargazers:0Issues:0Issues:0

Bert-VITS2-Integration-package

vits2 backbone with bert

License:GPL-3.0Stargazers:0Issues:0Issues:0

chatbot

ChatGPT带火了聊天机器人,主流的趋势都调整到了GPT类模式,本项目也与时俱进,会在近期更新GPT类版本。基于本项目和自己的语料可以训练出自己想要的聊天机器人,用于智能客服、在线问答、闲聊等场景。

Stargazers:0Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepDanbooru

AI based multi-label girl image classification system, implemented by using TensorFlow.

License:MITStargazers:0Issues:0Issues:0

DragGAN

Online Demo and Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)

Stargazers:0Issues:0Issues:0

dssim

Image similarity comparison simulating human perception (multiscale SSIM in Rust)

License:AGPL-3.0Stargazers:0Issues:0Issues:0

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

License:MITStargazers:0Issues:0Issues:0

face_recognition

The world's simplest facial recognition api for Python and the command line

License:MITStargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

License:MITStargazers:0Issues:0Issues:0

gpt4free

decentralising the Ai Industry, just some language model api's...

License:GPL-3.0Stargazers:0Issues:0Issues:0

HuatuoGPT

HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)

License:Apache-2.0Stargazers:0Issues:0Issues:0

img2webp_img2avif

image to webp, image to avif

Language:CStargazers:0Issues:0Issues:0

img_apng2webp

convert apng png to webp

Language:CStargazers:0Issues:0Issues:0

Interview-for-Algorithm-Engineer

【三年面试五年模拟】算法工程师秘籍。AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、图像处理、元宇宙、AGI、SLAM等AI行业面试笔试经验分享

License:GPL-3.0Stargazers:0Issues:0Issues:0

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain

License:Apache-2.0Stargazers:0Issues:0Issues:0

LightDiffusionFlow

This extension is developed for AUTOMATIC1111's Stable Diffusion web UI that provides import/export options for parameters.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

License:MITStargazers:0Issues:0Issues:0

LoRA_Easy_Training_Scripts

A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy

License:GPL-3.0Stargazers:0Issues:0Issues:0

metahuman-stream

Real time interactive streaming digital human

License:Apache-2.0Stargazers:0Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

webp_server_go

Go version of WebP Server. A tool that will serve your JPG/PNGs as WebP format with compression, on-the-fly.

License:GPL-3.0Stargazers:0Issues:0Issues:0

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

License:GPL-3.0Stargazers:0Issues:0Issues:0