Qifan Wu (scm573)

scm573

Geek Repo

Location:Osaka

Github PK Tool:Github PK Tool

Qifan Wu's starred repositories

tile-merger

A tile merger, for merging multiple images into a single image. Written in C# for the .NET framework for Windows operating systems.

Language:C#Stargazers:15Issues:0Issues:0

StabilityMatrix

Multi-Platform Package Manager for Stable Diffusion

Language:C#License:AGPL-3.0Stargazers:4446Issues:0Issues:0

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:52074Issues:0Issues:0
Language:CStargazers:49Issues:0Issues:0

1ZLAB_Face_Track_Robot

二自由度云台实现人脸追踪。 首先是使用一款名字叫做IP摄像头的APP 采集手机摄像头的图像,在手机上建立一个视频流服务器。在局域网下,PC通过IP还有端口号获取图像。使用OpenCV的人脸检测的API获取人脸在画面中的位置,根据人脸位置距离画面中心的x轴与y轴的偏移量(offset) ,通过P比例控制(PID控制中最简单的一种)控制二自由度云台上臂与下臂的旋转角度,将角度信息通过串口通信UART发送给ESP32单片机(不限于ESP32,STM32,Arduino都可以)解析执行对应的操作,从而使得人脸尽可能处在画面的正中间。

Language:PythonLicense:GPL-3.0Stargazers:184Issues:0Issues:0

unity-AI-Chat-Toolkit

使用unity实现AI聊天相关功能。目前这个库包含了对chatgpt、chatglm等大语言模型的api调用的代码实现以及实现了微软Azure以及百度AI的语音服务功能,语音服务均采用web api实现,支持Windows/WebGL/Android等平台

License:MITStargazers:442Issues:0Issues:0

open-webui

User-friendly WebUI for AI (Formerly Ollama WebUI)

Language:SvelteLicense:MITStargazers:40893Issues:0Issues:0

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:91513Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Language:PythonLicense:MITStargazers:7581Issues:0Issues:0

VITS

ACG Text-to-Speech

Language:PythonLicense:MITStargazers:176Issues:0Issues:0

cobalt

best way to save what you love

Language:SvelteLicense:AGPL-3.0Stargazers:14831Issues:0Issues:0

MoeGoe

Executable file for VITS inference

Language:PythonLicense:MITStargazers:2332Issues:0Issues:0

MoeTTS

Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc

License:GPL-3.0Stargazers:968Issues:0Issues:0

generative-ai-android

The official Android library for the Google Gemini API

Language:KotlinLicense:Apache-2.0Stargazers:711Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7096Issues:0Issues:0

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

Stargazers:28589Issues:0Issues:0

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6583Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38677Issues:0Issues:0

SwiftInfer

Efficient AI Inference & Serving

Language:PythonLicense:Apache-2.0Stargazers:452Issues:0Issues:0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31723Issues:0Issues:0

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language:PythonLicense:AGPL-3.0Stargazers:4705Issues:0Issues:0

upscayl

🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.

Language:TypeScriptLicense:AGPL-3.0Stargazers:30178Issues:0Issues:0

threestudio

A unified framework for 3D content generation.

Language:PythonLicense:Apache-2.0Stargazers:6173Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:4192Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:35486Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20689Issues:0Issues:0

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

License:Apache-2.0Stargazers:14386Issues:0Issues:0

waifuc

Efficient Train Data Collector for Anime Waifu

Language:PythonLicense:MITStargazers:264Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:23431Issues:0Issues:0