Beast code in Giters

Qifan Wu's starred repositories

tile-merger

A tile merger, for merging multiple images into a single image. Written in C# for the .NET framework for Windows operating systems.

Language:C#1500

StabilityMatrix

Multi-Platform Package Manager for Stable Diffusion

Language:C#AGPL-3.0444600

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.05207400

fashionstar-gimbal-2dof-stm32f103-openmv

STM32 OpenMV 云台

Language:C4900

二自由度云台实现人脸追踪。首先是使用一款名字叫做IP摄像头的APP 采集手机摄像头的图像，在手机上建立一个视频流服务器。在局域网下，PC通过IP还有端口号获取图像。使用OpenCV的人脸检测的API获取人脸在画面中的位置，根据人脸位置距离画面中心的x轴与y轴的偏移量(offset) ，通过P比例控制(PID控制中最简单的一种)控制二自由度云台上臂与下臂的旋转角度，将角度信息通过串口通信UART发送给ESP32单片机(不限于ESP32，STM32,Arduino都可以)解析执行对应的操作，从而使得人脸尽可能处在画面的正中间。

Language:PythonGPL-3.018400

unity-AI-Chat-Toolkit

使用unity实现AI聊天相关功能。目前这个库包含了对chatgpt、chatglm等大语言模型的api调用的代码实现以及实现了微软Azure以及百度AI的语音服务功能，语音服务均采用web api实现，支持Windows/WebGL/Android等平台

MIT44200

open-webui

User-friendly WebUI for AI (Formerly Ollama WebUI)

Language:SvelteMIT4089300

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Language:GoMIT9151300

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Language:PythonMIT758100

VITS

ACG Text-to-Speech

Language:PythonMIT17600

cobalt

best way to save what you love

Language:SvelteAGPL-3.01483100

MoeGoe

Executable file for VITS inference

Language:PythonMIT233200

MoeTTS

Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc

GPL-3.096800

generative-ai-android

The official Android library for the Google Gemini API

Language:KotlinApache-2.071100

LWM

Language:PythonApache-2.0709600

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

2858900

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT658300

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.03867700

SwiftInfer

Efficient AI Inference & Serving

Language:PythonApache-2.045200

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonApache-2.03172300

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language:PythonAGPL-3.0470500

upscayl

🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.

Language:TypeScriptAGPL-3.03017800

threestudio

A unified framework for 3D content generation.

Language:PythonApache-2.0617300

GET3D

Language:PythonNOASSERTION419200

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT3548600

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2068900

scm573