KMedia's repositories
hwinfo
Cross-platform C++ library for hardware information (CPU, RAM, GPU, ...)
MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
nv-codec-headers
automatic mirror of https://git.videolan.org/?p=ffmpeg/nv-codec-headers.git
TNN
TNN: developed by Tencent Youtu Lab an
DirectX-Graphics-Samples
This repo contains the DirectX Graphics samples that demonstrate how to build graphics intensive applications on Windows.
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio
onnx-models
A collection of pre-trained, state-of-the-art models in the ONNX format
onnx-tool
A parser, editor and profiler tool for ONNX models.
MockingBird
🚀 AI voice mimicry: Clone a voice in 5 seconds to generate arbitrary speech in real-time
Applio
Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.
ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
ComfyUI-Video-Matting
A minimalistic implementation of Robust Video Matting (RVM) and BRIAAI-RMBG v1.4 in ComfyUI
TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
OpenVoice
Instant voice cloning by MyShell.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few-shot voice cloning)
InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
ONNX-Models2
ONNX-Models zoo
onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
CoreML-Models
Converted CoreML Model Zoo.
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
BackgroundMattingV2
Real-Time High-Resolution Background Matting
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
AnimateDiff
Official implementation of AnimateDiff.
bark
🔊 Text-Prompted Generative Audio Model