KMedia's repositories

hwinfo

cross platform C++ library for hardware information (CPU, RAM, GPU, ...)

License:MITStargazers:0Issues:0Issues:0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

License:Apache-2.0Stargazers:0Issues:0Issues:0

RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

License:GPL-3.0Stargazers:0Issues:0Issues:0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

License:MITStargazers:0Issues:0Issues:0

nv-codec-headers

automatic mirror of https://git.videolan.org/?p=ffmpeg/nv-codec-headers.git

Stargazers:0Issues:0Issues:0

TNN

TNN: developed by Tencent Youtu Lab an

License:NOASSERTIONStargazers:0Issues:0Issues:0

DirectX-Graphics-Samples

This repo contains the DirectX Graphics samples that demonstrate how to build graphics intensive applications on Windows.

License:MITStargazers:0Issues:0Issues:0

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

License:NOASSERTIONStargazers:0Issues:0Issues:0

onnx-models

A collection of pre-trained, state-of-the-art models in the ONNX format

License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx-tool

A parser, editor and profiler tool for ONNX models.

License:MITStargazers:0Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0

Applio

Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.

License:NOASSERTIONStargazers:0Issues:0Issues:0

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

License:GPL-3.0Stargazers:0Issues:0Issues:0

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

License:GPL-3.0Stargazers:0Issues:0Issues:0

ComfyUI-Video-Matting

A minimalistic implementation of Robust Video Matting (RVM) and BRAIAI-RVMBG v1.4 in ComfyUI

License:GPL-3.0Stargazers:0Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

License:NOASSERTIONStargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

License:MITStargazers:0Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

License:Apache-2.0Stargazers:0Issues:0Issues:0

ONNX-Models2

ONNX-Models zoo

Stargazers:0Issues:0Issues:0

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

License:MITStargazers:0Issues:0Issues:0

CoreML-Models

Converted CoreML Model Zoo.

Stargazers:0Issues:0Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

License:Apache-2.0Stargazers:0Issues:0Issues:0

BackgroundMattingV2

Real-Time High-Resolution Background Matting

License:MITStargazers:0Issues:0Issues:0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

License:NOASSERTIONStargazers:0Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

License:Apache-2.0Stargazers:0Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

License:MITStargazers:0Issues:0Issues:0