jinghao666

followers

following

stars

KMedia's repositories

hwinfo

cross platform C++ library for hardware information (CPU, RAM, GPU, ...)

MIT000

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Apache-2.0000

RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

GPL-3.0000

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

MIT000

nv-codec-headers

automatic mirror of https://git.videolan.org/?p=ffmpeg/nv-codec-headers.git

000

TNN

TNN: developed by Tencent Youtu Lab an

NOASSERTION000

DirectX-Graphics-Samples

This repo contains the DirectX Graphics samples that demonstrate how to build graphics intensive applications on Windows.

MIT000

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

NOASSERTION000

onnx-models

A collection of pre-trained, state-of-the-art models in the ONNX format

Apache-2.0000

onnx-tool

A parser, editor and profiler tool for ONNX models.

MIT000

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

Applio

Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.

NOASSERTION000

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

GPL-3.0000

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

GPL-3.0000

ComfyUI-Video-Matting

A minimalistic implementation of Robust Video Matting (RVM) and BRAIAI-RVMBG v1.4 in ComfyUI

GPL-3.0000

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Apache-2.0000

OpenVoice

Instant voice cloning by MyShell.

NOASSERTION000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

MIT000

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Apache-2.0000

ONNX-Models2

ONNX-Models zoo

000

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

MIT000

CoreML-Models

Converted CoreML Model Zoo.

000

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Apache-2.0000

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

BSD-3-Clause000

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

BSD-3-Clause000

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Apache-2.0000

BackgroundMattingV2

Real-Time High-Resolution Background Matting

MIT000

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

NOASSERTION000

AnimateDiff

Official implementation of AnimateDiff.

Apache-2.0000

bark

🔊 Text-Prompted Generative Audio Model

MIT000