KMedia's repositories

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

CoreML-Models

Converted CoreML Model Zoo.

Stargazers:0Issues:0Issues:0

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Applio

Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Awesome-GitHub-Repo

收集整理 GitHub 上高质量、有趣的开源项目。

License:CC0-1.0Stargazers:0Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

License:MITStargazers:0Issues:0Issues:0

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

License:GPL-3.0Stargazers:0Issues:0Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

License:Apache-2.0Stargazers:0Issues:0Issues:0

facefusion

Next generation face swapper and enhancer

Stargazers:0Issues:0Issues:0

ffmpeg-apple-arm64-build

Build script for ffmpeg targeting the latest open source video codecs running on macOS using Apple's M1 processor.

Stargazers:0Issues:0Issues:0

freeswitch

FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.

License:NOASSERTIONStargazers:0Issues:0Issues:0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

License:MITStargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

License:MITStargazers:0Issues:0Issues:0

image-colorization-api

Image Colorization Service using Deep Learning is a repository that provides an API for colorizing black and white images using U-Net and conditional GAN models trained on the COCO dataset, with support for batch processing, dataset expansion, model experiments, and efficient inference using ONNX format.

Stargazers:0Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

lite.ai.toolkit

🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOv5, YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOX, YOLOv7, YOLOv8. MNN, NCNN, TNN, ONNXRuntime.

License:GPL-3.0Stargazers:0Issues:0Issues:0

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

media-server-1

librtsp/librtmp/libmpeg/libhls/librtp

Language:CStargazers:0Issues:0Issues:0

netron

Visualizer for neural network, deep learning and machine learning models

License:MITStargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription. Designed for real-time applications like voice assistants.

Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

stable-diffusion

A latent text-to-image diffusion model

License:NOASSERTIONStargazers:0Issues:0Issues:0

stable-diffusion.cpp

Stable Diffusion in pure C/C++

License:MITStargazers:0Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

License:Apache-2.0Stargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License:MPL-2.0Stargazers:0Issues:0Issues:0

XRG

System monitor for macOS.

License:GPL-2.0Stargazers:0Issues:0Issues:0