trappedinspacetime

Kenn's starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT26447 202 190

facefusion

Next generation face swapper and enhancer

Language:PythonNOASSERTION15569 145 324

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause9990 104 139

pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Language:PythonMIT9656 43 389

kornia

Geometric Computer Vision Library for Spatial AI

Language:PythonApache-2.09526 129 894

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonApache-2.09092 75 98

pytorch-cnn-visualizations

Pytorch implementation of convolutional neural network visualization techniques

Language:PythonMIT7736 115 106

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT5440 35 861

vespa

AI + Data, online. https://vespa.ai

Language:JavaApache-2.05415 158 937

TranslateProject

Linux中国翻译项目

Language:ShellApache-2.02223 164 311

Celestia

Real-time 3D visualization of space.

Language:C++GPL-2.01738 62 542

whisper-plus

WhisperPlus: Faster, Smarter, and More Capable 🚀

Language:PythonApache-2.01517 18 42

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonMIT988 15 28

OpenLRM

An open-source impl. of Large Reconstruction Models

Language:PythonApache-2.0812 27 46

Wine-Builds

Wine builds (Vanilla, Staging, TkG and Proton)

Language:ShellMIT615 23 112

UniControl

Unified Controllable Visual Generation Model

Language:PythonApache-2.0584 19 27

HIPT

Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)

Language:Jupyter NotebookNOASSERTION469 11 69

xtts-webui

Webui for using XTTS and for finetuning it

Language:PythonMIT372 14 68

EAT_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Language:PythonNOASSERTION227 10 26

UDiffText

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Language:PythonMIT173 9 10

Bring DirectX to Linux! This is a Open Source DirectX implementation for Linux, providing native support for DirectX-based applications and games, without relying on Wine's Windows compatibility layer.

Language:C++MIT166 7 7

StoryTTS

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

Language:HTMLNOASSERTION126 18 1

pywhispercpp

Python bindings for whisper.cpp

Language:C++MIT124 6 20

SECap

Language:Python91 2 4

xtts_v2

Language:Python52 2 4

YTTTS

The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions

Language:PythonMIT46 50

tacotron2tr

tacotron2 turkish updates

Language:Python4 20

GTK4PythonExamples

Are you searching for GTK4 Examples in Python3? You are right here!

NOASSERTION4 10