Kenn (trappedinspacetime)

trappedinspacetime

Geek Repo

Company:For Personal Use

Location:Istanbul

Github PK Tool:Github PK Tool

Kenn's starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:26447Issues:202Issues:190

facefusion

Next generation face swapper and enhancer

Language:PythonLicense:NOASSERTIONStargazers:15569Issues:145Issues:324

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:9990Issues:104Issues:139

pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Language:PythonLicense:MITStargazers:9656Issues:43Issues:389

kornia

Geometric Computer Vision Library for Spatial AI

Language:PythonLicense:Apache-2.0Stargazers:9526Issues:129Issues:894

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9092Issues:75Issues:98

pytorch-cnn-visualizations

Pytorch implementation of convolutional neural network visualization techniques

Language:PythonLicense:MITStargazers:7736Issues:115Issues:106

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5440Issues:35Issues:861

vespa

AI + Data, online. https://vespa.ai

Language:JavaLicense:Apache-2.0Stargazers:5415Issues:158Issues:937

TranslateProject

Linux中国翻译项目

Language:ShellLicense:Apache-2.0Stargazers:2223Issues:164Issues:311

Celestia

Real-time 3D visualization of space.

Language:C++License:GPL-2.0Stargazers:1738Issues:62Issues:542

whisper-plus

WhisperPlus: Faster, Smarter, and More Capable 🚀

Language:PythonLicense:Apache-2.0Stargazers:1517Issues:18Issues:42

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonLicense:MITStargazers:988Issues:15Issues:28

OpenLRM

An open-source impl. of Large Reconstruction Models

Language:PythonLicense:Apache-2.0Stargazers:812Issues:27Issues:46

Wine-Builds

Wine builds (Vanilla, Staging, TkG and Proton)

Language:ShellLicense:MITStargazers:615Issues:23Issues:112

UniControl

Unified Controllable Visual Generation Model

Language:PythonLicense:Apache-2.0Stargazers:584Issues:19Issues:27

HIPT

Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:469Issues:11Issues:69

xtts-webui

Webui for using XTTS and for finetuning it

Language:PythonLicense:MITStargazers:372Issues:14Issues:68
Language:PythonLicense:NOASSERTIONStargazers:271Issues:14Issues:0

EAT_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Language:PythonLicense:NOASSERTIONStargazers:227Issues:10Issues:26

UDiffText

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Language:PythonLicense:MITStargazers:173Issues:9Issues:10

OpenDX

Bring DirectX to Linux! This is a Open Source DirectX implementation for Linux, providing native support for DirectX-based applications and games, without relying on Wine's Windows compatibility layer.

Language:C++License:MITStargazers:166Issues:7Issues:7

StoryTTS

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

Language:HTMLLicense:NOASSERTIONStargazers:126Issues:18Issues:1

pywhispercpp

Python bindings for whisper.cpp

Language:C++License:MITStargazers:124Issues:6Issues:20

YTTTS

The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions

Language:PythonLicense:MITStargazers:46Issues:5Issues:0

tacotron2tr

tacotron2 turkish updates

Language:PythonStargazers:4Issues:2Issues:0

GTK4PythonExamples

Are you searching for GTK4 Examples in Python3? You are right here!

License:NOASSERTIONStargazers:4Issues:1Issues:0