George Grigorev (thepowerfuldeez)

thepowerfuldeez

Geek Repo

Location:London

Github PK Tool:Github PK Tool

George Grigorev's starred repositories

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:29177Issues:216Issues:530

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10556Issues:123Issues:205

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:10204Issues:125Issues:654

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9307Issues:76Issues:454

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

TensorRT

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Language:PythonLicense:BSD-3-ClauseStargazers:2431Issues:69Issues:1445

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language:PythonLicense:MITStargazers:1495Issues:30Issues:48

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1495Issues:27Issues:174

auto-subtitle

Automatically generate and overlay subtitles for any video.

Language:PythonLicense:MITStargazers:1343Issues:17Issues:63

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonLicense:MITStargazers:1122Issues:16Issues:35

CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Language:PythonLicense:MITStargazers:1012Issues:14Issues:18

hyperstyle

Official Implementation for "HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing" (CVPR 2022) https://arxiv.org/abs/2111.15666

Language:PythonLicense:MITStargazers:1001Issues:28Issues:80

PTI

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Language:Jupyter NotebookLicense:MITStargazers:891Issues:23Issues:57
Language:PythonLicense:Apache-2.0Stargazers:846Issues:39Issues:62

Cartoon-StyleGAN

Fine-tuning StyleGAN2 for Cartoon Face Generation

Language:Jupyter NotebookStargazers:633Issues:19Issues:19
Language:PythonLicense:GPL-3.0Stargazers:453Issues:11Issues:52

ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

Language:PythonLicense:NOASSERTIONStargazers:282Issues:14Issues:0

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Language:C++License:Apache-2.0Stargazers:246Issues:15Issues:22

NeuralNeighborStyleTransfer

Optimization based style transfer

Language:PythonLicense:MITStargazers:244Issues:10Issues:12

RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - CVPR 2024 - Official Repo

Language:PythonLicense:MITStargazers:215Issues:8Issues:15

ml_paper_club

A repository of papers that have been presented at nPlan's machine learning paper club

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonLicense:MITStargazers:201Issues:15Issues:40

Solving_ImageNet

Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)

Language:PythonLicense:Apache-2.0Stargazers:190Issues:5Issues:6

candlestick_retriever

Retrieve all historical candlestick data from crypto exchange Binance and upload it to Kaggle.

Language:PythonLicense:GPL-3.0Stargazers:154Issues:8Issues:18

SingingVocoders

A collection of neural vocoders suitable for singing voice synthesis tasks.

Language:PythonLicense:MITStargazers:83Issues:3Issues:8

MelHuBERT

Official implementation of MelHuBERT

Language:PythonLicense:MITStargazers:56Issues:4Issues:3

fantasticstyles

Repository for Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs