rogue-yogi

followers

following

stars

sync. labs

SF

prady@pradym.xyz

Organizations

synchronicityAI

rogue yogi's starred repositories

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language:PythonApache-2.0163600

nhost

The Open Source Firebase Alternative with GraphQL.

Language:TypeScriptMIT772600

black

The uncompromising Python code formatter

Language:PythonMIT3809500

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonMIT761800

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:Python210800

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT2770800

Devon

Devon: An open-source pair programmer

Language:PythonAGPL-3.0270400

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookMIT1163800

facefusion

Next generation face swapper and enhancer

Language:PythonNOASSERTION1694800

translation-starter

Language:TypeScriptMIT55700

roop

one-click face swap

Language:PythonGPL-3.02588000

DR2_Drgradation_Remover

DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration. CVPR 2023.

Language:PythonMIT7700

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonMIT333600

wing

A programming language for the cloud ☁️ A unified programming model, combining infrastructure and runtime code into one language ⚡

Language:TypeScriptNOASSERTION481900

nvidia-container-toolkit

Build and run containers leveraging NVIDIA GPUs

Language:GoApache-2.0201700

cog

Containers for machine learning

Language:PythonApache-2.0753900

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonMIT2177400

coffee

Build and iterate on your UI 10x faster with AI - right from your own IDE ☕️

Language:PythonApache-2.0140800

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonApache-2.0655800

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonNOASSERTION1459100

scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Language:PythonApache-2.01145700

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonNOASSERTION113800

phonemizer

Simple text to phones converter for multiple languages

Language:PythonGPL-3.0116300

speech-driven-animation

Language:Python94600

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03241000

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.01847600

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION1061100