rogue yogi (rogue-yogi)

rogue-yogi

Geek Repo

Company:sync. labs

Location:SF

Home Page:prady@pradym.xyz

Twitter:@therealprady

Github PK Tool:Github PK Tool


Organizations
synchronicityAI

rogue yogi's starred repositories

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language:PythonLicense:Apache-2.0Stargazers:1636Issues:0Issues:0

nhost

The Open Source Firebase Alternative with GraphQL.

Language:TypeScriptLicense:MITStargazers:7726Issues:0Issues:0

black

The uncompromising Python code formatter

Language:PythonLicense:MITStargazers:38095Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonLicense:MITStargazers:7618Issues:0Issues:0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:PythonStargazers:2108Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27708Issues:0Issues:0

Devon

Devon: An open-source pair programmer

Language:PythonLicense:AGPL-3.0Stargazers:2704Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:11638Issues:0Issues:0

facefusion

Next generation face swapper and enhancer

Language:PythonLicense:NOASSERTIONStargazers:16948Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:557Issues:0Issues:0

roop

one-click face swap

Language:PythonLicense:GPL-3.0Stargazers:25880Issues:0Issues:0

DR2_Drgradation_Remover

DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration. CVPR 2023.

Language:PythonLicense:MITStargazers:77Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3336Issues:0Issues:0

wing

A programming language for the cloud ☁️ A unified programming model, combining infrastructure and runtime code into one language ⚡

Language:TypeScriptLicense:NOASSERTIONStargazers:4819Issues:0Issues:0

nvidia-container-toolkit

Build and run containers leveraging NVIDIA GPUs

Language:GoLicense:Apache-2.0Stargazers:2017Issues:0Issues:0

cog

Containers for machine learning

Language:PythonLicense:Apache-2.0Stargazers:7539Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:21774Issues:0Issues:0

coffee

Build and iterate on your UI 10x faster with AI - right from your own IDE ☕️

Language:PythonLicense:Apache-2.0Stargazers:1408Issues:0Issues:0

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonLicense:Apache-2.0Stargazers:6558Issues:0Issues:0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonLicense:NOASSERTIONStargazers:14591Issues:0Issues:0

scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Language:PythonLicense:Apache-2.0Stargazers:11457Issues:0Issues:0

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonLicense:NOASSERTIONStargazers:1138Issues:0Issues:0

phonemizer

Simple text to phones converter for multiple languages

Language:PythonLicense:GPL-3.0Stargazers:1163Issues:0Issues:0
Language:PythonStargazers:946Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32410Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18476Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10611Issues:0Issues:0