adamfils

Adam's starred repositories

OpenVoice

Instant voice cloning by MIT and MyShell.

Language:PythonMIT28244 211 227

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.010741 125 217

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonGPL-3.08788 55 502

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7372 323 263

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookApache-2.06842 97 707

dejavu

Audio fingerprinting and recognition in Python

Language:PythonMIT6397 262 242

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language:PythonNOASSERTION5354 74 197

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonMIT4342 39 158

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonApache-2.04337 45 189

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonGPL-3.04252 39 423

invisible-watermark

python library for invisible image watermark (blind image watermark)

Language:PythonMIT1563 16 29

rq-scheduler

A lightweight library that adds job scheduling capabilities to RQ (Redis Queue)

Language:PythonMIT1424 42 180

Speech-Emotion-Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Language:Jupyter NotebookMIT1287 36 62

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonApache-2.01153 18 62

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language:PythonMIT870 24 74

NeuS2

[ICCV 2023] Official code for NeuS2

Language:CudaNOASSERTION612 22 80

AnimateLCM

AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!

Language:PythonMIT566 29 33

cp-vton

Reimplemented code for "Toward Characteristic-Preserving Image-based Virtual Try-On Network"

Language:PythonMIT474 16 44

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookMIT436 10 11

OpenGPT

A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).

Language:Jupyter NotebookApache-2.0328 9 6

vampnet

music generation with masked transformers!

Language:Jupyter NotebookMIT288 8 34

frechet-audio-distance

A lightweight library for Frechet Audio Distance calculation.

Language:PythonMIT229 2 13

stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Language:PythonMIT135 30

Diffstyler

DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization

Language:Jupyter Notebook129 1 8

MORPHEUS-1

Implementation of "MORPHEUS-1" from Prophetic AI and "The world’s first multi-modal generative ultrasonic transformer designed to induce and stabilize lucid dreams. "

Language:PythonMIT126 7 2

Self-Cascade

[ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

57 8 1

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.054 10

music-text-representation-pp

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]

Language:Python17 2 2

cog_oot_diffusion

Language:Python3 10

invisible-watermark

python library for invisible image watermark (blind image watermark)

Language:PythonMIT1 10