Doron Adler's starred repositories

difftastic

a structural diff that understands syntax 🟥🟩

Language:RustLicense:MITStargazers:19823Issues:61Issues:565

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:17356Issues:158Issues:274

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:11225Issues:79Issues:468

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:6991Issues:87Issues:104

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5611Issues:37Issues:70

ComfyUI-Workflows-ZHO

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3496Issues:70Issues:93

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonLicense:Apache-2.0Stargazers:3335Issues:174Issues:92

Applite

User-friendly GUI macOS application for Homebrew Casks

Language:SwiftLicense:MITStargazers:3249Issues:14Issues:39

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:2881Issues:28Issues:890

StableSwarmUI

StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1788Issues:17Issues:41

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Language:PythonLicense:MITStargazers:1259Issues:17Issues:48

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Language:PythonLicense:Apache-2.0Stargazers:895Issues:42Issues:34

live555

A mirror of the live555 source code.

Language:C++License:LGPL-3.0Stargazers:726Issues:56Issues:40

OMG

OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

depth-fm

DepthFM: Fast Monocular Depth Estimation with Flow Matching

Language:Jupyter NotebookLicense:MITStargazers:284Issues:9Issues:18

Smooth-Diffusion

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Language:PythonLicense:MITStargazers:273Issues:22Issues:12

swiftui-model3dview

Render 3d models with SwiftUI effortlessly

Language:SwiftLicense:MITStargazers:210Issues:1Issues:4

transformer-heads

Toolkit for attaching, training, saving and loading of new heads for transformer models

Language:Jupyter NotebookLicense:MITStargazers:207Issues:5Issues:1

tryondiffusion

PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Google

Language:PythonLicense:MITStargazers:135Issues:0Issues:0

grog

Gradio UI for a Cog API

Language:PythonLicense:MITStargazers:57Issues:2Issues:2

StyleSketch

official repository of StyleSketch

facecap

Babylon.js + Mediapipe face capture

Language:JavaScriptLicense:MITStargazers:53Issues:2Issues:1

hebrew_whisper

Hebrew whisper powerful transcription and translation tool

Language:PythonLicense:MITStargazers:44Issues:0Issues:0

diffusers_ddim_inversion

A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space

Language:PythonLicense:CC0-1.0Stargazers:39Issues:1Issues:0

notebooks

A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).

Language:Jupyter NotebookStargazers:32Issues:2Issues:1

live555-simple-demo-4-iOS

Build the live555 to an static library in Xcode for an iOS application.

Language:C++License:LGPL-2.1Stargazers:11Issues:2Issues:1