renolynx

renolynx's starred repositories

GPT-SoVITS

As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).

Language: Python | License: MIT | Stargazers: 29596 | Issues: 189 | Issues: 974

maybe

The OS for your personal finances

Language: Ruby | License: AGPL-3.0 | Stargazers: 28827 | Issues: 149 | Issues: 299

facefusion

Next generation face swapper and enhancer

Language: Python | License: NOASSERTION | Stargazers: 16755 | Issues: 161 | Issues: 365

StableCascade

Official Code for Stable Cascade

Language: Jupyter Notebook | License: MIT | Stargazers: 6451 | Issues: 61 | Issues: 121

CogVLM

A state-of-the-art open visual language model | multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5691 | Issues: 66 | Issues: 405

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4576 | Issues: 54 | Issues: 98

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal dialogue model approaching GPT-4o performance.

Language: Python | License: MIT | Stargazers: 4320 | Issues: 43 | Issues: 357

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 4103 | Issues: 39 | Issues: 136

SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Language: Python | License: NOASSERTION | Stargazers: 3995 | Issues: 70 | Issues: 123

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook | License: MIT | Stargazers: 3607 | Issues: 72 | Issues: 96

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language: Python | License: MIT | Stargazers: 2198 | Issues: 41 | Issues: 63

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | License: Apache-2.0 | Stargazers: 2181 | Issues: 30 | Issues: 105

notesGPT

Record voice notes & transcribe, summarize, and get tasks

Language: TypeScript | License: MIT | Stargazers: 1603 | Issues: 21 | Issues: 20
Language: Python | License: NOASSERTION | Stargazers: 785 | Issues: 18 | Issues: 106

taggui

Tag manager and captioner for image datasets

Language: Python | License: GPL-3.0 | Stargazers: 521 | Issues: 9 | Issues: 145

clip-interrogator-ext

Stable Diffusion WebUI extension for CLIP Interrogator

Language: Python | License: MIT | Stargazers: 477 | Issues: 10 | Issues: 75
Language: Python | License: Apache-2.0 | Stargazers: 390 | Issues: 6 | Issues: 23

DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

freecontrol

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

sdweb-merge-block-weighted-gui

Merge models with a separate rate for each of the 25 U-Net blocks (input, middle, output). Extension for Stable Diffusion UI by AUTOMATIC1111

LECO

Low-rank adaptation for Erasing COncepts from diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 302 | Issues: 7 | Issues: 27

ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Language: Python | License: MIT | Stargazers: 224 | Issues: 17 | Issues: 26

Comfy_Dungeon

At the moment this is mostly a tech demo to show how to build a web app on top of ComfyUI

Language: JavaScript | License: Apache-2.0 | Stargazers: 194 | Issues: 10 | Issues: 2

ComfyUI-Qwen-VL-API

QWen-VL-Plus & QWen-VL-Max in ComfyUI

Language: Python | License: GPL-3.0 | Stargazers: 185 | Issues: 4 | Issues: 5

CartoonSegmentation

Instance segmentation for cartoon/anime characters and some visual techniques built around it.

Language: Jupyter Notebook | Stargazers: 128 | Issues: 9 | Issues: 5
Language: Python | License: MPL-2.0 | Stargazers: 73 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 37 | Issues: 0 | Issues: 0

img-txt_viewer

Display an image and text file side-by-side for easy manual caption editing.

Language: Python | License: CC0-1.0 | Stargazers: 34 | Issues: 1 | Issues: 9