HN410's starred repositories

gligen-gui

An intuitive GUI for GLIGEN that uses ComfyUI in the backend

Language:JavaScriptLicense:NOASSERTIONStargazers:1969Issues:0Issues:0

imgbrd-grabber

Very customizable imageboard/booru downloader with powerful filenaming features.

Language:HTMLLicense:Apache-2.0Stargazers:2430Issues:0Issues:0

obsidian-tabs

Plugin for tabbed obsidian browsing

Language:CSSStargazers:162Issues:0Issues:0

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language:PythonLicense:MITStargazers:1752Issues:0Issues:0

ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language:PythonLicense:MITStargazers:125Issues:0Issues:0

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonLicense:Apache-2.0Stargazers:3580Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29994Issues:0Issues:0

vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Language:PythonLicense:MITStargazers:468Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27668Issues:0Issues:0

Anime4K

A High-Quality Real Time Upscaler for Anime Video

Language:Jupyter NotebookLicense:MITStargazers:18073Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8317Issues:0Issues:0

Aivis-Dataset

💠 Aivis: AI Voice Imitation System

Language:PythonLicense:MITStargazers:25Issues:0Issues:0

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language:PythonLicense:AGPL-3.0Stargazers:637Issues:0Issues:0

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10269Issues:0Issues:0
Stargazers:733Issues:0Issues:0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3628Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5719Issues:0Issues:0

MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Language:PythonLicense:Apache-2.0Stargazers:1204Issues:0Issues:0

FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Language:PythonStargazers:233Issues:0Issues:0

Sel-CL

CVPR 2022: Selective-Supervised Contrastive Learning with Noisy Labels

Language:PythonStargazers:1Issues:0Issues:0

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

License:Apache-2.0Stargazers:14163Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34655Issues:0Issues:0

Emehcs

A Scheme-like language interpreter

Language:C++License:MITStargazers:13Issues:0Issues:0

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8552Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7469Issues:0Issues:0

ChartSeer

ChartSeer: Interactive Steering Exploratory Visual Analysis with Machine Intelligence

Language:JavaScriptLicense:NOASSERTIONStargazers:36Issues:0Issues:0

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2010Issues:0Issues:0

Uni-ControlNet

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Language:PythonLicense:MITStargazers:559Issues:0Issues:0

animatediff-cli

a CLI utility/library for AnimateDiff stable diffusion generation

Language:PythonLicense:Apache-2.0Stargazers:256Issues:0Issues:0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers:2922Issues:0Issues:0