HN410's starred repositories

gligen-gui

An intuitive GUI for GLIGEN that uses ComfyUI in the backend

Language: JavaScript | License: NOASSERTION | 196900

imgbrd-grabber

Very customizable imageboard/booru downloader with powerful filenaming features.

Language: HTML | License: Apache-2.0 | 243000

obsidian-tabs

Plugin for tabbed obsidian browsing

Language: CSS | 16200

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language: Python | License: MIT | 175200

ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language: Python | License: MIT | 12500

metavoice-src

Foundational model for human-like, expressive TTS

Language: Python | License: Apache-2.0 | 358000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | 2999400

vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Language: Python | License: MIT | 46800

OpenVoice

Instant voice cloning by MyShell.

Language: Python | License: MIT | 2766800

Anime4K

A High-Quality Real Time Upscaler for Anime Video

Language: Jupyter Notebook | License: MIT | 1807300

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | 831700

Aivis-Dataset

💠 Aivis: AI Voice Imitation System

Language: Python | License: MIT | 2500

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language: Python | License: AGPL-3.0 | 63700

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python | License: BSD-3-Clause | 1026900

DragNUWA

73300

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook | License: MIT | 362800

CogVLM

a state-of-the-art-level open visual language model | multimodal pretrained model

Language: Python | License: Apache-2.0 | 571900

MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Language: Python | License: Apache-2.0 | 120400

FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Language: Python | 23300

Sel-CL

CVPR 2022: Selective-Supervised Contrastive Learning with Noisy Labels

Language: Python | 100

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

License: Apache-2.0 | 1416300

MockingBird

🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | 3465500

Emehcs

A Scheme-like language interpreter

Language: C++ | License: MIT | 1300

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | 855200

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io

Language: Python | License: MIT | 746900

ChartSeer

ChartSeer: Interactive Steering Exploratory Visual Analysis with Machine Intelligence

Language: JavaScript | License: NOASSERTION | 3600

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language: Python | License: Apache-2.0 | 201000

Uni-ControlNet

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Language: Python | License: MIT | 55900

animatediff-cli

a CLI utility/library for AnimateDiff stable diffusion generation

Language: Python | License: Apache-2.0 | 25600

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

292200