Othanse's starred repositories

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:6516Issues:0Issues:0

MoviePilot

NAS媒体库自动化管理工具

Language:PythonLicense:GPL-3.0Stargazers:5666Issues:0Issues:0

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language:PythonLicense:Apache-2.0Stargazers:2803Issues:0Issues:0

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonLicense:Apache-2.0Stargazers:3699Issues:0Issues:0
Language:PythonLicense:AGPL-3.0Stargazers:5454Issues:0Issues:0

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:38719Issues:0Issues:0

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language:PythonLicense:MITStargazers:1066Issues:0Issues:0

zest_code

This is the official implementation of ZeST

Language:Jupyter NotebookLicense:MITStargazers:330Issues:0Issues:0

onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Language:PythonLicense:Apache-2.0Stargazers:1480Issues:0Issues:0

oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

Language:C++License:Apache-2.0Stargazers:5811Issues:0Issues:0

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:59425Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:15278Issues:0Issues:0
Language:PythonLicense:MITStargazers:4105Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27599Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11315Issues:0Issues:0

Stable-Diffusion-WebUI-TensorRT

TensorRT Extension for Stable Diffusion Web UI

Language:PythonLicense:MITStargazers:1859Issues:0Issues:0

sdwebuiapi

Python API client for AUTOMATIC1111/stable-diffusion-webui

Language:Jupyter NotebookLicense:MITStargazers:1321Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7460Issues:0Issues:0

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookLicense:MITStargazers:2622Issues:0Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:51656Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34622Issues:0Issues:0

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonLicense:NOASSERTIONStargazers:4767Issues:0Issues:0

emotional-vits

无需情感标注的情感可控语音合成模型,基于VITS

Language:Jupyter NotebookLicense:MITStargazers:1281Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7563Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20299Issues:0Issues:0

audioFlux

A library for audio and music analysis, feature extraction.

Language:CLicense:MITStargazers:2104Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:129794Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29783Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:21480Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33961Issues:0Issues:0