Othanse

followers

following

stars

Othanse's starred repositories

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:Shell651600

MoviePilot

NAS媒体库自动化管理工具

Language:PythonGPL-3.0566600

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language:PythonApache-2.0280300

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonApache-2.0369900

stable-diffusion-webui-forge

Language:PythonAGPL-3.0545400

Fooocus

Focus on prompting and generating

Language:PythonGPL-3.03871900

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language:PythonMIT106600

zest_code

This is the official implementation of ZeST

Language:Jupyter NotebookMIT33000

onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Language:PythonApache-2.0148000

oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

Language:C++Apache-2.0581100

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonGPL-3.05942500

MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonMIT1527800

TripoSR

Language:PythonMIT410500

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT2759900

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION1131500

Stable-Diffusion-WebUI-TensorRT

TensorRT Extension for Stable Diffusion Web UI

Language:PythonMIT185900

sdwebuiapi

Python API client for AUTOMATIC1111/stable-diffusion-webui

Language:Jupyter NotebookMIT132100

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT746000

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookMIT262200

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION5165600

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION3462200

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonNOASSERTION476700

emotional-vits

无需情感标注的情感可控语音合成模型，基于VITS

Language:Jupyter NotebookMIT128100

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonAGPL-3.0756300

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2029900

audioFlux

A library for audio and music analysis, feature extraction.

Language:CMIT210400

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.012979400

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT2978300

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonMIT2148000

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT3396100