pashanitw

open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

MIT100

my-website

Driven by nextjs, shadcnui style blog template.

Language:TypeScriptMIT100

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.0100

react-chatbotify

A modern React library for creating flexible and extensible chatbots.

Language:TypeScriptMIT100

SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Language:PythonMIT100

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

MIT100

ttts

Train the next generation of TTS systems.

Language:PythonMPL-2.0100

WebDesignAgent

An agent used for webdesign

Apache-2.0100

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Language:CApache-2.0100

pashanitw

Pasha S's repositories

llama3-and-friends-from-scratch

xeus-finetune

DictionaryByGPT4

anole

AudioNotes

auto-prompt-engineering

avatar

ControlSpeech

CosyVoice

dataspeech

DiffSynth-Studio

FlashSpeech

GPT-Talker

Kolors

LivePortrait

llamacoder

mini-omni

my-website

parler-tts

react-chatbotify

SSR-Speech

StableTTS

ttts

ultravox

VITS-Plus

WebDesignAgent

Whisper-Finetune

index-tts

LLaSA_training

Zonos