Pasha S (pashanitw)

pashanitw

User data from Github https://github.com/pashanitw

0

followers

0

following

0

stars

Location:Hyderabad, India

GitHub:@pashanitw

Twitter:@psk90_ai

Pasha S's repositories

DictionaryByGPT4

一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:2Issues:0Issues:0

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Stargazers:1Issues:0Issues:0

AudioNotes

快速提取音视频内容,整理成一份结构化的markdown笔记

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

avatar

AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval (https://arxiv.org/abs/2406.11200)

Language:PythonStargazers:1Issues:0Issues:0

ControlSpeech

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Language:PythonStargazers:1Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

FlashSpeech

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Stargazers:1Issues:0Issues:0

GPT-Talker

Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)

Language:PythonStargazers:1Issues:0Issues:0

Kolors

Kolors Team

License:Apache-2.0Stargazers:1Issues:0Issues:0

LivePortrait

Bring portraits to life!

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

llamacoder

Open source Claude Artifacts – built with Llama 3.1 405B

Language:TypeScriptStargazers:1Issues:0Issues:0

mini-omni

open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

License:MITStargazers:1Issues:0Issues:0

my-website

Driven by nextjs, shadcnui style blog template.

Language:TypeScriptLicense:MITStargazers:1Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

react-chatbotify

A modern React library for creating flexible and extensible chatbots.

Language:TypeScriptLicense:MITStargazers:1Issues:0Issues:0

SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

License:MITStargazers:1Issues:0Issues:0

ttts

Train the next generation of TTS systems.

Language:PythonLicense:MPL-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:1Issues:0

WebDesignAgent

An agent used for webdesign

License:Apache-2.0Stargazers:1Issues:0Issues:0

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Language:CLicense:Apache-2.0Stargazers:1Issues:0Issues:0

index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Stargazers:0Issues:0Issues:0

LLaSA_training

LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis

Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0