i-MaTh

i-MaTh's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonAGPL-3.030394 170 497

LLM101n

LLM101n: Let's build a Storyteller

27960 19950

openai-python

The official Python library for the OpenAI API

Language:PythonApache-2.021913 295 745

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language:PythonMIT16656 139 245

moshi

A modern JSON library for Kotlin and Java.

Language:KotlinApache-2.09684 183 870

fish-speech

Brand new TTS solution

Language:PythonNOASSERTION7390 61 329

corenet

CoreNet: A library for training deep neural networks

Language:PythonNOASSERTION6906 63 20

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonApache-2.04532 49 317

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonMIT3472 64 98

ng-video-lecture

Language:Python3403 53 27

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonNOASSERTION3259 40 163

dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

3145 23 8

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookAGPL-3.02420 35 46

llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

Language:HTML2405 10 6

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonNOASSERTION1741 25 46

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonMIT1123 43 26

dclm

DataComp for Language Models

Language:HTMLMIT1099 37 46

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:Python795 11 22

ai-voice-cloning

Language:PythonGPL-3.0532 18 126

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonApache-2.0502 17 70

TeleSpeech-ASR

Language:Python449 13 44

MassTTS

a TTS demo for training new characters.

Language:PythonApache-2.0434 8 8

DMD2

Language:PythonNOASSERTION416 6 42

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language:PythonMIT186 8 2

MelSpecVAE

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

Language:Jupyter NotebookMIT126 4 6

LoRA-GA

Language:Jupyter Notebook11900

Speculative-Sampling

Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind

Language:PythonMIT67 20

detail_tts

All generative model in one for better TTS model

Language:Python62 3 1

tune_tortoise_autoregressor

Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.

Language:PythonApache-2.015 30

audio-flamingo

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Language:PythonMIT200