i-MaTh

i-MaTh

Geek Repo

Company:East China Normal University

Location:Shanghai

Github PK Tool:Github PK Tool

i-MaTh's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:30394Issues:170Issues:497

LLM101n

LLM101n: Let's build a Storyteller

openai-python

The official Python library for the OpenAI API

Language:PythonLicense:Apache-2.0Stargazers:21913Issues:295Issues:745

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language:PythonLicense:MITStargazers:16656Issues:139Issues:245

moshi

A modern JSON library for Kotlin and Java.

Language:KotlinLicense:Apache-2.0Stargazers:9684Issues:183Issues:870

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:7390Issues:61Issues:329

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6906Issues:63Issues:20

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:4532Issues:49Issues:317

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3472Issues:64Issues:98

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:3259Issues:40Issues:163

dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2420Issues:35Issues:46

llm_interview_note

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonLicense:NOASSERTIONStargazers:1741Issues:25Issues:46

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1123Issues:43Issues:26

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:1099Issues:37Issues:46

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonLicense:Apache-2.0Stargazers:502Issues:17Issues:70

MassTTS

a TTS demo for training new characters.

Language:PythonLicense:Apache-2.0Stargazers:434Issues:8Issues:8
Language:PythonLicense:NOASSERTIONStargazers:416Issues:6Issues:42

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language:PythonLicense:MITStargazers:186Issues:8Issues:2

MelSpecVAE

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

Language:Jupyter NotebookLicense:MITStargazers:126Issues:4Issues:6
Language:Jupyter NotebookStargazers:119Issues:0Issues:0

Speculative-Sampling

Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind

Language:PythonLicense:MITStargazers:67Issues:2Issues:0

detail_tts

All generative model in one for better TTS model

tune_tortoise_autoregressor

Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.

Language:PythonLicense:Apache-2.0Stargazers:15Issues:3Issues:0

audio-flamingo

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Language:PythonLicense:MITStargazers:2Issues:0Issues:0