Fu-An Chao's starred repositories

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python | Apache-2.0 | 1069300

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language: Python | NOASSERTION | 148200

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Language: Python | 122100

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Language: Python | Apache-2.0 | 424000

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python | Apache-2.0 | 115300

SPMamba

Language: Python | Apache-2.0 | 12700

mamba

Mamba SSM architecture

Language: Python | Apache-2.0 | 1320100

dysarthria-gop

Language: Python | MIT | 2000

self-supervised-phone-segmentation

Phoneme segmentation using pre-trained speech models

Language: Python | GPL-3.0 | 5200

pheme

Language: Python | CC-BY-4.0 | 25200

portfolYOU

A beautiful portfolio Jekyll theme that works with GitHub Pages.

Language: HTML | MIT | 98900

articulatory

Deep Articulatory Synthesis and Inversion

Language: Python | Apache-2.0 | 4300

accent-recog-slt2022

Repository for Accent Recognition (Hackathon @SLT2022)

Language: Jupyter Notebook | MIT | 2200

SB_loss_PA

This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).

Language: Python | BSD-3-Clause | 1500

INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

MIT | 64100

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | BSD-2-Clause | 1246500

gop-dnn-epadb

Goodness of Pronunciation using Kaldi on Epa-DB database

Language: Python | 3300

python-audio-effects

Apply audio effects such as reverb and EQ directly to audio files or NumPy ndarrays.

Language: Python | MIT | 38500

SpeechPrompt

**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm

Language: Python | 9700