zhifu gao (LauraGPT)

LauraGPT

Geek Repo

Company:alibaba

Github PK Tool:Github PK Tool


Organizations
FunAudioLLM

zhifu gao's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40167Issues:393Issues:1291

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:30167Issues:190Issues:1003

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:28325Issues:187Issues:4463

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:12067Issues:135Issues:197

ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Language:PythonLicense:Apache-2.0Stargazers:9350Issues:89Issues:116

LeetCode-Book

《剑指 Offer》 Python, Java, C++ 解题代码,LeetBook《图解算法数据结构》配套代码仓

Language:JavaLicense:NOASSERTIONStargazers:5587Issues:46Issues:7

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5253Issues:55Issues:982

stock

30天掌握量化交易 (持续更新)

Language:PythonLicense:BSD-3-ClauseStargazers:4983Issues:261Issues:30

FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Language:PythonLicense:MITStargazers:2906Issues:29Issues:74

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++License:Apache-2.0Stargazers:2661Issues:43Issues:407

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language:CudaLicense:Apache-2.0Stargazers:1090Issues:76Issues:374

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language:PythonLicense:Apache-2.0Stargazers:982Issues:17Issues:85

TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Language:PythonLicense:Apache-2.0Stargazers:918Issues:31Issues:207

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language:C++License:Apache-2.0Stargazers:785Issues:37Issues:232

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

TensorflowASR

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1

Language:PythonLicense:Apache-2.0Stargazers:460Issues:22Issues:49

ai00_server

A localized open-source AI server that is better than ChatGPT.

Language:RustLicense:MITStargazers:448Issues:15Issues:63

BitcoinForecast

Predict bitcoin price with deep learning

EasyParallelLibrary

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Language:PythonLicense:Apache-2.0Stargazers:257Issues:13Issues:9

Squeezeformer

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Language:PythonLicense:Apache-2.0Stargazers:241Issues:15Issues:4

MyQuant

我的量化交易学习实践代码

warp-rnnt

CUDA-Warp RNN-Transducer

Language:PythonLicense:MITStargazers:211Issues:9Issues:34

transducer

A Fast Sequence Transducer Implementation with PyTorch Bindings

Language:C++License:Apache-2.0Stargazers:195Issues:9Issues:15

e2e_lfmmi

E2E system with LF-MMI; word N-gram for Mandarin

LearnLibTorch

LibTorch 中文教程。

Language:PythonLicense:MITStargazers:66Issues:2Issues:2

AliParaformerAsr

c# library for decoding paraformer, sensevoice Models,used in speech recognition (ASR)

Language:C#License:Apache-2.0Stargazers:21Issues:1Issues:7

AliCTTransformerPunc

c# library for decoding CTTransformer punc models, which can add punctuation to Chinese and English texts

Language:C#Stargazers:6Issues:1Issues:0