MXuer

followers

following

stars

MXuer's repositories

asr-notes

平时学习工作的笔记

Language:Roff1 10

asr-work-mini

For my son, do asr and nlu annotation works.

Language:PythonApache-2.01 10

chinese-asr-kaldi-and-other

Start now, first build a model for chinese from commonvoice, then use keras to build end2end model, keep updating

Language:Python1 20

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.0000

asr-notes-e2e

端到端语音识别相关的一些笔记

010

mms-alignment-tools

using MMS to do the audio-transcript alignment

Language:Python010

books-notes

一些读书笔记

010

codes

learning codes for python, C++, and speech recognition

Language:Python020

documents_llama

Language:Jupyter NotebookApache-2.0010

draw-e2e-arch

端到端语音识别模型的结构图

Apache-2.0010

Federated-learning-ASR

Language:PythonMIT000

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonMIT000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

NOASSERTION000

git-flight-rules

Flight rules for git

CC-BY-SA-4.0000

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Language:PythonMIT010

keras-resources

Directory of tutorials and open-source code repositories for working with Keras, the Python deep learning library

020

Kindle_download_helper

Download all your kindle books script.

Language:PythonGPL-3.0000

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonApache-2.0000

mini-asr

code practice for asr models including las, ctc, rnn-t and others.

Apache-2.0010

notes-for-notes

记一些笔记。

Apache-2.0010

notesbooks

日常工作中用到的一些小的活，用jupyter notebook干的

Apache-2.0010

pinyin-data

汉字拼音数据

Language:PythonMIT010

reading-paper-notes

notes for paper reading

010

sft_datacollections

010

speech-recognition-papers

Towards hot directions in industrial speech recognition

MIT000

speechocean762

A dataset for pronunciation scoring tasks.

000

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Language:Python000

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonApache-2.0000

whisper-asr-finetune

Language:PythonMIT000

whisper-eval

用Whisper不同的模型，在不同语种、不同测试集上的效果。

Language:PythonApache-2.0010