uloveqian2021's repositories
gaiic_task2
电商领域命名实体识别
athena
An open-source implementation of sequence-to-sequence based speech processing engine
Awesome-Domain-LLM
收集和梳理垂直领域的开源模型、数据集及评测基准。
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型集合
blog_source
the source directory of my blog
CLAP
Contrastive Language-Audio Pretraining
ColossalAI-ChatGPT
Making big AI models cheaper, easier, and scalable
ctc-kws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
emformer
asr based on emformer
FeatureExtractionKWS
SP: Feature Extraction for Speech Recognition using OFA
keyword-spotting-ctc
端到端语音唤醒工具箱,从模型训练到模型推理。
landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
Medical_NLP
Medical NLP Competition, dataset, large models, paper 医疗NLP领域 比赛,数据集,大模型,论文,工具包
MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
MOSS
An open-source tool-augmented conversational language model from Fudan University
RapidASR
A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.
speech-datasets-collection
a curated list of speech datasets (105+ datasets, 70+ easy to download)
ss-vad
self-supervised vad
T2A
Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023
vits-onnx
A fast, local neural text to speech system
ViTS-TTS
SummerTTS 是一个基于C++的独立编译的中文语音合成项目,没有额外的依赖,一键编译完成即可用于中文语音合成。SummerTTS is a standalone Chinese speech synthesis(TTS) project that has almost no dependency and could be easily used for Chinese TTS with just one key build out