Melon's repositories
CIF-PyTorch
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
CIF-HieraDist
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
4675-Scifi-Collections
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
best-of-streamlit
🏆 A ranked gallery of awesome streamlit apps built by the community
ChatGLM-Finetuning
基于ChatGLM-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning等
espnet-asrtts
ASR-TTS experiments based on espnet. recipe for librispeech available
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Lora-Diffusion-Models
Using Low-rank adaptation to quickly fine-tune diffusion models.
streamlit-webrtc
Real-time video and audio streams over the network, with Streamlit.
streamlit_audio_recorder
Streamlit Custom Component that enables recording audio from the client's mic in apps that are deployed to the web. (via browser Media-API, REACT-based)
VideosShareByAliyun
🤪动漫、电视剧、电影、纪录片分享by阿里云盘🫡
Books-Free-Books
Free Books
meeting_summarization_dataset
Codes for processing meeting summarization datasets AMI and ICSI.
minimal-light
A simple and elegant Jekyll theme for an academic personal homepage
MSCOCO-CN
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Temperature-Scaling
A simple way to calibrate your neural network.