Ye Bai's starred repositories
int_fastdiv
Fast integer division with a divisor not known at compile time, intended primarily for use in CUDA kernels.
ReazonSpeech
Massive open Japanese speech corpus
LLMBook-zh.github.io
"Large Language Models" (《大语言模型》), by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong
Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
everyone-can-use-english
Everyone Can Use English
HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
forcealign
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
SecurityInterviewGuide
Interview guide for network and information security practitioners
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
Awesome_Modern_Hopfield_Networks
Paper list for Modern Hopfield Networks
InfiniTransformer
Unofficial PyTorch/🤗 Transformers (Gemma/Llama3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"
aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
LinearAttentionArena
Here we will test various linear attention designs.