wm901115nwpu's starred repositories
triton-shared
Shared Middle-Layer for Triton Compilation
llama_index
LlamaIndex is a data framework for your LLM applications
resource-stream
CUDA related news and material links
pytorch-model-train-template
pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用
ring-attention
ring-attention experiments
parler-tts
Inference and training library for high-quality TTS models.
recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
whisper.cpp
Port of OpenAI's Whisper model in C/C++