Learning Chip's starred repositories
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Llama-Chinese
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
DiffusionFastForward
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
stripedhyena
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
float8_experimental
This repository contains the experimental PyTorch native float8 training UX
self-speculative-decoding
Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
CUDA-Programs
Examples from Programming in Parallel with CUDA