Xu Kai's repositories
PUNASfilter
Source code for the parallel ungapped-alignment-featured seed verification (PUNAS) algorithm for next generation sequence alignment.
ColossalAI
Making large AI models cheaper, faster and more accessible
dlbook_exercises
Exercises for the Deep Learning textbook at www.deeplearningbook.org
cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
public_assets
Storing publicly available assets such as images, animations and texts
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
SDU_thesis_template_for_postgraduate
山东大学硕/博士研究生毕业论文模板
TensorNVMe
A Python library transfers PyTorch tensors between CPU and NVMe