Jinze Xue's starred repositories
flash-attention
Fast and memory-efficient exact attention
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
WebPlotDigitizer
Online tool to extract numerical data from plot images.
llama2.mojo
Inference Llama 2 in one file of pure 🔥
chemcrow-public
ChemCrow
torch-harmonics
Differentiable spherical harmonic transforms and spherical convolutions in PyTorch
torchmd-protein-thermodynamics
Tutorials and data necessary to reproduce the results of the publication "Machine Learning Coarse-Grained Potentials of Protein Thermodynamics"
qca-dataset-submission
Data generation and submission scripts for the QCArchive ecosystem.
nanoreactor
Nanoreactor analysis codes (not yet released)
nanoreactor_processing
Post-processing code for computational nanoreactor simulations