Meteorix's repositories
meteorix-blog
Meteorix's blog source
pyflame-server
A web service to facilitate the use of pyflame
meteorix.github.io
Meteorix's Blog
bert-as-service
Mapping a variable-length sentence to a fixed-length vector using BERT model
CUDA_by_practice
CUDA by practice
WeChatRobot
WeChat bot for the PC version of WeChat
assignment2-2018
(Spring 2018) Assignment 2: Graph Executor with TVM
ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
Language: C++, License: Apache-2.0
Cpp_Primer_Practice
Master C++ :punch:. Study repository for C++ Primer (5th edition, Chinese version), including notes and answers to the end-of-chapter exercises.
CUDALibrarySamples
CUDA Library Samples
Language: Cuda, License: NOASSERTION
effective_transformer
Running BERT without Padding
flash-attention
Fast and memory-efficient exact attention
Language: C++, License: BSD-3-Clause
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Language: C, License: NOASSERTION
tensorflow
An Open Source Machine Learning Framework for Everyone
torchscript-example
Example CMake project for TorchScript
veScale
A PyTorch Native LLM Training Framework
Language: Python, License: Apache-2.0