RuQing Xu's repositories
blis_apple
BLIS fork with kernels for Apple M1. Possibly the first open-source BLAS with Apple Matrix Coprocessor support.
BliContractor.jl
Fast tensor contractor for Julia, compatible with TensorOperations.jl and based on TBLIS, with support for generic strides and automatic differentiation, in under 400 lines.
rime-hifumi
Japanese input with the ultra-lightweight IME Rime
TensorAnyDiff.jl
(Obsolete) Tensordot and Einsum for Julia with support for 2nd order derivatives.
Pfaffian.jl
Simple Julia program for computing the Pfaffian and the inverse of an antisymmetric matrix.
MPSSimpleGemm
[+Julia] Use Metal Performance Shaders to compute low-precision GEMM.
tblis
Arm kernels and tuning for TBLIS: direct packing and contraction of tensors.
abeliantensors
A library for Abelian symmetry preserving tensors in Python 3
BlisSandboxOnA14
BLIS Sandbox for iOS. Should work on A13, A14, A15 and M1.
ClGemmerator.jl
An attempt at OpenCL templates for GEMM.
cutlass
CUDA Templates for Linear Algebra Subroutines
DualArray.jl
Forward-mode Differentiation for Arrays.
ForwardDiff.jl
Forward Mode Automatic Differentiation for Julia
hptt
High-Performance Tensor Transpose library
libblastrampoline
Using PLT trampolines to provide a BLAS and LAPACK demuxing library.
librime
Rime Input Method Engine, the core library
MLCSimpleGemm
[+Julia] Simple C-to-Swift calling of ML Compute GEMM.
Pfapack77.jl
Julia wrapper over Pfapack77
rime-japanese
Input method for typing Japanese with RIME
TensorOperations.jl
Julia package for tensor contractions and related operations