coder(anonymous)'s repositories

Language:CLicense:BSD-3-ClauseStargazers:23Issues:1Issues:6
Language:CLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers:0Issues:0Issues:0

FeatherCNN

FeatherCNN is a high performance inference engine for convolutional neural networks.

Language:C++Stargazers:0Issues:0Issues:0

gluon-cv

Gluon CV Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hipBLAS

ROCm BLAS marshalling library

Language:C++License:MITStargazers:0Issues:0Issues:0

HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Chinese only).

Language:JavaScriptLicense:UnlicenseStargazers:0Issues:0Issues:0

incubator-mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

License:MITStargazers:0Issues:0Issues:0

models

A collection of pre-trained, state-of-the-art models in the ONNX format

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F

Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually.

Language:CLicense:GPL-3.0Stargazers:0Issues:0Issues:0

rankfm

Factorization Machines for Recommendation and Ranking Problems with Implicit Feedback Data

License:GPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0
Language:CStargazers:0Issues:0Issues:0

TileSpGEMM

Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Yuyao Niu, Zhengyang Lu, Haonan Ji, Shuhui Song, Zhou Jin, and Weifeng Liu.

Language:CStargazers:0Issues:0Issues:0

tsm2x-imp

Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA

License:MITStargazers:0Issues:0Issues:0