Kylin's repositories

akg

AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops with specific patterns.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers:0Issues:0Issues:0

BlogLearning

自己的学习历程,重点包括各种好玩的图像处理算法、运动捕捉、机器学习

Stargazers:0Issues:0Issues:0

bolt

10x faster matrix and vector operations.

License:MPL-2.0Stargazers:0Issues:0Issues:0

EET

Easy and Efficient Transformer : Scalable Inference Solution For Large NLP model

License:Apache-2.0Stargazers:0Issues:0Issues:0

EMLL

Edge Machine Learning Library

License:Apache-2.0Stargazers:0Issues:0Issues:0

FasterStereoCuda-Library

这是一个基于CUDA加速的快速立体匹配库,它的核心是SemiglobalMatching(SGM)算法,它不仅在时间效率上要远远优于基于CPU的常规SGM,而且占用明显更少的内存,这意味着它不仅可以在较低分辨率(百万级)图像上达到实时的帧率,且完全具备处理千万级甚至更高量级图像的能力。

Language:C++Stargazers:0Issues:1Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

License:Apache-2.0Stargazers:0Issues:0Issues:0

heterocl

HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Chinese).

License:UnlicenseStargazers:0Issues:0Issues:0

inter-operator-scheduler

[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration

License:MITStargazers:0Issues:0Issues:0

KSAI-Lite

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

License:Apache-2.0Stargazers:0Issues:0Issues:0

MASTER-pytorch

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

License:MITStargazers:0Issues:0Issues:0

NN-CUDA-Example

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language:C++License:MITStargazers:0Issues:1Issues:0

nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

oneflow

OneFlow is a performance-centered and open-source deep learning framework.

License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx

Open standard for machine learning interoperability

License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx-simplifier

Simplify your onnx model

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PL-Compiler-Resource

程序语言与编译技术相关资料(持续更新中)

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

portrait-matting-unet-flask

Portrait Mating implementation in UNet with PyTorch.

License:GPL-3.0Stargazers:0Issues:0Issues:0

pylint

It's not just a linter that annoys you!

License:GPL-2.0Stargazers:0Issues:0Issues:0

RASP

An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"

Stargazers:0Issues:0Issues:0

service-streamer

Boosting your Web Services of Deep Learning Applications.

License:Apache-2.0Stargazers:0Issues:0Issues:0

taichi

Productive & portable programming language for high-performance, sparse & differentiable computing on CPUs & GPUs

License:MITStargazers:0Issues:0Issues:0

tedukuri

《算法竞赛进阶指南》资源社区

Stargazers:0Issues:0Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

License:Apache-2.0Stargazers:0Issues:0Issues:0

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a lightweight and high-performance deep learning framework for mobile inference. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework. TNN:由腾讯优图实验室和光影实验室协同打造,移动端高性能、轻量级推理框架,同时拥有跨平台、高性能、模型压缩、代码裁剪等众多突出优势。TNN框架在原有Rapidnet、ncnn框架的基础上进一步加强了移动端设备的支持以及性能优化,同时也借鉴了业界主流开源框架高性能和良好拓展性的优点。目前TNN已经在手Q、微视、P图等应用中落地,欢迎大家参与协同共建,促进TNN推理框架进一步完善。

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

TRTorch

PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0