ltj2013

ltj2013

Geek Repo

Github PK Tool:Github PK Tool

ltj2013's starred repositories

ppl.cv

ppl.cv is a high-performance image processing library of openPPL supporting various platforms.

Language:C++License:Apache-2.0Stargazers:484Issues:0Issues:0

ppl.nn

A primitive library for neural network

Language:C++License:Apache-2.0Stargazers:1254Issues:0Issues:0

AutoKernel

AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。

Language:C++License:Apache-2.0Stargazers:777Issues:0Issues:0
Language:C++Stargazers:39Issues:0Issues:0

Fractional-GPUs

Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions

Language:CStargazers:149Issues:0Issues:0

KeplerAs

An Open Source Kepler GPU Assembler

Language:PerlLicense:MITStargazers:17Issues:0Issues:0

tvm-cuda-int8-benchmark

Benchmark of TVM quantized model on CUDA

Language:PythonStargazers:113Issues:0Issues:0

laser

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers

Language:NimLicense:Apache-2.0Stargazers:266Issues:0Issues:0

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.

Language:C++License:NOASSERTIONStargazers:4344Issues:0Issues:0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++License:NOASSERTIONStargazers:19843Issues:0Issues:0

turingas

Assembler for NVIDIA Volta and Turing GPUs

Language:PythonLicense:MITStargazers:190Issues:0Issues:0

12306

12306智能刷票,订票

Language:PythonLicense:MITStargazers:33731Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:12084Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language:C++License:Apache-2.0Stargazers:10342Issues:0Issues:0

zh-google-styleguide

Google 开源项目风格指南 (中文版)

Language:MakefileStargazers:10485Issues:0Issues:0

maxas

Assembler for NVIDIA Maxwell architecture

Language:SassLicense:MITStargazers:935Issues:0Issues:0

caffe-fixedpoint

minimized caffe, include only inference part, and support fixed point computation

Language:C++Stargazers:6Issues:0Issues:0
Language:VimLStargazers:6Issues:0Issues:0

asfermi

assembler for NVIDIA FERMI. Imported from Google Code

Language:C++Stargazers:66Issues:0Issues:0

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of a contemporary GPU (such as NVIDIA's Fermi and GT200 architectures) running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

Language:C++License:NOASSERTIONStargazers:1Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:11456Issues:0Issues:0

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.

Language:C++License:NOASSERTIONStargazers:1043Issues:0Issues:0

CaffeModelCompression

Tool to compress trained caffe weights

Language:CStargazers:106Issues:0Issues:0

caffe

Caffe for Sparse and Low-rank Deep Neural Networks

Language:C++License:NOASSERTIONStargazers:374Issues:0Issues:0