zhaoyang-star

zhaoyang-star

Geek Repo

Location:Beijing

Github PK Tool:Github PK Tool

zhaoyang-star's repositories

test_opencl_image_object

use opencl image object for NHWC tensor

Language:C++Stargazers:1Issues:2Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:C++License:UnlicenseStargazers:0Issues:0Issues:0

clpeak

A tool which profiles OpenCL devices to find their peak capacities

Language:C++License:UnlicenseStargazers:0Issues:0Issues:0

code-samples

Source code examples from the Parallel Forall Blog

Language:HTMLLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

minimal-opencl-on-windows

Minimal OpenCL program on Windows

Language:CLicense:MITStargazers:0Issues:0Issues:0

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Language:C++Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

OpenCL-CLHPP

Khronos OpenCL-CLHPP

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenCL-Headers

Khronos OpenCL-Headers

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

License:Apache-2.0Stargazers:0Issues:0Issues:0

Paddle-Lite

Multi-platform high performance deep learning inference engine (『飞桨』多平台高性能深度学习预测引擎)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

Paddle-Lite-Demo

lib, demo, model, data

License:Apache-2.0Stargazers:0Issues:0Issues:0

SNPE-UDL-TEST

UDL test for SNPE-1.31.0.522

Stargazers:0Issues:0Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

test1

gitskills

Stargazers:0Issues:0Issues:0

threadpool

Fork of a nice threadpool library written by Ronald Kriemann which can be found here: http://www.kriemann.name/Ronald/projects/threadpool/index.en.htm

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0