Meng, Hengyu (airMeng)

airMeng

Geek Repo

Company:Intel

Location:Shanghai

Home Page:https://read.cv/hym

Github PK Tool:Github PK Tool


Organizations
RunoobHelpsRunoob

Meng, Hengyu's starred repositories

intel-npu-acceleration-library

Intel® NPU Acceleration Library

Language:PythonLicense:Apache-2.0Stargazers:333Issues:0Issues:0

neural-speed

An innovative library for efficient LLM inference via low-bit quantization

Language:C++License:Apache-2.0Stargazers:287Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:50Issues:0Issues:0

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Language:CudaStargazers:1084Issues:0Issues:0

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language:PythonLicense:Apache-2.0Stargazers:2003Issues:0Issues:0

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonLicense:Apache-2.0Stargazers:2029Issues:0Issues:0

x86-64-minimal-JIT-compiler-Cpp

Writing a minimal x86-64 JIT compiler in C++

Language:C++License:GPL-3.0Stargazers:93Issues:0Issues:0

optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:342Issues:0Issues:0

intel-extension-for-tensorflow

Intel® Extension for TensorFlow*

Language:C++License:NOASSERTIONStargazers:306Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:174Issues:0Issues:0

mlir-hello

MLIR Sample dialect

Stargazers:90Issues:0Issues:0

mlirx

MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com

Stargazers:37Issues:0Issues:0

pasl

Parallel Algorithm Scheduling Library

Language:C++License:Apache-2.0Stargazers:100Issues:0Issues:0

sgemm_hsw

This is an implementation of sgemm_kernel on L1d cache.

Language:AssemblyLicense:GPL-3.0Stargazers:214Issues:0Issues:0

bril

an educational compiler intermediate representation

Language:RustLicense:MITStargazers:473Issues:0Issues:0

ppl.nn

A primitive library for neural network

Language:C++License:Apache-2.0Stargazers:1236Issues:0Issues:0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers:2229Issues:0Issues:0

sparsednn

Fast sparse deep learning on CPUs

Language:PythonLicense:Apache-2.0Stargazers:51Issues:0Issues:0

maxas

Assembler for NVIDIA Maxwell architecture

Language:SassLicense:MITStargazers:921Issues:0Issues:0

oneAPI-samples

Samples for Intel® oneAPI Toolkits

Language:C++License:MITStargazers:867Issues:0Issues:0

easy-just-in-time

LLVM Optimization to extract a function, embedded in its intermediate representation in the binary, and execute it using the LLVM Just-In-Time compiler.

Language:C++License:BSD-3-ClauseStargazers:505Issues:0Issues:0

onnx2pytorch

Transform ONNX model to PyTorch representation

Language:PythonLicense:Apache-2.0Stargazers:297Issues:0Issues:0

GEMM_Optimization

Optimize GEMM. With AVX512 and AVX512-BF16, 800x improvement.

Language:C++Stargazers:14Issues:0Issues:0

dpcpp-tutorial

Intel Data Parallel C++ (and SYCL 2020) Tutorial.

Language:C++License:MITStargazers:89Issues:0Issues:0

lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

Language:C++License:NOASSERTIONStargazers:3111Issues:0Issues:0

onnx-mlir

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

Language:C++License:Apache-2.0Stargazers:701Issues:0Issues:0

ipex_verbose

ipex verbose toolkit

Language:PythonStargazers:2Issues:0Issues:0

PySparseConvNet

Python Framework for sparse neural networks

Language:CudaStargazers:19Issues:0Issues:0

mtensor

a c++/cuda template library for tensor lazy evaluation

Language:C++License:NOASSERTIONStargazers:159Issues:0Issues:0

MinkowskiEngine

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Language:PythonLicense:NOASSERTIONStargazers:2329Issues:0Issues:0