Chao Liu (monklof)

monklof

Geek Repo

Company:meituan

Location:Beijing, China

Home Page:monklof.xyz

Github PK Tool:Github PK Tool

Chao Liu's starred repositories

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37564Issues:997Issues:1142

faiss

A library for efficient similarity search and clustering of dense vectors.

linux-insides

A little bit about a linux kernel

Language:PythonLicense:NOASSERTIONStargazers:29682Issues:1360Issues:182

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:29315Issues:330Issues:5374

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

Language:LLVMLicense:NOASSERTIONStargazers:27196Issues:589Issues:73109

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:11463Issues:382Issues:3324

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language:C++License:Apache-2.0Stargazers:10356Issues:157Issues:3587

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

cudf

cuDF - GPU DataFrame Library

Language:C++License:Apache-2.0Stargazers:8089Issues:149Issues:6284

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:5004Issues:108Issues:951

mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

Language:C++License:Apache-2.0Stargazers:4900Issues:230Issues:677

MegEngine

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

Language:C++License:Apache-2.0Stargazers:4742Issues:137Issues:370

HIP

HIP: C++ Heterogeneous-Compute Interface for Portability

patchelf

A small utility to modify the dynamic linker and RPATH of ELF executables

Language:CLicense:GPL-3.0Stargazers:3402Issues:78Issues:260

jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

Language:PythonLicense:Apache-2.0Stargazers:3041Issues:62Issues:340

TensorRT

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Language:PythonLicense:BSD-3-ClauseStargazers:2436Issues:69Issues:1446

asm

Learning assembly for Linux x86_64

Language:AssemblyLicense:NOASSERTIONStargazers:2164Issues:97Issues:10

tinyflow

Tutorial code on how to build your own Deep Learning System in 2k Lines

Language:C++License:Apache-2.0Stargazers:2004Issues:83Issues:8

sequence_tagging

Named Entity Recognition (LSTM + CRF) - Tensorflow

Language:PythonLicense:Apache-2.0Stargazers:1944Issues:73Issues:83

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

Language:C++License:NOASSERTIONStargazers:1457Issues:41Issues:118

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language:C++License:MITStargazers:939Issues:44Issues:205

maxas

Assembler for NVIDIA Maxwell architecture

Language:SassLicense:MITStargazers:935Issues:88Issues:11

runtime

A performant and modular runtime for TensorFlow

Language:C++License:Apache-2.0Stargazers:750Issues:51Issues:72

d2l-tvm

Dive into Deep Learning Compiler

CS143-Compilers-Stanford

My solutions to the programming assignments of the Stanford Compiler course.

Language:C++Stargazers:340Issues:3Issues:0

gdev

First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.

stack-machine

A simple stack-based virtual machine in C++ with a Forth like programming language

rnn_benchmarks

RNN benchmarks of pytorch, tensorflow and theano