Aloha Li (alohali)

Location: Shanghai

Aloha Li's starred repositories

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 117 | Issues: 0

composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Language: C++ | License: NOASSERTION | Stargazers: 264 | Issues: 0

recommenders-addons

Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.

Language: Cuda | License: Apache-2.0 | Stargazers: 574 | Issues: 0

DeepRec

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation at the LF AI & Data Foundation.

Language: C++ | License: Apache-2.0 | Stargazers: 1002 | Issues: 0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ | License: NOASSERTION | Stargazers: 4997 | Issues: 0

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language: C++ | License: Apache-2.0 | Stargazers: 784 | Issues: 0

qtun

Yet another SIP003 plugin based on IETF-QUIC

Language: Rust | Stargazers: 119 | Issues: 0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ | License: Apache-2.0 | Stargazers: 1 | Issues: 0

ppl.nn

A primitive library for neural networks

Language: C++ | License: Apache-2.0 | Stargazers: 1254 | Issues: 0

x-deeplearning

An industrial deep learning framework for high-dimension sparse data

Language: PureBasic | License: Apache-2.0 | Stargazers: 4241 | Issues: 0

diaosj.github.io

Keep writing

Language: HTML | Stargazers: 1 | Issues: 0

NVTabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Language: Python | License: Apache-2.0 | Stargazers: 1025 | Issues: 0

benchmark-models

benchmark models for TNN, ncnn, MNN

Language: Shell | License: BSD-3-Clause | Stargazers: 20 | Issues: 0

benchmarks

A benchmark framework for TensorFlow

Language: Python | License: Apache-2.0 | Stargazers: 1140 | Issues: 0

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple apps from Tencent, such as Mobile QQ, Weishi, and Pitu. Contributions are welcome: collaborate with us to make TNN a better framework.

Language: C++ | License: NOASSERTION | Stargazers: 4344 | Issues: 0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language: Python | License: GPL-3.0 | Stargazers: 46427 | Issues: 0

HugeCTR

HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models.

Language: C++ | License: Apache-2.0 | Stargazers: 919 | Issues: 0

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language: Python | License: NOASSERTION | Stargazers: 14078 | Issues: 0

AI-Chip

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

Language: PHP | Stargazers: 1618 | Issues: 0

MIPS_CPU

5-Stage Pipeline MIPS CPU

Language: Verilog | Stargazers: 6 | Issues: 0

clpeak

A tool which profiles OpenCL devices to find their peak capacities

Language: C++ | License: Apache-2.0 | Stargazers: 386 | Issues: 0

how-to-optimize-gemm

row-major matmul optimization

Language: C++ | License: GPL-3.0 | Stargazers: 570 | Issues: 0

asmjit

Low-latency machine code generation

Language: C++ | License: Zlib | Stargazers: 3889 | Issues: 0

FaceDetection-DSFD

Tencent Youtu's high-accuracy dual-branch face detector

Language: Python | License: NOASSERTION | Stargazers: 2885 | Issues: 0

Language: C++ | Stargazers: 11 | Issues: 0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++ | License: NOASSERTION | Stargazers: 19854 | Issues: 0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: Python | License: NOASSERTION | Stargazers: 80791 | Issues: 0

caffe

Caffe: a fast open framework for deep learning.

Language: C++ | License: NOASSERTION | Stargazers: 672 | Issues: 0

caffe

Caffe: a fast open framework for deep learning.

Language: C++ | License: NOASSERTION | Stargazers: 33988 | Issues: 0

nnForge

Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends

Language: C++ | Stargazers: 178 | Issues: 0