Fei Hu (feihugis)

feihugis

Geek Repo

Company:@Microsoft

Location:California, USA

Github PK Tool:Github PK Tool


Organizations
tensorflow

Fei Hu's repositories

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:1Issues:1Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-courses

:books: List of awesome university courses for learning Computer Science!

Stargazers:0Issues:1Issues:0

cuda_hgemm

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Language:CudaLicense:MITStargazers:0Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:BSD-3-ClauseStargazers:0Issues:1Issues:0

dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

fastseq-1

An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

feihugis.github.io

Fei Hu's Blog

Language:HTMLStargazers:0Issues:2Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

GPTQ-triton

GPTQ inference Triton kernel

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

graph-learn

graph-learn

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

hardware-effects

Demonstration of various hardware effects.

Language:C++License:MITStargazers:0Issues:0Issues:0

mesh

Mesh TensorFlow: Model Parallelism Made Easier

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Language:C++License:MITStargazers:0Issues:1Issues:0

open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

photoprism

Personal Photo Management powered by Go and Google TensorFlow

Language:GoLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

runtime

A performant and modular runtime for TensorFlow

Language:MLIRLicense:Apache-2.0Stargazers:0Issues:1Issues:0

SPSCQueue

A bounded single-producer single-consumer wait-free and lock-free queue written in C++11

Language:C++License:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:1Issues:0

triton-adsbrain-backend

Common source, scripts and utilities for creating Triton backends.

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

triton-server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

TurboTransformers

a fast and user-friendly tool for transformer inference on CPU and GPU

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0