Mengchi Zhang (brad-mengchi)


Company: Meta

Location: Menlo Park

Home Page: https://sites.google.com/site/mengchizhang/

Mengchi Zhang's repositories

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Shell, Stargazers: 0, Issues: 1, Issues: 0
Language: HTML, License: BSD-2-Clause, Stargazers: 0, Issues: 1, Issues: 0

benchmark

TorchBench is a collection of open-source benchmarks used to evaluate PyTorch performance.

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

CHIPKIT

CHIPKIT: An agile, reusable open-source framework for rapid test chip development

Language: SystemVerilog, Stargazers: 0, Issues: 0, Issues: 0

ck-artifact-evaluation

Public CK repository with materials and workflows to reproduce results from published papers or open competitions at ACM, IEEE, and NeurIPS conferences and journals.

Language: JavaScript, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

cub

THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.

Language: Cuda, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

cuda-samples

Samples for CUDA developers that demonstrate features in the CUDA Toolkit.

Language: C, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0
Language: CSS, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

fairscale

PyTorch extensions for high-performance and large-scale training.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

Galois

Galois: C++ library for multi-core and multi-node parallelization

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

gpgpu-sim_simulations

A repository that complements gpgpu-sim, providing automated regression scripts, simulation-launching utilities, and the code and arguments for simulations that complete in a reasonable amount of time on GPGPU-Sim.

Language: HTML, License: BSD-2-Clause, Stargazers: 0, Issues: 0, Issues: 0

gpufs

GPUfs - File system support for NVIDIA GPUs

Language: Cuda, Stargazers: 0, Issues: 0, Issues: 0

ISCA-2021-Script

A collection of redistributable Python scripts to help organize ISCA 2021 (The 48th International Symposium on Computer Architecture).

Language: Python, License: GPL-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Shell, Stargazers: 0, Issues: 1, Issues: 0
Language: C++, Stargazers: 0, Issues: 1, Issues: 0

llvm-pass-skeleton

Example LLVM pass

Language: CMake, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept GitHub pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

Language: C++, Stargazers: 0, Issues: 0, Issues: 0

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with a PyTorch-like API

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

MightyPC

Mighty toolkit for conference Program Chairs.

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0
Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

pytorch

Tensors and dynamic neural networks in Python with strong GPU acceleration

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

sst-gpgpusim

SST GPGPU Simulation Components

Language: C++, Stargazers: 0, Issues: 0, Issues: 0

thrust

The C++ parallel algorithms library.

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

torchrec

PyTorch domain library for recommendation systems

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++, License: MIT, Stargazers: 0, Issues: 0, Issues: 0