Ben Sander (bensander)

Company: AMD

Location: United States

Ben Sander's starred repositories

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ | License: Apache-2.0 | Stargazers: 186039 | Issues: 7592 | Issues: 39880

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: Python | License: NOASSERTION | Stargazers: 82962 | Issues: 1740 | Issues: 45622

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 35100 | Issues: 343 | Issues: 2759

onnx

Open standard for machine learning interoperability

Language: Python | License: Apache-2.0 | Stargazers: 17761 | Issues: 439 | Issues: 2817

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language: Python | License: NOASSERTION | Stargazers: 14219 | Issues: 335 | Issues: 2241

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language: Python | License: Apache-2.0 | Stargazers: 11694 | Issues: 376 | Issues: 3383

ROCm

AMD ROCm™ Software - GitHub Home

Language: Shell | License: MIT | Stargazers: 4565 | Issues: 215 | Issues: 2372

HIP

HIP: C++ Heterogeneous-Compute Interface for Portability

glow

Compiler for Neural Network hardware accelerators

Language: C++ | License: Apache-2.0 | Stargazers: 3217 | Issues: 154 | Issues: 827

ispc

Intel® Implicit SPMD Program Compiler

Language: C++ | License: BSD-3-Clause | Stargazers: 2498 | Issues: 94 | Issues: 1256

mlir

"Multi-Level Intermediate Representation" Compiler Infrastructure

MIOpen

AMD's Machine Intelligence Library

Language: Assembly | License: NOASSERTION | Stargazers: 1062 | Issues: 90 | Issues: 1042

tiramisu

A polyhedral compiler for expressing fast and portable data parallel algorithms

Language: C++ | License: MIT | Stargazers: 916 | Issues: 45 | Issues: 64

ROCm-docker

Dockerfiles for the various software layers defined in the ROCm software platform

Language: Shell | License: MIT | Stargazers: 422 | Issues: 59 | Issues: 66

radeon_gpu_profiler

Radeon GPU Profiler (RGP) is a tool from AMD that allows for deep inspection of GPU workloads.

deepmark

THE Deep Learning Benchmarks

Language: Lua | License: Apache-2.0 | Stargazers: 352 | Issues: 72 | Issues: 6

ngraph-python

Original Python version of Intel® Nervana™ Graph

Language: Python | License: Apache-2.0 | Stargazers: 215 | Issues: 50 | Issues: 14

Tensile

Stretching GPU performance for GEMMs and tensor contractions.

Language: Python | License: MIT | Stargazers: 215 | Issues: 55 | Issues: 101

hipCaffe

(Deprecated) hipCaffe: the HIP port of Caffe

Language: C++ | License: NOASSERTION | Stargazers: 124 | Issues: 19 | Issues: 34

HIP-CPU

An implementation of HIP that works on CPUs, across OSes.

Language: C++ | License: MIT | Stargazers: 112 | Issues: 19 | Issues: 37

gpumembench

A GPU benchmark suite for assessing on-chip GPU memory bandwidth

Language: C++ | License: GPL-2.0 | Stargazers: 99 | Issues: 10 | Issues: 3

training_policies

Issues related to MLPerf™ training policies, including rules and suggested changes

Language: Python | License: Apache-2.0 | Stargazers: 92 | Issues: 39 | Issues: 363

fathom

Reference workloads for modern deep learning methods.

Language: Python | License: Apache-2.0 | Stargazers: 73 | Issues: 20 | Issues: 36

deepSpeech2

End-to-end speech recognition using TensorFlow

Language: Python | License: BSD-3-Clause | Stargazers: 50 | Issues: 9 | Issues: 0

Thrust

HIP back-end for Thrust that has been replaced by rocThrust

Language: C++ | License: Apache-2.0 | Stargazers: 28 | Issues: 14 | Issues: 15

Language: Cuda | License: NOASSERTION | Stargazers: 14 | Issues: 24 | Issues: 17

machine-learning

repository for notes and data from machine learning studies