Sam Wu (samjwu)

samjwu

Geek Repo

Company:@AMD

Home Page:samjwu.github.io

Github PK Tool:Github PK Tool


Organizations
amd
GPUOpen-ProfessionalCompute-Libraries
RadeonOpenCompute
ROCm
ROCm-Developer-Tools
ROCmSoftwarePlatform

Sam Wu's repositories

composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

hip-python

HIP Python Low-level Bindings

Language:CythonLicense:MITStargazers:0Issues:0Issues:0

hipBLAS

ROCm BLAS marshalling library

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

hipBLASLt

hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

Language:AssemblyLicense:MITStargazers:0Issues:0Issues:0

hipfort

Fortran interfaces for ROCm libraries

Language:FortranLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0

hipTensor

AMD’s C++ library for accelerating tensor primitives

Language:C++License:MITStargazers:0Issues:0Issues:0

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

License:NOASSERTIONStargazers:0Issues:0Issues:0

MIVisionX

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

Language:C++License:MITStargazers:0Issues:0Issues:0

ProgrammingProblemsPython

Programming problems solved in Python

Language:PythonStargazers:0Issues:0Issues:0

rccl

ROCm Communication Collectives Library (RCCL)

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0

rocAL

The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a processing graph programmable by the user.

Language:C++License:MITStargazers:0Issues:0Issues:0

rocDecode

rocDecode is a high performance video decode SDK for AMD hardware

License:NOASSERTIONStargazers:0Issues:0Issues:0

ROCm

ROCm - Open Software Platform for GPU Compute

Language:ShellLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

rocm-cmake

CMake modules used within the ROCm libraries

Language:CMakeLicense:MITStargazers:0Issues:0Issues:0

rocm-docs-core

ROCm Documentation Python package for ReadTheDocs build standardization

Language:CSSLicense:NOASSERTIONStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

rocPRIM

ROCm Parallel Primitives

Language:C++License:MITStargazers:0Issues:0Issues:0

rocr_debug_agent

The ROCdebug-agent is a library that can be loaded by ROCm Platform Runtime to provide some debugging functionality.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

rocRAND

RAND library for HIP programming language

Language:C++License:MITStargazers:0Issues:0Issues:0

rocThrust

ROCm Thrust - run Thrust dependent software on AMD GPUs

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

rocWMMA

rocWMMA

Language:C++License:MITStargazers:0Issues:0Issues:0

rpp

AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.

Language:C++License:MITStargazers:0Issues:0Issues:0

samjwu.github.io

Personal Website

Language:HTMLStargazers:0Issues:1Issues:0
Language:C++Stargazers:0Issues:0Issues:0

SlurmSetup

Ansible playbooks to set up SLURM

Language:YAMLStargazers:0Issues:0Issues:0

Tensile

Stretching GPU performance for GEMMs and tensor contractions.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0