Xinfeng (GD06)

GD06

Geek Repo

Company:UC, Santa Barbara

Location:https://seal.ece.ucsb.edu/location

Github PK Tool:Github PK Tool

Xinfeng's repositories

Language:PythonLicense:MITStargazers:11Issues:2Issues:0

MPU-ASPLOS-2021

Source code of MPU simulator and compiler for ASPLOS 2021 submission.

Language:PythonStargazers:3Issues:0Issues:0

cudnn-tuning

Codes for auto-tuning cudnn conv forward implementations

Language:PythonStargazers:1Issues:0Issues:0

mkldnn-perf

Testing the performance of the MKL-DNN

Language:C++Stargazers:1Issues:0Issues:0

caffe

Caffe: a fast open framework for deep learning.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

caffe-tensorflow

Caffe models in TensorFlow

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

cublas_perf

Testing the performance of the cuBLAS

Language:C++Stargazers:0Issues:0Issues:0

cuda-convnet2

Automatically exported from code.google.com/p/cuda-convnet2

Language:CudaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fathom

Reference workloads for modern deep learning methods.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

Halide

a language for fast, portable data-parallel computation

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

leveldb

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

models

Models and examples built with TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mpu-homepage

Homepage of the MPU project based on the Cayman theme.

Language:HTMLLicense:CC0-1.0Stargazers:0Issues:0Issues:0

mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

NiftyRec

NiftyRec is a software toolbox for Tomographic image reconstruction. NiftyRec is written in C and computationally intensive functions have a GPU accelerated version based on NVidia CUDA. NiftyRec includes a Matlab Toolbox and a Python Package that access the low level routines, hiding the complexity of the GPU accelerated algorithms.

Language:CLicense:NOASSERTIONStargazers:0Issues:2Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

License:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-cifar

95.16% on CIFAR10 with PyTorch

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

torchrec

Pytorch domain library for recommendation systems

License:BSD-3-ClauseStargazers:0Issues:0Issues:0