Jsson Xia (jssonx)

jssonx

Geek Repo

Company:Cornell ECE

Location:Houston, TX

Home Page:https://jssonx.github.io

Twitter:@jssonxia

Github PK Tool:Github PK Tool

Jsson Xia's repositories

awesome-gemm

A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software.

License:MITStargazers:2Issues:0Issues:0

cs149-parallel-computing

Stanford CS149 Parallel Computing

eva-lang

A functional programming language in JavaScript.

Language:JavaScriptStargazers:1Issues:1Issues:0

lightneuron

An educational inference framwork.

Language:CLicense:MITStargazers:1Issues:1Issues:0

gemm-kernel-microbenchmark

A microbenchmark for GEMM kernels on NVIDIA GPUs with Ampere Architecture.

Language:C++Stargazers:0Issues:0Issues:0

hands-on-simd-programming

Hands-on SIMD Programming with C++.

Language:C++Stargazers:0Issues:0Issues:0

leakcheck

Memory leak detector (MLD) for C applications.

Language:CLicense:MITStargazers:0Issues:0Issues:0

nlp-with-spark

Insight Mastodon: NLP Analysis with Spark

Language:PythonStargazers:0Issues:0Issues:0

algo-playground

optimize to push the limits.

Language:PythonStargazers:0Issues:0Issues:0

asst1

Stanford CS149 -- Assignment 1

Language:C++Stargazers:0Issues:0Issues:0

asst2

Stanford CS149 -- Assignment 2

Language:C++Stargazers:0Issues:0Issues:0

asst4

Stanford CS149 -- Assignment 4

Language:C++Stargazers:0Issues:0Issues:0
Language:CStargazers:0Issues:0Issues:0

ByteTransformer

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

License:Apache-2.0Stargazers:0Issues:0Issues:0

compute-benchmarks

Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver

License:MITStargazers:0Issues:0Issues:0

compute-runtime

Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver

License:MITStargazers:0Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License:Apache-2.0Stargazers:0Issues:0Issues:0

hatchet

Graph-indexed Pandas DataFrames for analyzing hierarchical performance data

License:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

level-zero

oneAPI Level Zero Specification Headers and Loader

License:MITStargazers:0Issues:0Issues:0

lz77

LZ77 in C.

Language:CLicense:MITStargazers:0Issues:0Issues:0

oneAPI-samples

Samples for Intel® oneAPI Toolkits

License:MITStargazers:0Issues:0Issues:0

pti-gpu

Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysis on Intel(R) Processor Graphics easily

Language:C++License:MITStargazers:0Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

License:Apache-2.0Stargazers:0Issues:0Issues:0

RookieDB

Berkeley CS186: Introduction to Database Systems

Language:JavaStargazers:0Issues:0Issues:0

smith-waterman

Pairwise sequence alignment algorithm.

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

tapa

TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerators.

License:MITStargazers:0Issues:0Issues:0