jssonx

followers

following

stars

Cornell ECE

Houston, TX

https://jssonx.github.io

Jsson Xia's repositories

awesome-gemm

A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software.

MIT200

cs149-parallel-computing

Stanford CS149 Parallel Computing

1 10

eva-lang

A functional programming language in JavaScript.

Language:JavaScript1 10

jssonx

1 10

lightneuron

An educational inference framwork.

Language:CMIT1 10

gemm-kernel-microbenchmark

A microbenchmark for GEMM kernels on NVIDIA GPUs with Ampere Architecture.

Language:C++000

hands-on-simd-programming

Hands-on SIMD Programming with C++.

Language:C++000

leakcheck

Memory leak detector (MLD) for C applications.

Language:CMIT000

nlp-with-spark

Insight Mastodon: NLP Analysis with Spark

Language:Python000

algo-playground

optimize to push the limits.

Language:Python000

asst1

Stanford CS149 -- Assignment 1

Language:C++000

asst2

Stanford CS149 -- Assignment 2

Language:C++000

asst4

Stanford CS149 -- Assignment 4

Language:C++000

batched_gemm

Language:C000

ByteTransformer

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

Apache-2.0000

compute-benchmarks

Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver

MIT000

compute-runtime

Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver

MIT000

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++NOASSERTION000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Apache-2.0000

hatchet

Graph-indexed Pandas DataFrames for analyzing hierarchical performance data

MIT000

hpcconf

Language:HTML000

level-zero

oneAPI Level Zero Specification Headers and Loader

MIT000

lz77

LZ77 in C.

Language:CMIT000

oneAPI-samples

Samples for Intel® oneAPI Toolkits

MIT000

pti-gpu

Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysis on Intel(R) Processor Graphics easily

Language:C++MIT000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Apache-2.0000

RookieDB

Berkeley CS186: Introduction to Database Systems

Language:Java000

smith-waterman

Pairwise sequence alignment algorithm.

Language:C++MIT000

sycl-samples

Language:C++000

tapa

TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerators.

MIT000