myy1966's starred repositories
Building-a-RISC-V-CPU-Core
This repository contains my work in completing the course titled "Building a RISC-V CPU Core" offered by the Linux Foundation through edX.
riscv-multi-core-lotr
RISCV core RV32I/E.4 threads in a ring architecture
optimize-gemm
How to optimize sgemm in single-thread ARM cpu, mutli-threads ARM cpu and Nvidia gpu
nontrivial-mips
NonTrivial-MIPS is a synthesizable superscalar MIPS processor with branch prediction and FPU support, and it is capable of booting linux.
EfficientConvolution
Implementation of an efficient convolution between 3D tensors and 4D tensors.
OpenMP-101
Learn OpenMP examples step by step
universal_NPU-CNN_accelerator
hardware design of universal NPU(CNN accelerator) for various convolution neural network
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
rp2040-pio-emulator
RP2040 emulator for the testing and debugging of PIO programs
systolic-array
A DSL for Systolic Arrays
perforator
Record "perf" performance metrics for individual functions/regions of an ELF binary.
fast-interconnects-demo
How to use fast GPU interconnects in practice
breath-of-the-wild-map
The Zelda game map, but in the browser