jerryw95's starred repositories
basic_verilog
Must-have verilog systemverilog modules
rad-flow
The RAD flow is an open-source academic architecture exploration and evaluation flow for novel beyond-FPGA reconfigurable acceleration devices (RADs). These devices incorporate conventional FPGA fabrics, several coarse-grained domain-specialized accelerator blocks, and high-performance networks-on-chip for system-level communication.
systemverilog
SystemVerilog for ASIC/FPGA Design & Simulation, with Synopsys Tool Flow
CUDA-Learn-Notes
🎉CUDA&C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Near-Maximum-Independent-Set
Near-linear time algorithm for computing near-maximum independent set
MinimumVertexCover_DRL
Learning to solve Minimum Vertex Cover using Graph Convolutional Networks and RL
GCN_Partitioning
Graph Partitoning Using Graph Convolutional Networks
graph-partitioning-algorithms
Multi-way graph partitioning algorithms: FMS (Fiduccia-Mattheyses-Sanchis), PLM (Partitioning by Locked Moves), PFM (Partitioning by Free Moves)
Graph-Partitioning
The implementation is based on the Fiduccia-Mattheyses algorithm.
CNN-Accelerator-VLSI
Convolutional accelerator kernel, target ASIC & FPGA
hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
suitesparse-metis-for-windows
CMake scripts for painless usage of SuiteSparse+METIS from Visual Studio and the rest of Windows/Linux/OSX IDEs supported by CMake
CUDALibrarySamples
CUDA Library Samples