Zhengyuan Han's starred repositories
LLMs_interview_notes
该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题
FAQ_Of_LLM_Interview
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
llm-cookbook
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
InterviewGuide
🔥🔥「InterviewGuide」是阿秀从校园->职场多年计算机自学过程的记录以及学弟学妹们计算机校招&秋招经验总结文章的汇总,包括但不限于C/C++ 、Golang、JavaScript、Vue、操作系统、数据结构、计算机网络、MySQL、Redis等学习总结,坚持学习,持续成长!
MSC-stencil-compiler
The code of paper "Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core" Processors.
prSpMV-public
prSpMV is a SpMV kernel written in C. Artifact for our paper in ICCD2023.
spv8-public
SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.
Benchmark_SpMV_using_CSR5
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
Sunway-testing-benchmarks
some testing of Sunway Taihulight (SW26010) in learning parallel programming
CUDA-Learn-Notes
🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
SpMV-CNN-Model
SpMV-CNN: A set of convolutional neural nets for estimating the run time and energy consumption of the sparse matrix-vector product
ssr-bbr-vpn
小白VPN搭建图文教程/使用国外服务器自建ssr/开启加速bbr/访问vpn/搭建教程/科学上网/翻墙10分钟搞定
quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
Papers-Graphs-with-Heterophily
A Survey of Learning from Graphs with Heterophily
Mat2Stencil
A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid.
openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
Modern-CMake-for-Cpp
Modern CMake for C++, published by Packt