Botao Zhou's starred repositories
Simple-Chatroom
A simple chat room to demonstrate knowledge in C++ using Bazel, Protocol Buffers, and gRPC.
cmake-project
CMake完整使用教程。CMake教程包括一系列循序渐进的任务,介绍CMake信息,展示如何实现目标。
KuiperInfer
带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
interview
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
CUDA-Learn-Notes
🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
soccer-temporal-knowledge-graph
足球领域的时间知识图谱:包括了本体的构建,数据的爬取,数据的解析,rdf/xml格式的数据生成,jena-fuseki用于存储,基于语义解析的问答系统
CUDATutorial
A self-learning tutorail for CUDA High Performance Programing.
HPC-Lab-SYSU
2020秋中山大学高性能计算课程课件与作业
Implicit-Im2col-for-Backpropagation
🚀An implicit im2col supporting backpropagation on CUDA, and a CNN backpropagation framework.
implicit-gemm-tensor-core-convolution
Simple example of how to write an Implicit GEMM Convolution in CUDA using the tensor core WMMA API and bindings for PyTorch.
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
news-recommendation
Implementations of some methods in news recommendation.
HPC-Learning-Notes
高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!
NLP_textClassifier
基于word2vec预训练词向量; textCNN 模型 ;charCNN 模型 ;Bi-LSTM模型;Bi-LSTM + Attention 模型 ;Transformer 模型 ;ELMo 预训练模型 ;BERT 预训练模型的文本分类项目
Yahoo-News-Dataset
Yahoo! news dataset of DeepCom (EMNLP2019)
recommenders
Best Practices on Recommendation Systems
wrox-pro-cuda-c
Sample code from the book "Professional CUDA C Programming"
TinyWebServer
:fire: Linux下C++轻量级WebServer服务器