can's starred repositories
sensitive-word
👮♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。请勿发布涉及政治、广告、营销、翻墙、违反国家法律法规等内容。高性能敏感词检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。)
SearchEngine
搜索引擎原理
write-you-a-vector-db
A Vector Database Tutorial (over CMU-DB's BusTub system)
mattermost
Mattermost is an open source platform for secure collaboration across the entire software development lifecycle..
everyone-can-use-english
人人都能用英语
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
llama_index
LlamaIndex is a data framework for your LLM applications
Algorithm-Practice-in-Industry
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
sese-engine
【sese-engine】新时代的搜索引擎!
search-engine-principle
搜索引擎原理详解,开源电子书
news-search-engine
新闻搜索引擎
grpc-http-proxy
A reverse proxy server which translate JSON HTTP requests to gRPC calls based on protoreflect
kafka-pixy
gRPC/REST proxy for Kafka
note-architect
架构师学习笔记仓库
system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.