dulante00's repositories
Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
Coursera-ML-AndrewNg-Notes
Personal notes on Andrew Ng's machine learning course
curator
Apache Curator
dbeaver
Free universal database tool and SQL client
Discovery
🐳 Nepxion Discovery is an enhancement for Spring Cloud Discovery with gray release, routing, weighting, rate limiting, circuit breaking, degradation, isolation, monitoring, and tracing
DiscoveryGuide
☀️ Guide to Nepxion Discovery, a solution for Spring Cloud with blue-green and gray release, weighting, rate limiting, circuit breaking, degradation, isolation, tracing, traffic dyeing, and failover
free-programming-books
:books: Freely available programming books
GitHub-Chinese-Top-Charts
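:cn: GitHub Chinese Top Charts, with separate "Software | Resources" rankings for each language to pinpoint good Chinese-language projects. Take what you need and learn efficiently.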
go-patterns
Curated list of Go design patterns, recipes and idioms
java-design-patterns
Design patterns implemented in Java
jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
netron
Visualizer for neural network, deep learning, and machine learning models
netty-4-user-guide-demos
Netty demos. (A comprehensive collection of Netty examples)
onnx-simplifier
Simplify your onnx model
paper-jam
Jam of papers that interest or bore me and my friends :P
paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
PaperList
report reading paper list
ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
python-patterns
A collection of design patterns/idioms in Python
recommendation
Sharing of deep learning articles
serve
Model Serving on PyTorch
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
soft-filter-pruning
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
spring-cloud-examples
Spring Cloud learning examples: service discovery, service governance, distributed tracing, service monitoring, and more
spring-cloud-gray
Spring Cloud version control and gray release starter
SpringCloud
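A microservice development scaffold based on SpringCloud 2.1 that integrates spring-security-oauth2, nacos, feign, sentinel, springcloud-gateway, and others. For service governance it brings in elasticsearch, skywalking, springboot-admin, zipkin, and others, so that projects can move quickly into business development without spending too much time on architecture setup. Continuously updated.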
transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for Hugging Face transformer models 🚀