Devin zhang's repositories
CBAM-TensorFlow2.0
CBAM(Convolutional Block Attention Module) implementation on TensowFlow2.0
cp_decomposition
Matrix&Tensor decomposition
model_deployment
本仓库提供了模型部署与量化等相关学习资料和参考资料、丰富的代码范例。主要涉及ONNX/TensorRT/PPQ/Triton等。
YOLOv2-TensorFlow2.0
A minimal Tensorflow2.0 implementation of YOLOv2.
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
FasterTransformer
Transformer related optimization, including BERT, GPT
KuiperInfer
带你从零实现一个高性能的深度学习推理库,支持Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
subtitle-translator-electron
↔️ Translate subtitle using ChatGPT
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production