yuguo's repositories
ChatGLM-6B-in-DeepSpeed-Chat
ChatGLM-6B in DeepSpeed-Chat for DCU
GLM-Pretrain-in-Megatron-DeepSpeed
GLM-Pretrain in Megatron-Deepspeed for DCU
flash-attention-hip
Flash Attention 2 C API for Paddle-ROCM
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
docs
Documentations for PaddlePaddle
FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
oneflow
OneFlow is a performance-centered and open-source deep learning framework.
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Tensile
Stretching GPU performance for GEMMs and tensor contractions.
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
VkFFT
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library