Tianlei Wu's repositories
Stable-Diffusion-WebUI-OnnxRuntime
Extension for Automatic1111's Stable Diffusion WebUI that uses the ONNX Runtime CUDA execution provider to deliver high-performance results on NVIDIA GPUs.
bert
TensorFlow code and pre-trained models for BERT
CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
diffusers
🤗 Diffusers: experiments with ONNX diffusion models
gdrivedl
Google Drive Download Python Script
inference
Reference implementations of inference benchmarks
onnx
Open Neural Network Exchange
segment-anything
ONNX Runtime support for SAM
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
tutorials
Tutorials for creating and using ONNX models
libflash_attn
Standalone Flash Attention v2 kernel without libtorch dependency
onnx-modifier
A tool for modifying ONNX models visually, based on Netron and Flask.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
OrtMultiThreadCSharp
Test ORT with multithreading
unsloth
2-5X faster, 70% less memory QLoRA & LoRA finetuning