Yi Wang's repositories
mlx-examples
Examples in the MLX framework
Adv360-Pro-ZMK
Production repository for the all-new Advantage360 Professional using the ZMK engine
mlx
MLX: An array framework for Apple silicon
tensorrtllm_backend
The Triton TensorRT-LLM Backend
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
iree-for-apple-platforms
This project builds the IREE compiler for macOS and the IREE runtime for macOS, iOS, watchOS, and tvOS
ml_collections
ML Collections is a library of Python Collections designed for ML use cases.
cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
alpaca.cpp-ios
Locally run an Instruction-Tuned Chat-Style LLM
iree
👻
libyaml
Canonical source repository for LibYAML
re2
RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
iree-torch
Torch Frontend for IREE
torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
LLVM-On-iOS
Script to build the LLVM and Clang projects for use in an iOS app, plus an example iOS app that uses LLVM to interpret C++ programs
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept GitHub pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration