Josh Minor's starred repositories
llama_index
LlamaIndex is a data framework for your LLM applications
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
proxychains
proxychains - a tool that forces any TCP connection made by any given application to follow through proxy like TOR or any other SOCKS4, SOCKS5 or HTTP(S) proxy. Supported auth-types: "user/pass" for SOCKS4/5, "basic" for HTTP.
VectorDBBench
A Benchmark Tool for VectorDB
tinymembench
Simple benchmark for memory throughput and latency
DAMOV
DAMOV is a benchmark suite and a methodical framework targeting the study of data movement bottlenecks in modern applications. It is intended to study new architectures, such as near-data processing. Described by Oliveira et al. (preliminary version at https://arxiv.org/pdf/2105.03725.pdf)
armnn_tflite_backend
TensorFlow Lite backend with ArmNN delegate support for Nvidia Triton
smarter-inference
Inference as a Service with Arm optimized Triton and a custom admission controller.