Michalis Papadimitriou's repositories
llama-shepherd-cli
A CLI to manage, install, and configure llama inference implementations in multiple languages
llama2.tornadovm.java
An extension to Llama2.java implementation accelerated with GPUs, using TornadoVM
collage-non-tvm-fork
Non-forked version of Collage for a proof of concept
commitgpt
Automatically generate commit messages using ChatGPT
fzf
:cherry_blossom: A command-line fuzzy finder
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
java
Java bindings for TensorFlow
Jlama
Jlama is a pure Java implementation of a LLM inference engine.
jvm_allocation_ref
A toy application comparing primitive array allocation on heap with Panama off-heap memory segment allocation
kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
llama2.c
Inference Llama 2 in one file of pure C
llama2.java
Inference Llama 2 in one file of pure Java
llama3.java
Practical Llama 3 inference in Java
llamafile
Distribute and run LLMs with a single file.
llm-apps-java-spring-ai
Samples showing how to build Java applications powered by Generative AI and LLMs using Spring AI and Spring Boot.
mikepapadim
Custom GitHub profile
rjvm
A tiny JVM written in Rust (learning project)
sd4j
Stable diffusion pipeline in Java using ONNX Runtime
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
TornadoVM
Tornado: A practical and efficient heterogeneous programming framework for managed languages
tutorials
Tutorials for creating and using ONNX models
wasmtime
A fast and secure runtime for WebAssembly