Michael Feil's repositories
hf-hub-ctranslate2
Connecting Transformers on HuggingFace Hub with CTranslate2
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
flash-deberta
Deberta, but Flash
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
academicpages
my personal website
candle
Minimalist ML framework for Rust
datachain
DataChain đź”— Process and curate unstructured data using local ML models and LLM calls
fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
JamAIBase
The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate on AI applications.
kubeai
Private Open AI on Kubernetes
nlm-ingestor
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
pylabrobot
An interactive & hardware agnostic interface for lab automation
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
qdrant-client
Python client for Qdrant vector search engine
samba-qa
Production RAG Based on API Controllers
sglang
SGLang is a fast serving framework for large language models and vision language models.
text-embeddings-inference
A blazing fast inference solution for text embeddings models
triton
Development repository for the Triton language and compiler
Verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
zerox
Zero shot pdf OCR with gpt-4o-mini