Michael Feil (michaelfeil)

michaelfeil

User data from Github https://github.com/michaelfeil

Company:@basetenlabs

Location:San Francisco

Home Page:michaelfeil.eu

GitHub:@michaelfeil

Twitter:@feilsystem

Michael Feil's repositories

infinity

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Language:PythonLicense:MITStargazers:1799Issues:19Issues:205

embed

A stable, fast and easy-to-use inference library with a focus on a sync-to-async API

hf-hub-ctranslate2

Connecting Transformers on HuggingFace Hub with CTranslate2

Language:PythonLicense:MITStargazers:36Issues:2Issues:12

skyjo_rl

Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB

Language:Jupyter NotebookLicense:MITStargazers:12Issues:1Issues:1
Language:C++License:Apache-2.0Stargazers:10Issues:0Issues:0

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

flash-deberta

Deberta, but Flash

Language:PythonStargazers:1Issues:1Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

academicpages

my personal website

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

candle

Minimalist ML framework for Rust

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

datachain

DataChain đź”— Process and curate unstructured data using local ML models and LLM calls

License:Apache-2.0Stargazers:0Issues:0Issues:0

fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

JamAIBase

The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate on AI applications.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kubeai

Private Open AI on Kubernetes

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

nlm-ingestor

This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pylabrobot

An interactive & hardware agnostic interface for lab automation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

qdrant

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

License:Apache-2.0Stargazers:0Issues:0Issues:0

qdrant-client

Python client for Qdrant vector search engine

License:Apache-2.0Stargazers:0Issues:0Issues:0

samba-qa

Production RAG Based on API Controllers

License:Apache-2.0Stargazers:0Issues:0Issues:0

sglang

SGLang is a fast serving framework for large language models and vision language models.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

text-embeddings-inference

A blazing fast inference solution for text embeddings models

License:Apache-2.0Stargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:0Issues:0

Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

zerox

Zero shot pdf OCR with gpt-4o-mini

Language:PythonLicense:MITStargazers:0Issues:0Issues:0