Viktor Ferenczi's starred repositories
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
CopilotKit
A framework for building custom AI Copilots 🤖 in-app AI chatbots, in-app AI Agents, & AI-powered Textareas.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
parsimonious
The fastest pure-Python PEG parser I can muster
Awesome-GPT-Store
Custom GPT Store - A collection of major GPTS available in public
flashinfer
FlashInfer: Kernel Library for LLM Serving
super-json-mode
Low latency JSON generation using LLMs ⚡️
quiet-star
Code for Quiet-STaR
indydevtools
An opinionated, Agentic Engineering toolbox powered by LLM Agents to solve problems autonomously.
lark-grammars
Grammars suitable for lark parser and Hypothesis
tabbyAPI-gradio-loader
A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.
ST-tabbyAPI-loader
Loader extension for tabbyAPI in SillyTavern
ReeditShipManagement
Broad ship management solution tailor made for Draconis Expanse.
SE-ModDebugger
Modifies the Space Engineers IL Checker to aid in the debugging of mods
wolfram-model-variety
Codes to investigate Leibnizian ideas in Wolfram Model.