viktor-ferenczi

Viktor Ferenczi's starred repositories

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Language:SvelteMIT30690 169 1474

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT19133 297 1332

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language:PythonMIT18958 271 310

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language:PythonMIT17070 204 612

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonApache-2.014571 127 3343

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonMIT8719 81 35

CopilotKit

A framework for building custom AI Copilots 🤖 in-app AI chatbots, in-app AI Agents, & AI-powered Textareas.

Language:TypeScriptMIT7718 61 93

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.07422 85 1562

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT5970 37 849

lark

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Language:PythonMIT4618 59 881

moondream

tiny vision language model

Language:Jupyter NotebookApache-2.04504 50 93

promptfoo

Test your prompts, agents, and RAGs. Use LLM evals to improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

Language:TypeScriptMIT3564 18 483

gprof2dot

Converts profiling output to a dot graph.

Language:PythonLGPL-3.03145 79 57

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Language:PythonApache-2.02806 30 286

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonApache-2.02067 22 169

parsimonious

The fastest pure-Python PEG parser I can muster

Language:PythonMIT1788 42 162

Awesome-GPT-Store

Custom GPT Store - A collection of major GPTS available in public

MIT1415 23 196

ATLAS

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Language:PythonApache-2.0866 21 7