hppanev's starred repositories
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Megatron-LM
Ongoing research training transformer models at scale
text-generation-inference
Large Language Model Text Generation Inference
SillyTavern
LLM Frontend for Power Users.
cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
secret-llama
Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.
pywinassistant
The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.
llm-reasoners
A library for advanced large language model reasoning
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
llamaduo
This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM. For this project, we have initially chosen Gemini 1.0 Pro for service type LLM and Gemma 2B/7B for small sized LLM model. It now supports other service LLMs such as GPT4 and Claude3.
reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
Retrieval_Head
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly"
NeoSapiens
The next evolution of Agents