Patrick Haller's starred repositories
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
llama2.mojo
Inference Llama 2 in one file of pure 🔥
lm-format-enforcer
Enforce the output format (JSON Schema, Regex, etc.) of a language model
dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
lsp-timeout.nvim
Start/stop LSP servers upon demand; keeps RAM usage low
fabricator
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
yet-another-retnet
A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (https://arxiv.org/pdf/2307.08621.pdf)
Vocabulary-Transfer
Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" (https://arxiv.org/pdf/2112.14569.pdf)