kgourgou

Kosti's repositories

machine-learning-zettelkasten

Insert backlinks to markdown documents by using sklearn and cosine similarity.

Language:PythonMIT8 20

kgourgou

MIT2 20

mlx-examples

Examples in the MLX framework

Language:PythonMIT1 10

raggler

Simple, local-first, retrieval-augmented generation without tears (or API keys). Powered by MLX and AlphaMonarch.

Language:PythonMIT1 20

Linear-Token-Predictor

This is a reproduction of the model used in Malach, E., 2023. Auto-regressive next-token predictors are universal learners. arXiv preprint arXiv:2309.06979, Section 4.1.

Language:Jupyter NotebookMIT010

Things-AI

An experiment in having LLM support for the Things 3 todo app.

Language:PythonMIT000

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.0010

curator

Apache-2.0000

DAM

Language:Python000

diff_history

[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)

Language:PythonMIT010

diffuse-distributions

Forcing Diffuse Distributions out of Language Models

Language:PythonMIT000

DSPy-Text2SQL

DSPY on action with OpenSource LLMs.

Language:Jupyter Notebook000

entropix

Entropy Based Sampling and Parallel CoT Decoding

Language:PythonApache-2.0000

folio-jekyll

Language:HTMLMIT020

LESS

Preprint: Less: Selecting Influential Data for Targeted Instruction Tuning

Language:Jupyter NotebookMIT000

llm-autoeval

Automatically evaluate your LLMs in Google Colab

Language:PythonMIT000

LLMRank

PageRank for LLMs

Apache-2.0000

mamba.py

An efficient Mamba implementation in PyTorch and MLX.

Language:Python010

mlx

MLX: An array framework for Apple silicon

Language:C++MIT010

neural_net_checklist

Language:PythonMIT000

outlines

Structured Text Generation

Language:PythonApache-2.0000

pdf-renamer-server

A python tool to automatically rename the pdf files of scientific publications by looking up the publication metadata on the web.

Language:Python010

posteriors

Uncertainty quantification with PyTorch

Language:PythonApache-2.0000

rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

MIT000

SAE-based-representation-engineering

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

MIT000

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

MIT000

struct_gen_utils

Language:MakefileNOASSERTION010

thermox

Exact OU processes with JAX

Language:PythonApache-2.0000

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonMIT000

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonGPL-3.0010