Robert Kirk's repositories
tinystories-wrappers
Code for the TinyStories experiments from "Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks".
roam-tools
A small but growing collection of tools for Roam Research
Graph-Comonads-from-Pebble-Games
Master's thesis code: Implementing Game Comonads in Finite Model Theory using Dependent Types in Idris
roam-solarized-theme
A strict solarized Roam Research theme
client
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
DeepRLAlgos
A collection of my own implementations of various deep RL algorithms
phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
check_pdb_hook
Pre-commit hook to check for exposed PDB statements in Python files
dmcontrol-generalization-benchmark
DMControl Generalization Benchmark
dmenu
My personal dmenu fork
dwm
My personal fork of dwm
homebrew-neovim-nightly
Homebrew Cask tap for nightly neovim
marge-bot
A merge-bot for GitLab
nle
The NetHack Learning Environment
rlfh-gen-div
Code for most of the experiments in the paper "Understanding the Effects of RLHF on LLM Generalisation and Diversity"
RobertKirk.github.io
Personal blog
scholar-alert-digest
Aggregate unread emails from Google Scholar alerts
st
My fork of Simple terminal, with some patches and colours applied.
surfingkeys-conf
A SurfingKeys configuration which adds 200+ key mappings for 17+ unique sites and OmniBar search suggestions for 45+ sites
trlx
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
voyager
🚀 Secure HAProxy Ingress Controller for Kubernetes