RobertKirk

Robert Kirk's repositories

tinystories-wrappers

Code for the TinyStories experiments from "Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks".

Language: Jupyter Notebook | MIT | 5 2 2

roam-tools

A small but growing collection of tools for Roam Research

Language: Python | MIT | 4 20

Graph-Comonads-from-Pebble-Games

Master's thesis code: implementing game comonads in finite model theory using dependent types in Idris

Language: Idris | 300

dotfiles

A collection of personal scripts, aliases and the like from my personal software engineering practice

Language: Vim script | 200

roam-solarized-theme

A strict solarized Roam Research theme

Language: CSS | MIT | 2 10

tmux-ram

Plug and play RAM percentage and icon indicator for Tmux

Language: Shell | MIT | 100

client

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language: Python | 000

DeepRLAlgos

A collection of my own implementations of a variety of DeepRL Algorithms

Language: Jupyter Notebook | 000

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language: Python | MIT | 000

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | Apache-2.0 | 000

check_pdb_hook

Pre-commit hook to check for exposed PDB statements in Python files

Language: Python | MIT | 000

dmcontrol-generalization-benchmark

DMControl Generalization Benchmark

Language: Python | MIT | 000

dmenu

My personal dmenu fork

Language: C | MIT | 000

dwm

My personal fork of dwm

Language: C | MIT | 000

homebrew-neovim-nightly

Homebrew Cask tap for nightly neovim

Language: Ruby | 000

marge-bot

A merge-bot for GitLab

Language: Python | BSD-3-Clause | 000

nle

The NetHack Learning Environment

Language: C | NOASSERTION | 000

rlfh-gen-div

Code for most of the experiments in the paper "Understanding the Effects of RLHF on LLM Generalisation and Diversity".

NOASSERTION | 000

RobertKirk.github.io

Personal blog

Language: SCSS | 010

RSSPlaylister

Language: TypeScript | 000

scholar-alert-digest

Aggregate unread emails from Google Scholar alerts

Language: Go | Apache-2.0 | 000

st

My fork of Simple terminal, with some patches and colours applied.

Language: C | MIT | 000

surfingkeys-conf

A SurfingKeys configuration which adds 200+ key mappings for 17+ unique sites and OmniBar search suggestions for 45+ sites

Language: JavaScript | MIT | 000

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | MIT | 000

voyager

🚀 Secure HAProxy Ingress Controller for Kubernetes

Language: Go | Apache-2.0 | 000

weak-to-strong

MIT | 000