Robert Kirk (RobertKirk)

Company: @ucl-dark

Location: London

Home Page: https://robertkirk.github.io/

Twitter: @_robertkirk

Robert Kirk's repositories

tinystories-wrappers

Code for the TinyStories experiments from "Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks".

Language: Jupyter Notebook, License: MIT, Stargazers: 5, Issues: 2, Issues: 2

roam-tools

A small but growing collection of tools for Roam Research

Language: Python, License: MIT, Stargazers: 4, Issues: 2, Issues: 0

Graph-Comonads-from-Pebble-Games

Master's thesis code: Implementing Game Comonads in Finite Model Theory using Dependent Types in Idris

Language: Idris, Stargazers: 3, Issues: 0, Issues: 0

dotfiles

A collection of personal scripts, aliases and the like from my personal software engineering practice

Language: Vim script, Stargazers: 2, Issues: 0, Issues: 0

roam-solarized-theme

A strict solarized Roam Research theme

Language: CSS, License: MIT, Stargazers: 2, Issues: 1, Issues: 0

tmux-ram

Plug and play RAM percentage and icon indicator for Tmux

Language: Shell, License: MIT, Stargazers: 1, Issues: 0, Issues: 0

client

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

DeepRLAlgos

A collection of my own implementations of a variety of DeepRL Algorithms

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

check_pdb_hook

Pre-commit hook to check for exposed PDB statements in Python files

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

dmcontrol-generalization-benchmark

DMControl Generalization Benchmark

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

dmenu

My personal dmenu fork

Language: C, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

dwm

My personal fork of dwm

Language: C, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

homebrew-neovim-nightly

Homebrew Cask tap for nightly neovim

Language: Ruby, Stargazers: 0, Issues: 0, Issues: 0

marge-bot

A merge-bot for GitLab

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

nle

The NetHack Learning Environment

Language: C, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

rlfh-gen-div

Code for most of the experiments in the paper "Understanding the Effects of RLHF on LLM Generalisation and Diversity"

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0
Language: SCSS, Stargazers: 0, Issues: 1, Issues: 0
Language: TypeScript, Stargazers: 0, Issues: 0, Issues: 0

scholar-alert-digest

Aggregate unread emails from Google Scholar alerts

Language: Go, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

st

My fork of Simple terminal, with some patches and colours applied.

Language: C, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

surfingkeys-conf

A SurfingKeys configuration which adds 200+ key mappings for 17+ unique sites and OmniBar search suggestions for 45+ sites

Language: JavaScript, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

voyager

🚀 Secure HAProxy Ingress Controller for Kubernetes

Language: Go, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
License: MIT, Stargazers: 0, Issues: 0, Issues: 0