Lorenz Wolf (lorenzflow)

lorenzflow

Geek Repo

Company:University College London

Location:London

Home Page:https://lorenz-wolf.netlify.app

Github PK Tool:Github PK Tool

Lorenz Wolf's starred repositories

Kernel-Functional-Data

Jupyter Notebook of code used in the numerics for the paper "A Kernel Two-Sample Test for Functional Data" by George Wynne and Andrew B. Duncan.

Language:Jupyter NotebookStargazers:7Issues:0Issues:0
License:Apache-2.0Stargazers:7Issues:0Issues:0

level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

Language:PythonLicense:NOASSERTIONStargazers:82Issues:0Issues:0

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language:PythonLicense:Apache-2.0Stargazers:1485Issues:0Issues:0

llm_optimization

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language:Jupyter NotebookLicense:MITStargazers:483Issues:0Issues:0

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonLicense:Apache-2.0Stargazers:3665Issues:0Issues:0

bandits

Bayesian Bandits

Language:Jupyter NotebookLicense:MITStargazers:63Issues:0Issues:0

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonLicense:Apache-2.0Stargazers:5096Issues:0Issues:0

OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonLicense:Apache-2.0Stargazers:3810Issues:0Issues:0

D4RL

A collection of reference environments for offline reinforcement learning

Language:PythonLicense:Apache-2.0Stargazers:1257Issues:0Issues:0

TextWorld

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1173Issues:0Issues:0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:727Issues:0Issues:0

rap-rank-reconstruction

Code for reproducing https://arxiv.org/abs/2211.03128

Language:PythonStargazers:10Issues:0Issues:0

deep-Q-networks

Implementations of algorithms from the Q-learning family. Implementations inlcude: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51

Language:Jupyter NotebookStargazers:263Issues:0Issues:0

ml_collections

ML Collections is a library of Python Collections designed for ML use cases.

Language:PythonLicense:Apache-2.0Stargazers:868Issues:0Issues:0

xmanager

A platform for managing machine learning experiments

Language:PythonLicense:Apache-2.0Stargazers:812Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:15541Issues:0Issues:0

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:PythonLicense:MITStargazers:389Issues:0Issues:0

modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:211Issues:0Issues:0

hands-on-rl

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Language:Jupyter NotebookLicense:MITStargazers:1033Issues:0Issues:0

awesome-mlss

🤖 Machine Learning Summer School deadlines

Language:HTMLLicense:MITStargazers:2634Issues:0Issues:0

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5910Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35267Issues:0Issues:0
Language:PythonLicense:MITStargazers:141Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:9548Issues:0Issues:0

rl-starter-files

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code

Language:PythonLicense:MITStargazers:638Issues:0Issues:0

commonsense-rl

Knowledge-Aware RL agents with Commonsense Reasoning

Language:Inform 7License:Apache-2.0Stargazers:75Issues:0Issues:0

TFDMNet

Learning Convolutional Neural Networks in the Frequency Domain

Language:PythonStargazers:11Issues:0Issues:0

DIAYN-PyTorch

Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.

Language:PythonLicense:MITStargazers:56Issues:0Issues:0