juanelenter

Juan Elenter's repositories

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | 1 0 0

ActiveWireless

Active Learning in wireless networks

Language: Python | 0 0 0

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language: Python | License: MIT | 0 0 0

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Language: Python | License: MIT | 0 0 0

cooper

A general-purpose, deep learning-first library for constrained optimization in PyTorch

License: MIT | 0 0 0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

License: Apache-2.0 | 0 0 0

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License: Apache-2.0 | 0 0 0