Juan Elenter's repositories

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

ActiveWireless

Active Learning in wireless networks

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

site

Personal Website

Language:HTMLStargazers:0Issues:1Issues:0

cooper

A general-purpose, deep learning-first library for constrained optimization in PyTorch

License:MITStargazers:0Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

feas-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License:Apache-2.0Stargazers:0Issues:0Issues:0

isps

Looking for Interpolating, Stochastic Polyak Step Size Optimizer

Language:PythonStargazers:0Issues:0Issues:0

juanelenter

Config files for my GitHub profile.

Stargazers:0Issues:0Issues:0

LAMDA-PILOT

🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

minRLHF

A (somewhat) minimal library for finetuning language models with PPO on human feedback.

Language:PythonStargazers:0Issues:0Issues:0

mseqgen

Multi task batch generator for training deep learning models on CHIP-seq, CHIP-exo, CHIP-nexus, ATAC-seq, RNA-seq (or any other -seq)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

nearopt

Near Optimal Solutions of Constrained Learning Problems

Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:1Issues:0

PolyakInterpolation

Empirical analysis of Interpolation with Polyak Stepsizes

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

raive

Stable diffusion for real-time music generation

License:MITStargazers:0Issues:0Issues:0

WirelessAlly

Active sampling of network configurations for GNNs

Language:PythonStargazers:0Issues:0Issues:0