chandar-lab

chandar-lab

Geek Repo

Github PK Tool:Github PK Tool

chandar-lab's repositories

Language:PythonLicense:MITStargazers:100Issues:9Issues:96

Recall2Imagine

Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024

Language:PythonLicense:MITStargazers:44Issues:7Issues:10
Language:PythonLicense:MITStargazers:38Issues:6Issues:2

IIRC

IIRC: Incremental Implicitly Refined Classification

Language:PythonLicense:MITStargazers:31Issues:3Issues:1

Lifelong-Hanabi

A Continual Multi-agent RL testbed based on Hanabi

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:30Issues:4Issues:2
Language:PythonLicense:MITStargazers:6Issues:4Issues:3
Language:PythonLicense:MITStargazers:5Issues:5Issues:1
Language:PythonLicense:MITStargazers:5Issues:4Issues:0

CriticalGradientOptimization

Critical Gradient Optimization.

Language:Jupyter NotebookLicense:MITStargazers:4Issues:0Issues:0

EpiK-Eval

Benchmark to evaluate the capability of language models to consolidate and recall information from multiple training documents.

Language:PythonLicense:MITStargazers:4Issues:3Issues:0
Language:PythonLicense:MITStargazers:3Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

healthy-data-diet

Reduce gender bias in machine learning models.

Language:PythonLicense:MITStargazers:2Issues:0Issues:0
Language:JavaStargazers:2Issues:1Issues:0

tgi-for-mila

A toolkit for running text-generation-inference on Mila and Compute Canada

Language:ShellLicense:MITStargazers:2Issues:1Issues:3
Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

crystal-design

Reinforcement Learning for Crystal Structure Design

Language:PythonStargazers:1Issues:0Issues:0

FASP

We study the effect of attention head pruning on fairness in large language models

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

SubGoal_Distillation_LLM

Code for the paper Sub-goal Distillation: A Method to Improve Small Language Agents

Stargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Lookbehind-SAM

Implementation of Lookbehind-SAM: k steps back, 1 step forward (ICML 2024)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0