Beast code in Giters

codrutalugoj's starred repositories

HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Language:Jupyter NotebookMIT20500

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonApache-2.0198200

MLAgentBench

Language:PythonMIT21100

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonMIT212900

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonMIT8831000

rome

Locating and editing factual associations in GPT (NeurIPS 2022)

Language:PythonMIT51700

communitynotes

Documentation and source code powering Twitter's Community Notes

Language:PythonApache-2.0137300

ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

Language:HTML16800

RLAlgorithms

Reinforcement learning algorithms, produced mostly or entirely from scratch.

Language:Jupyter Notebook300

evals

CC-BY-4.021400

courses

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

Language:Python503300

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

929700

babyagi

Language:PythonMIT1962300

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonMIT115400

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookMIT229300

manim

Animation engine for explanatory math videos

Language:PythonMIT6017500

varibad

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

Language:PythonNOASSERTION17500

hyperx

Language:PythonNOASSERTION1300