codrutalugoj's starred repositories

HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Language:Jupyter NotebookLicense:MITStargazers:205Issues:0Issues:0

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:1982Issues:0Issues:0
Language:PythonLicense:MITStargazers:211Issues:0Issues:0

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:2129Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonLicense:MITStargazers:88310Issues:0Issues:0

rome

Locating and editing factual associations in GPT (NeurIPS 2022)

Language:PythonLicense:MITStargazers:517Issues:0Issues:0

communitynotes

Documentation and source code powering Twitter's Community Notes

Language:PythonLicense:Apache-2.0Stargazers:1373Issues:0Issues:0

ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

Language:HTMLStargazers:168Issues:0Issues:0

RLAlgorithms

Reinforcement learning algorithms, produced mostly or entirely from scratch.

Language:Jupyter NotebookStargazers:3Issues:0Issues:0
License:CC-BY-4.0Stargazers:214Issues:0Issues:0

courses

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

Language:PythonStargazers:5033Issues:0Issues:0

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

Stargazers:9297Issues:0Issues:0
Language:PythonLicense:MITStargazers:19623Issues:0Issues:0

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonLicense:MITStargazers:1154Issues:0Issues:0

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookLicense:MITStargazers:2293Issues:0Issues:0

manim

Animation engine for explanatory math videos

Language:PythonLicense:MITStargazers:60175Issues:0Issues:0

varibad

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

Language:PythonLicense:NOASSERTIONStargazers:175Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:13Issues:0Issues:0