Simon Lermen (DalasNoin)



Location: the world

Home Page: simonlermen.github.io

Twitter: @SimonLermenAI


Simon Lermen's repositories

redteaming

Red-teaming a simple language model such as GPT-2, based on Anthropic's red-teaming paper

Language: Python, Stargazers: 7, Issues: 3, Issues: 0

exploring_modelgraded_evaluation

Exploring model-graded evaluation

Language: TeX, Stargazers: 4, Issues: 2, Issues: 0

arena-ldn

London in-person exercises

Language: Jupyter Notebook, Stargazers: 2, Issues: 1, Issues: 0

gpt-tools

Tools for the OpenAI API

Language: Python, Stargazers: 1, Issues: 2, Issues: 0

safety_benchmarks

Safety Benchmarks such as Refusal Bench

SVDInterpretTransformer

Apply SVD to Transformer weights

Language: Jupyter Notebook, Stargazers: 1, Issues: 2, Issues: 0
Language: Python, Stargazers: 0, Issues: 2, Issues: 0

ACER

Actor-critic with experience replay

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language: HTML, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

arena

My solutions for the arena course

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

PySvelte

A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations

Language: HTML, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0

DecisionTransformerInterpretability

Interpreting how transformers simulate agents performing RL tasks

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 2, Issues: 0

GPTQ-for-LLaMa

4-bit quantization of LLaMA using GPTQ

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

LM-exp

LLM experiments done during SERI MATS, focusing on activation steering and interpreting activation spaces

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

mlab

Machine Learning for Alignment Bootcamp

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

MLAB-Transformers-From-Scratch

Reimplementing transformers from scratch (from Redwood Research's Machine Learning for Alignment Bootcamp).

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

python-binance

Binance Exchange API python implementation for automated trading

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

reference_chatbot

In-Context Retrieval-Augmented Language Models AI21labs Implementation

Stargazers: 0, Issues: 1, Issues: 0

refusal_direction

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0

simple-llama-finetuner

Simple UI for LLaMA Model Finetuning

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

weblm

Drive a browser with a language model

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0