sheikheddy

User data from Github https://github.com/sheikheddy

followers

following

stars

Dubai, UAE

Sheikh Abdur Raheem Ali's repositories

AEP_OOD_evaluation

MIT000

ARENA_3.0

Language:Jupyter Notebook000

CAA

Steering Llama 2 with Contrastive Activation Addition

Language:Jupyter NotebookMIT000

base-models-refuse

Code to reproduce key results accompanying "Base LLMs refuse too"

Language:Python000

claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

NOASSERTION000

crosscoder-model-diff-replication

Open source replication of Anthropic's Crosscoders for Model Diffing

000

devinterp

Quantifying degeneracy in toy models

Language:Python000

dictionary_learning

Language:Python000

dotfiles

my personal terminal configurations for alignment research engineering

000

fae

(jax, tpu) nf4 matmuls for flux + t5 + onnx vae. vision SAE training and maxacts

Language:PythonMIT000

flux-saes-gpu

000

guarantees-based-mechanistic-interpretability

Language:Jupyter NotebookMIT000

hoyolab-auto-daily

Easiest, full free, and no BS Hoyolab daily check-in using GitHub Actions. Supports Zenless Zone Zero, Honkai: Star Rail, Genshin Impact, Honkai Impact 3rd, and Tears of Themis.

MIT000

Language-Model-SAEs

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.

Language:Jupyter Notebook000

llm-viz

3D Visualization of an GPT-style LLM

000

marc

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Language:PythonMIT000

mats

Language:PythonMIT000

motion-canvas

Visualize Your Ideas With Code

Language:TypeScriptMIT000

sache

000

sae

Sparse autoencoders

MIT000

sae-rm

Using SAE's to interpret Reward Models (RM)

000

sae-topk-by-abs

010

SAE-TS

Improving Steering Vectors by Targeting Sparse Autoencoder Features

Language:PythonMIT000

sae_eval

Language:Jupyter NotebookMIT000

sapiens

High-resolution models for human tasks.

Language:PythonNOASSERTION000

semantic-router

Superfast AI decision making and intelligent processing of multi-modal data.

Language:PythonMIT000

sheikheddy.github.io

My personal website

Language:HTML000

slt

Tools for studying developmental interpretability in neural networks.

Language:Python000

SwitchTransformers

Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"

MIT000

unit

Next Generation Visual Programming System

MIT000