EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.

EleutherAI/elk Issues

pip install from git doesn't work
Updated 4 months ago
Make into a `pytest.fixture` which was a TODO in `test_smoke_eval.py`
Updated 5 months ago
Evaluate LM output performance in `elk eval`
Closed 10 months ago
--max_examples seems to be broken
Closed a year ago (2 comments)
Use shrinkage for (cross-)covariance estimation
Updated a year ago (3 comments)
Cannot run README command `elk elicit microsoft/deberta-v2-xxlarge-mnli imdb`
Updated a year ago (1 comment)
Encoder-only models do not expect `labels` argument to be passed to forward
Closed a year ago (2 comments)
Write smoke test for eval
Updated a year ago
Support Neel Nanda's counterfact dataset
Updated a year ago (3 comments)
Can't run elicit and sweep; process is stuck with message: "Waiting for x GPUs with at least X GB of free memory. 0 GPUs currently available."
Closed a year ago (2 comments)
Smoke tests for elk eval command
Closed a year ago (4 comments)
Use dill and Apache Arrow directly for caching & storing hidden states
Updated a year ago (1 comment)
Install is broken with python 3.11
Closed a year ago
Can't save datasets with elk extract
Closed a year ago (5 comments)
No `max_gpus` argument for `elk extract` command
Closed a year ago (1 comment)
Hyperparameter sweeps with Optuna or Ray Tune
Closed a year ago (1 comment)
Story cloze doesn't work anymore
Closed a year ago (4 comments)
Bootstrap CIs for AUROC metrics
Closed a year ago (1 comment)
Add transfer eval to sweep
Closed a year ago (1 comment)
Add way of specifying nondefault hparams that are shared across sweep
Closed a year ago (1 comment)
Multiple choice datasets encounter a tensor error
Updated a year ago
Add CLI flag to not use cached hiddens
Closed a year ago
Save hidden states in bfloat16
Updated a year ago
Combine prompts to evaluate multilingual prompt invariance
Closed a year ago
simplify eval command syntax
Closed a year ago (1 comment)
Make normalization a property of the `Reporter`
Closed a year ago
Better error messages when a worker crashes during extraction
Closed a year ago (1 comment)
Weights & Biases integration
Closed a year ago (2 comments)
Evaluate models' zero-shot accuracy
Closed a year ago (1 comment)
Hidden state cache shouldn't be invalidated when we're using different devices
Closed a year ago
Add LR stats to transfer eval
Closed a year ago
save elk eval runs separately
Closed a year ago
Support exponential moving averages for the covariance statistics on EigenReporter
Updated a year ago
Switch to IterableDataset
Closed a year ago
Switch to ruff for linting
Closed a year ago
Allow min_memory to be passed by CLI
Closed a year ago (1 comment)
train_reporter should return a dataclass of stats instead of a list
Closed a year ago
Make num_heads property on EigenReporter accessible via CLI
Updated a year ago
Hello World
Closed a year ago
Add pre-computed hidden states to Git LFS and use them in unit tests
Closed a year ago
Allow saving multiple reporters with diff. hyperparameters for the same hidden states
Closed a year ago
Support regularization on classifier
Closed a year ago (1 comment)
Use `Reporter` for the supervised baseline, not sklearn `LogisticRegression`
Closed a year ago
Add automatic type checking to the linting pipeline
Closed a year ago (1 comment)
Support few-shot prompts
Closed a year ago (1 comment)
Add docstrings to all user-facing APIs
Closed a year ago (1 comment)
Create an "elk list" command to show info about all cached runs
Closed a year ago
Error when max examples larger than examples in ds
Closed a year ago (1 comment)
Plotting for individual predictions across depth
Updated a year ago
Rename "CCS" class to "Reporter"
Closed a year ago