%0 Computer Program
%A Altmeyer, Patrick
%A Liem, C.C.S. (Cynthia)
%A Demetriou, Andrew
%A Bartlett, Antony
%D 2025
%T Software and data underlying the publication: "Position: Stop Making Unscientific AGI Performance Claims"
%U
%R 10.4121/d427d182-4bb0-4972-980c-adcb28f430b6.v1
%K Artificial Intelligence
%K Artificial General Intelligence
%K Mechanistic Interpretability
%K Interpretability
%X
Code and research results for the ICML 2024 position paper. Originally released at https://github.com/pat-alt/spurious_sentience.
The research results include:
- Regression tables (.tex; .html)
- An "evaluations.csv" file containing estimated evaluation metrics for the linear probes and the baseline, grouped by indicator, layer (network layer), train/test split, variable (measure), and model (linear probe or baseline).
- A figures/ folder containing all PNG figures that appear in a) the body or b) the appendix of the paper.
- An interim/ folder containing results for probe predictions for each training epoch.
- An attacks/ folder containing CSV files of neural network activations for attack prompts (see the paper for details), plus a sentences/ subfolder with the textual attack prompts themselves (.txt files).
%I 4TU.ResearchData