QuantaMaths addition models
Collection
Transformer models that predict answers to addition questions
•
19 items
•
Updated
add_d8_l2_h3_t45K_s173289
This repository contains a transformer model that can predict addition questions.
Contents:
model.pth
: The trained transformer model.training_loss.json
: Data gathered during model training (used to plot "loss over training batches").behaviors.json
: Facts gathered about the model by direct inspection (attention pattern data, PCA data, digit impact data, etc.).features.json
: Facts gathered about hypothesized algorithm features via experimentation, e.g. node P12L0H1 implements the feature A3.ST.Provenance:
model.pth
and training_loss.json
were created by QuantaMathsTrain.ipynb.behaviors.json
and features.json
were created by QuantaMathsAnalyse.ipynb.Folder name details: