Commit e544360: Push model using huggingface_hub.
{
  "auxk": 256,
  "dead_steps_threshold": 76,
  "dead_threshold": 0.001,
  "k": 32,
  "n_inputs": 768,
  "n_latents": 49152,
  "standardize": false
}
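
The following is a minimal sketch of how this config might be loaded and sanity-checked in Python. The class name `SaeConfig` and the two derived properties are hypothetical and not part of the repository. The reading of the fields is also an assumption based on common naming for top-k sparse autoencoders (`k` as the number of active latents per input, `n_latents` as the dictionary size, `n_inputs` as the input width); the source does not confirm it.

```python
import json
from dataclasses import dataclass

# The config values are copied verbatim from the file above.
CONFIG_JSON = """
{
  "auxk": 256,
  "dead_steps_threshold": 76,
  "dead_threshold": 0.001,
  "k": 32,
  "n_inputs": 768,
  "n_latents": 49152,
  "standardize": false
}
"""


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical typed container for the config fields."""

    auxk: int
    dead_steps_threshold: int
    dead_threshold: float
    k: int
    n_inputs: int
    n_latents: int
    standardize: bool

    @property
    def expansion_factor(self) -> float:
        # Ratio of latent width to input width (assumed interpretation).
        return self.n_latents / self.n_inputs

    @property
    def active_fraction(self) -> float:
        # Fraction of latents kept per input, assuming k is a top-k count.
        return self.k / self.n_latents


# json.loads maps the JSON literal false to Python's False.
config = SaeConfig(**json.loads(CONFIG_JSON))

print(config.expansion_factor)  # 64.0
print(config.active_fraction)
```

Under these assumptions the latent width is 64 times the input width, and 32 of the 49152 latents are kept per input. Unpacking with `**` raises a `TypeError` if the file gains or loses a key, which serves as a cheap schema check.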