distily_smollm_dataset_sweep / benchmarks.shelve.dir
lapp0's picture
Training in progress, step 5000
605ccf7 verified
raw
history blame
222 Bytes
'logs/teacher', (0, 448)
'distily_smollm_dataset_sweep/logs/dataset_max_seq_length=1024, dataset_sample_size=1000000, dataset_subset=20231101.en, dataset_uri=wikimedia_wikipedia, per_device_train_batch_size=8', (512, 448)