sft-zephyr-7b-beta-vi-math-v1 / train_results.json
hllj: Model save (d4a23ae)
{
  "epoch": 2.5,
  "train_loss": 0.4085692544042328,
  "train_runtime": 1490.1076,
  "train_samples": 1076,
  "train_samples_per_second": 2.166,
  "train_steps_per_second": 0.542
}
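
The throughput fields above can be combined into a few derived quantities. The following is a minimal sketch, not part of the original file: it assumes `train_runtime` is in seconds and that both rate fields are averages over that whole runtime. The helper name `derive` and the output keys are hypothetical, chosen only for illustration.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
  "epoch": 2.5,
  "train_loss": 0.4085692544042328,
  "train_runtime": 1490.1076,
  "train_samples": 1076,
  "train_samples_per_second": 2.166,
  "train_steps_per_second": 0.542
}
"""


def derive(metrics):
    """Derive totals from the reported rates (hypothetical helper).

    Assumption: the two rates are averages over the full train_runtime,
    so rate * runtime recovers the total the rate was computed from.
    """
    runtime = metrics["train_runtime"]
    samples = metrics["train_samples_per_second"] * runtime
    steps = metrics["train_steps_per_second"] * runtime
    return {
        "samples_processed": samples,
        "optimizer_steps": steps,
        # Ratio of the two rates: average samples consumed per step.
        "samples_per_step": samples / steps,
        # How many times the rates imply the dataset was traversed.
        "passes_over_data": samples / metrics["train_samples"],
    }


derived = derive(json.loads(RAW))
for name, value in derived.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the two rates imply roughly 4 samples per step. The implied number of passes over the data need not equal the reported `epoch` value, since the file does not say which sample count the rates were computed from.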