collapse_gemma-2-2b_hs2_accumulatesubsample_iter14_sftsd1

This model is a fine-tuned version of google/gemma-2-2b on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Input Tokens Seen
No log	0	0	1.3909	0
1.395	0.0528	5	1.2762	263616
1.025	0.1057	10	1.2032	522192
0.9043	0.1585	15	1.2117	781320
0.7869	0.2114	20	1.2530	1049592
0.7435	0.2642	25	1.2754	1306912
0.613	0.3170	30	1.2796	1574208
0.5984	0.3699	35	1.2686	1838000
0.5522	0.4227	40	1.2501	2102768
0.3169	0.4756	45	1.2364	2356712
0.4495	0.5284	50	1.2186	2617488
0.3906	0.5812	55	1.2323	2878520
0.3294	0.6341	60	1.2076	3138624
0.4019	0.6869	65	1.2202	3399776
0.3896	0.7398	70	1.2076	3658416
0.3273	0.7926	75	1.2138	3928424
0.3961	0.8454	80	1.2004	4193672
0.3151	0.8983	85	1.2016	4458168
0.3865	0.9511	90	1.1975	4728256