# cls_finred_phi3_v1
This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on the generator dataset.
It achieves the following results on the evaluation set:
- Loss: 0.4678
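
Below is a minimal inference sketch, assuming the checkpoint is a PEFT adapter on top of the base model (the framework versions listed further down include PEFT 0.11.1). The repo id `Sorour/cls_finred_phi3_v1` is taken from this card; the prompt is a placeholder, and the rest is illustrative rather than the author's actual usage code.

```python
# Hedged example: load the base Phi-3 model, apply this repo's PEFT adapter,
# and generate a completion. Requires torch, transformers, peft, accelerate.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "microsoft/Phi-3-mini-4k-instruct"
adapter_id = "Sorour/cls_finred_phi3_v1"  # this repository

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(
    base_id,
    torch_dtype=torch.bfloat16,  # dtype assumption; the card does not specify
    device_map="auto",           # requires accelerate
)
model = PeftModel.from_pretrained(model, adapter_id)
model.eval()

prompt = "<placeholder task prompt>"  # the expected prompt format is not documented here
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```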
## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed
## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a mapping to `TrainingArguments` is sketched after the list):
- learning_rate: 0.0002
- train_batch_size: 2
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: constant
- lr_scheduler_warmup_ratio: 0.03
- num_epochs: 2
- mixed_precision_training: Native AMP
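
As a hedged sketch (not the author's actual training script), these values map onto `transformers.TrainingArguments` roughly as follows; the `output_dir` value is an assumption:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="cls_finred_phi3_v1",   # assumed output path
    learning_rate=2e-4,
    per_device_train_batch_size=2,
    per_device_eval_batch_size=8,
    seed=42,
    gradient_accumulation_steps=4,      # effective train batch size: 2 * 4 = 8
    lr_scheduler_type="constant",       # note: a constant schedule ignores warmup_ratio
    warmup_ratio=0.03,
    num_train_epochs=2,
    fp16=True,                          # "Native AMP"; fp16 vs bf16 is not stated, fp16 assumed
    # Adam betas (0.9, 0.999) and epsilon 1e-08 are the TrainingArguments defaults.
)
```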
### Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 0.8648        | 0.1046 | 20   | 0.7698          |
| 0.702         | 0.2092 | 40   | 0.6944          |
| 0.7014        | 0.3137 | 60   | 0.6538          |
| 0.6379        | 0.4183 | 80   | 0.6318          |
| 0.6547        | 0.5229 | 100  | 0.6054          |
| 0.6044        | 0.6275 | 120  | 0.5934          |
| 0.597         | 0.7320 | 140  | 0.5759          |
| 0.5779        | 0.8366 | 160  | 0.5620          |
| 0.5244        | 0.9412 | 180  | 0.5530          |
| 0.477         | 1.0458 | 200  | 0.5505          |
| 0.4465        | 1.1503 | 220  | 0.5407          |
| 0.4669        | 1.2549 | 240  | 0.5303          |
| 0.5193        | 1.3595 | 260  | 0.5250          |
| 0.4398        | 1.4641 | 280  | 0.5169          |
| 0.4571        | 1.5686 | 300  | 0.5071          |
| 0.4175        | 1.6732 | 320  | 0.4953          |
| 0.4037        | 1.7778 | 340  | 0.4850          |
| 0.4418        | 1.8824 | 360  | 0.4764          |
| 0.4136        | 1.9869 | 380  | 0.4678          |
### Framework versions
- PEFT 0.11.1
- Transformers 4.41.1
- Pytorch 2.3.0+cu121
- Datasets 2.19.1
- Tokenizers 0.19.1