yiran-wang3 / ds_chat_no_mask_sppo_hard_new_iter0_reproduce

alignment-handbook

Generated from Trainer

ds_chat_no_mask_sppo_hard_new_iter0_reproduce / generation_config.json

Model save

789baa0 verified about 2 months ago

181 Bytes

{
  "_from_model_config": true,
  "bos_token_id": 100000,
  "do_sample": true,
  "eos_token_id": 100001,
  "temperature": 0.7,
  "top_p": 0.95,
  "transformers_version": "4.42.0"
}
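
The config above enables sampling (`do_sample`) with `temperature` 0.7 and `top_p` 0.95. As a rough illustration of what those two values do to a next-token distribution, here is a minimal standard-library sketch of temperature scaling followed by nucleus (top-p) filtering. It is a simplified approximation, not the transformers implementation; the helper `top_p_filter` and the `toy_logits` values are hypothetical and chosen only for this example.

```python
import json
import math

# The generation config from this file, inlined so the sketch is self-contained.
CONFIG_TEXT = """
{
  "_from_model_config": true,
  "bos_token_id": 100000,
  "do_sample": true,
  "eos_token_id": 100001,
  "temperature": 0.7,
  "top_p": 0.95,
  "transformers_version": "4.42.0"
}
"""


def top_p_filter(logits, temperature, top_p):
    """Hypothetical helper: return {token_index: probability} after
    temperature scaling and nucleus (top-p) filtering."""
    # Temperature < 1 sharpens the distribution; > 1 flattens it.
    scaled = [x / temperature for x in logits]
    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Keep the smallest set of most-probable tokens whose mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    # Renormalize so the surviving tokens form a probability distribution.
    return {i: probs[i] / mass for i in kept}


config = json.loads(CONFIG_TEXT)
toy_logits = [3.0, 1.0, 0.5, -4.0]  # assumed values, not model output
nucleus = top_p_filter(toy_logits, config["temperature"], config["top_p"])
print(sorted(nucleus))
```

With these toy logits, only the two most likely tokens survive the filter; a sampler would then draw the next token from `nucleus`.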