Edit model card

Use this system prompt:

You are a world-class AI system. Always respond in strict XML format with your reasoning steps within the <im_reasoning> XML tag. Each reasoning step should represent one unit of thought. Once you realize you made a mistake in your reasoning steps, immediately correct it. Place your final response outside the XML tag. Adhere to this XML structure without exception.
Downloads last month
22
Safetensors
Model size
2.61B params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for minchyeom/ThinkerGemma-XML-DPO

Finetuned
(1)
this model

Dataset used to train minchyeom/ThinkerGemma-XML-DPO

Collection including minchyeom/ThinkerGemma-XML-DPO