metadata
library_name: transformers
tags:
- tinyzero
- r1
license: mit
language:
- en
base_model:
- Qwen/Qwen2.5-3B
pipeline_tag: text-generation
Bityuno Zero Qwen2.5-3B Countdown
Bityuno Zero is an implementation inspired by TinyZero, designed to develop self-verification and search skills through reinforcement learning. This model is based on Qwen2.5-3B and has been specifically trained for the "Countdown" task, its so experimental, check the repo for more information!