Running on A100 261 261 mergekit-gui 🔀 Merge machine learning models using a YAML configuration file
Thea Collection A family of compact reasoning models, based off of the best 2B and 3B models, trained using improved DDP training code, no Unsloth. • 5 items • Updated Jan 22 • 1
Thea Collection A family of compact reasoning models, based off of the best 2B and 3B models, trained using improved DDP training code, no Unsloth. • 5 items • Updated Jan 22 • 1