Text Generation
Transformers
Safetensors
English
olmo2
conversational
Inference Endpoints

What is that instruction template?

#1
by SerialKicked - opened

What is that instruction template? It makes very little sense. Your model has ChatML being fully tokenized but you don't even use it, instead you use non tokenized markers. It has only 4096 context length AND you're wasting half on it on the instruction template? I don't get it.

Sign up or log in to comment