Naming scheme

by Vezora - opened Apr 25

Discussion

Vezora

Apr 25

Out of curiosity why was the naming scheme “base-7b-v0.2” when the mistral v0.1 was used for continued pre-training?

Either way congratulations on the model, this is seriously awesome. I love it! And thank you for apache 2.0! ❤️🤗👏

maximegmd

Internist.ai org Apr 25

Hello,

Our naming scheme probably isn't the best, we actually had a v0.1 that was finetuned using another format of benchmarks. Since lm-eval-harness has since then implemented the benchmarks natively we had to finetune it again with the expected format.

Thank you very much for your kind words!

Vezora

Apr 26

Oh I understand, that makes sense! Once again, thank you, y'all are so awesome for this model! ❤️

Vezora changed discussion status to closed Apr 26

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment