v2 - thoughts

by yano2mch - opened Jul 31

Jul 31

Seems to work overall. I've noticed small issues with it.

got a Japanese word once but with temp under 0.8 i haven't seen a repeat.

With multiple characters talking it can easily output hundreds of tokens in text that looks... pretty darn decent.

Owner Aug 1

Thanks for the feedback! I'm about 2 days off the new qwq-32b-v2 finetune and then will likely try again for the v3 version of this model.

Aug 1

Mhmm... Going a bit longer i've gotten a few Japanese words thrown in, usually referring to body parts.

Owner 7 days ago

Not sure if you've seen it but I uploaded v3 here a couple of days ago:

It's mostly fixed the weird spacing after newlines, but now seems to add weird extra spaces after period symbols instead :/

In theory it should be much better than the v1 or v2 models as it was trained on nearly 2.5B tokens (around 10x more).

6 days ago

Not sure if you've seen it but I uploaded v3 here a couple of days ago:

https://huggingface.co/jukofyork/command-r-35b-writer-v3

Yeah i see an mradermacher quant. I'll get it and try it soon. :)

yano2mch changed discussion status to closed 6 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment