"We took great care to optimize helpfulness and safety."

#2
by Novelo - opened

Sounds like this is gonna be one Undi-Incompatible model full of censorship

At least it isn't codellama-70b-instruct level of "safety" - so safe it didn't want to write any code :D

Sounds like this is gonna be one Undi-Incompatible model full of censorship

Judging by Reddit, on the contrary, even with the assistant prompt, it does not refuse a large number of requests that Llama 2 would never answer.

The level of censorship is noticeably lower than in Llama 2. And there are also few refusals in sillytavern with jailbreak.

Owner

We already done a test finetune with @IkariDev and despite the model being dumb (we trained on base), I could do some hardcore shit (ahhh... test phase lmao) so I think it will be possible.

I've been doing some testing myself after starting the thread. Jailbreaking seems to let it loose, though sometimes, after perhaps 200 or so tokens output, it can suddenly refuse and keep echoing its refusal. In my personal opinion, MiquMaid-v3-70B beats it for the things I play with, didn't find it to be smart at all.

Sign up or log in to comment