"We took great care to optimize helpfulness and safety."

by Novelo - opened Apr 18, 2024

Discussion

Novelo

Apr 18, 2024

Sounds like this is gonna be one Undi-Incompatible model full of censorship

jukofyork

Apr 19, 2024

At least it isn't codellama-70b-instruct level of "safety" - so safe it didn't want to write any code :D

Ainonake

Apr 19, 2024

Sounds like this is gonna be one Undi-Incompatible model full of censorship

Judging by Reddit, on the contrary, even with the assistant prompt, it does not refuse a large number of requests that Llama 2 would never answer.

The level of censorship is noticeably lower than in Llama 2. And there are also few refusals in sillytavern with jailbreak.

Undi95

Owner Apr 19, 2024

We already done a test finetune with @IkariDev and despite the model being dumb (we trained on base), I could do some hardcore shit (ahhh... test phase lmao) so I think it will be possible.

Novelo

Apr 19, 2024

I've been doing some testing myself after starting the thread. Jailbreaking seems to let it loose, though sometimes, after perhaps 200 or so tokens output, it can suddenly refuse and keep echoing its refusal. In my personal opinion, MiquMaid-v3-70B beats it for the things I play with, didn't find it to be smart at all.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment