8bpw/h8 exl2 quantization of Endevor/InfinityRP-v1-7B using default exllamav2 calibration dataset.
ORIGINAL CARD:
This is an experimental model I currently use. It's far from great as I'm still working on it, but I leave it here for people to try if interested in this format. This model was basically made to stop some upsetting hallucinations, so {{char}} mostly and occasionally will wait {{user}} response instead of responding itself or deciding for {{user}}, also, my primary idea was to create a cozy model that thinks.*
Inspired by lemonilia/Limamono-Mistral-7B-v0.50
Style details:
- Quotes are used for character dialogs.
"Hey, Anon... What do you think about my style?"
- Asterisks can be used for narration, but it's optional, it's recommended to use default novel format.
*Her cheeks blush slightly, she tries to hide.*
- Character thoughts are wrapped with ` marks. This may often spontaneously occur.
My heart skips a beat hearing him call me pretty!
If you want thoughts to appear more often, just add something like this to your system prompt: "{{char}} internal thoughts are wrapped with ` marks."
- Accepted response lengths: tiny, short, medium, long, huge
- For example: ### Response: (length = medium)
Note: Apparently humongous, extreme and unlimited may not work at moment. Not fully tested.
Prompt format:
Extended Alpaca, as always.
"You are now in roleplay chat mode. Engage in an endless chat with {{user}}. Always wait {{user}} turn, next actions and responses."
Example:
- Downloads last month
- 9