ewe666/small-rp-models

Text Generation


ewe666 committed 8 days ago

Commit 2ddb423 (verified) · 1 Parent(s): 68a8689

Update README.md

Files changed (1)

README.md +10 -1


@@ -4,11 +4,20 @@ pipeline_tag: text-generation
 Collection of resources and models for storytelling and roleplay. Updated December 2024.
+**Current favorite**: [mistralai/Mistral-Small-24B-Instruct-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501)
+Some notes on best usage:
+- some people prefer base models over instruct models, but base models are too unruly
+- in general, roleplay finetunes I find to be braindamaged
+- you also don't want to "overparameterize" by writing too long a prompt
+- Conclusion: use original instruct models with short prompts
 # ⚒️ Base models
 - Llama 3 (8B) - the OG
 - [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) (12B)
-- [Qwen2.5-14B](https://huggingface.co/Qwen/Qwen2.5-14B)
+- Qwen2.5
+- Mistral Small
 # 🤖 Instruct models