ewe666 commited on
Commit
2ddb423
·
verified ·
1 Parent(s): 68a8689

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -1
README.md CHANGED
@@ -4,11 +4,20 @@ pipeline_tag: text-generation
4
 
5
  Collection of resources and models for storytelling and roleplay. Updated December 2024.
6
 
 
 
 
 
 
 
 
 
7
  # ⚒️ Base models
8
 
9
  - Llama 3 (8B) - the OG
10
  - [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) (12B)
11
- - [Qwen2.5-14B](https://huggingface.co/Qwen/Qwen2.5-14B)
 
12
 
13
  # 🤖 Instruct models
14
 
 
4
 
5
  Collection of resources and models for storytelling and roleplay. Updated December 2024.
6
 
7
+ **Current favorite**: [mistralai/Mistral-Small-24B-Instruct-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501)
8
+
9
+ Some notes on best usage:
10
+ - some people prefer base models over instruct models, but base models are too unruly
11
+ - in general, roleplay finetunes I find to be braindamaged
12
+ - you also don't want to "overparameterize" by writing too long a prompt
13
+ - Conclusion: use original instruct models with short prompts
14
+
15
  # ⚒️ Base models
16
 
17
  - Llama 3 (8B) - the OG
18
  - [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) (12B)
19
+ - Qwen2.5
20
+ - Mistral Small
21
 
22
  # 🤖 Instruct models
23