adamo1139/Yi-34B-200K-HESOYAM-2206

Basic Model Info

1 epoch on adamo1139/uninstruct-v1-experimental-chatml and then 1 epoch on adamo1139/HESOYAM_v0.3. I used GaLore for both stages.

This is a model trained on only human data, finetuned to behave like a person on 4chan board /x/ or redditor. Data used has comments from 1 4chan board "paranormal" and about 10 reddit subreddits. There's also a pippa in case you want to roleplay. Have a look at dataset to know what to expect.

Use ChatML prompt format with a system prompt like those in adamo1139/HESOYAM_v0.3, so A chat on 4chan or A chat on subreddit /r/wallstreetbets. It behaves like OpenAI slopped model with system prompt A chat so I advise you to avoid using that.

adamo1139
/

Yi-34B-200K-HESOYAM-2206

Basic Model Info

Datasets used to train adamo1139/Yi-34B-200K-HESOYAM-2206