Text Generation
Transformers
Safetensors
llama
galore
text-generation-inference
Inference Endpoints
Edit model card

Basic Model Info

1 epoch on adamo1139/uninstruct-v1-experimental-chatml and then 1 epoch on adamo1139/HESOYAM_v0.3. I used GaLore for both stages.

This is a model trained on only human data, finetuned to behave like a person on 4chan board /x/ or redditor. Data used has comments from 1 4chan board "paranormal" and about 10 reddit subreddits. There's also a pippa in case you want to roleplay. Have a look at dataset to know what to expect.

Use ChatML prompt format with a system prompt like those in adamo1139/HESOYAM_v0.3, so A chat on 4chan or A chat on subreddit /r/wallstreetbets. It behaves like OpenAI slopped model with system prompt A chat so I advise you to avoid using that.

Downloads last month
10
Safetensors
Model size
34.4B params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train adamo1139/Yi-34B-200K-HESOYAM-2206