athirdpath
/

Llama-3.1-Techne-RP-8b-v1

Text Generation

Not-For-All-Audiences

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-3.1-Techne-RP-8b-v1 / README.md

athirdpath's picture

Update README.md

65bf742 verified about 2 months ago

|

history blame contribute delete

No virus

1.5 kB

metadata

license: llama3.1
tags:
  - not-for-all-audiences

Techne-RP-8b

Trained with Llama 3 prompt formatting, Alpaca works too

Assistant Example @ q5_k_m

NSFW Writing Example @ q5_k_m

Training Methodology

athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit was further trained in the order below:

SFT

Doctor-Shotgun/no-robots-sharegpt
grimulkan/LimaRP-augmented
Inv/c2-logs-cleaned-deslopped

DPO

jondurbin/truthy-dpo-v0.1
Undi95/Weyaxi-humanish-dpo-project-noemoji
athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW