athirdpath/Llama-3.1-Techne-RP-8b-v1

license: llama3.1

Assistant Example @ q5_k_m

NSFW Writing Example @ q5_k_m

Training Methodology

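athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit was further trained with supervised fine-tuning (SFT) and then direct preference optimization (DPO), on the datasets below in the order listed: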

SFT

Doctor-Shotgun/no-robots-sharegpt
grimulkan/LimaRP-augmented
Inv/c2-logs-cleaned-deslopped

DPO

jondurbin/truthy-dpo-v0.1
Undi95/Weyaxi-humanish-dpo-project-noemoji
athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW
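
The training order above can be written down as a small, dependency-free plan. This is only a sketch of the sequence, not the author's actual training script: it assumes each dataset is a separate pass that starts from the previous pass's checkpoint, and the `Step` class, `build_plan` function, and `stepNN-...` checkpoint names are hypothetical.

```python
from dataclasses import dataclass

# Starting checkpoint named in the model card.
BASE_MODEL = "athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit"

# (method, dataset) pairs in the order the card lists them.
TRAINING_ORDER = [
    ("SFT", "Doctor-Shotgun/no-robots-sharegpt"),
    ("SFT", "grimulkan/LimaRP-augmented"),
    ("SFT", "Inv/c2-logs-cleaned-deslopped"),
    ("DPO", "jondurbin/truthy-dpo-v0.1"),
    ("DPO", "Undi95/Weyaxi-humanish-dpo-project-noemoji"),
    ("DPO", "athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW"),
]


@dataclass(frozen=True)
class Step:
    index: int      # 1-based position in the training order
    method: str     # "SFT" or "DPO"
    dataset: str    # dataset ID as written in the card
    init_from: str  # checkpoint this step starts from
    output: str     # hypothetical name for the resulting checkpoint


def build_plan(base_model, order):
    """Chain the steps so each one starts from the previous step's output."""
    plan = []
    current = base_model
    for i, (method, dataset) in enumerate(order, start=1):
        output = f"step{i:02d}-{method.lower()}"  # hypothetical naming scheme
        plan.append(Step(i, method, dataset, current, output))
        current = output
    return plan


if __name__ == "__main__":
    for step in build_plan(BASE_MODEL, TRAINING_ORDER):
        print(f"{step.index}. {step.method} on {step.dataset} (from {step.init_from})")
```

If the datasets within a stage were instead mixed into a single run, the plan would collapse to two steps; the card does not say which was done.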