athirdpath's picture
Update README.md
28e3d26 verified
|
raw
history blame
978 Bytes
metadata
license: llama3.1

Assistant Example @ q5_k_m

image/png

image/png


NSFW Writing Example @ q5_k_m

image/png


Training Methodology

athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit was further trained in the order below:

SFT

  • Doctor-Shotgun/no-robots-sharegpt
  • grimulkan/LimaRP-augmented
  • Inv/c2-logs-cleaned-deslopped

DPO

  • jondurbin/truthy-dpo-v0.1
  • Undi95/Weyaxi-humanish-dpo-project-noemoji
  • athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW