athirdpath
/

Llama-3.1-Techne-RP-8b-v1

Text Generation

Not-For-All-Audiences

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-3.1-Techne-RP-8b-v1 / README.md

athirdpath's picture

Update README.md

28e3d26 verified 4 months ago

|

978 Bytes

	---
	license: llama3.1
	---

	-----

	<p align="center"><font size="5"> <b>Assistant Example @ q5_k_m</b> </font></p>

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/dN45v5YHdIVyOacRx4xSc.png)

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/8qE0ikdtibgFtMZ-SVH1P.png)

	-----

	<p align="center"><font size="5"> <b>NSFW Writing Example @ q5_k_m</b> </font></p>

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/exle9vh1IFoKnAcIL1D64.png)

	-----

	<p align="center"><font size="5"> <b>Training Methodology</b> </font></p>

	athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit was further trained in the order below:

	## SFT
	- Doctor-Shotgun/no-robots-sharegpt
	- grimulkan/LimaRP-augmented
	- Inv/c2-logs-cleaned-deslopped

	## DPO
	- jondurbin/truthy-dpo-v0.1
	- Undi95/Weyaxi-humanish-dpo-project-noemoji
	- athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW