QuantFactory
/

ArliAI-Llama-3-8B-Cumulus-v1.0-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

munish0838 commited on Jul 14

Commit

c0a93af

•

1 Parent(s): 3a31421

Create README.md

Files changed (1) hide show

README.md +46 -0

README.md ADDED Viewed

	@@ -0,0 +1,46 @@

+---
+license: llama3
+pipeline_tag: text-generation
+base_model: OwenArli/ArliAI-Llama-3-8B-Cumulus-v1.0
+---
+# QuantFactory/ArliAI-Llama-3-8B-Cumulus-v1.0-GGUF
+This is quantized version of [OwenArli/ArliAI-Llama-3-8B-Cumulus-v1.0](https://huggingface.co/OwenArli/ArliAI-Llama-3-8B-Cumulus-v1.0) created using llama.cpp
+# Model Description
+Based on Meta-Llama-3-8b-Instruct, and is governed by Meta Llama 3 License agreement:
+https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
+This is release v1.0 of Awanllm Cumulus series of models that aim to be uncensored and have zero refusals and zero warnings.
+This model should be good for general use cases as the OG Llama 3 8B model but it should be especially better for story writing or RP use cases.
+It is the most uncensored yet, thanks to using https://huggingface.co/failspy/Meta-Llama-3-8B-Instruct-abliterated-v3 as the base model.
+In terms of reasoning and intelligence, this model is probably a bit worse than the OG Meta Llama 3 8B Instruct because of the decensoring. However we believe it is worth it for the decensoring, as even with jailbreak prompts Llama 3 8B Instruct will never get remotely close to this model.
+Best practices:
+  - Be precise and explain what you want the model to do. It has less base "personality" than the OG model but it will act however you tell it to.
+  - This model works best with system prompts that tells it that it is the character, instead of telling it to act as a character.
+Training:
+- Full 8192 sequence length.
+- Training duration is around 4 days on an RTX 4090, using 4-bit loading and Qlora 64-rank 64-alpha resulting in ~2% trainable weights.
+Instruct format:
+```
+<|begin_of_text|><|start_header_id|>system<|end_header_id|>
+{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+```