metadata

language:
  - en
license: llama3
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - gguf
base_model: unsloth/llama-3-8b-bnb-4bit
datasets:
  - 922-Narra/tagaloguanaco_cleaned_03152024

Llama-3-8b-tagalog-v1:

Test model fine-tuned on this dataset
Base: LLaMA-3 8b
GGUFs

USAGE

This is meant to be mainly a chat model.

Use "Human" and "Assistant" and prompt with Tagalog:

"\nHuman: INPUT\nAssistant:"

HYPERPARAMS

Trained for 1 epochs
rank: 32
lora alpha: 32
lr: 2e-4
batch size: 2
grad steps: 4

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

WARNINGS AND DISCLAIMERS

Note that there is a chance that the model may switch back to English (albeit still understand Tagalog inputs) or output clunky results.

Finally, this model is not guaranteed to output aligned or safe outputs nor is it meant for production use - use at your own risk!