afrideva
/

refusal-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Edit model card

refusal-GGUF

Quantized GGUF model files for refusal from mrfakename

Original Model Card:

I messed up on the previous model. This is a fixed version.

A tiny 1B model that refuses basically anything you ask it! Trained on the refusal dataset. Prompt format is ChatML.

Training results:

Training Loss	Epoch	Step	Validation Loss
2.4352	0.0580	1	2.4462
1.5741	0.5217	9	1.4304
1.5204	1.0435	18	1.3701
1.0794	1.5217	27	1.3505
1.1275	2.0435	36	1.3344
0.6652	2.5217	45	1.4360
0.6248	3.0435	54	1.4313
0.6142	3.5072	63	1.4934

Training hyperparemeters:

The following hyperparameters were used during training:

learning_rate: 0.0002
train_batch_size: 2
eval_batch_size: 2
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 8
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 10
num_epochs: 4

Base model: https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Downloads last month: 114

GGUF

Model size

1.1B params

Architecture

llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for afrideva/refusal-GGUF

Base model

mrfakename/refusal

Quantized

(3)

this model

Dataset used to train afrideva/refusal-GGUF