PygmalionAI/metharme-1.3b

alpindale committed on Jun 3, 2023

Commit 0fcc1d2 • 1 Parent(s): e3076c8

Add training procedure info

Files changed (1)

README.md +4 -0

@@ -66,6 +66,10 @@ Which might generate something like:
 Same process applies. Usually, it is best to do a sliding window over the user and model turns, but keep the system prompt fixed at the start of the context window.
+## Training Procedure
+This model was trained using the Metharme-v2 dataset (1 epoch) with 4x A100-40G GPUs. The run took 12 hours with `bsz=2` and `gradient_accumulation_steps=1024`.
 ## Evaluation Metrics
 The model was evaluated using EleutherAI's [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) test suite. It was evaluated on the following tasks:
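
Reviewer note: the added line gives `bsz=2` and `gradient_accumulation_steps=1024` on 4 GPUs but does not say whether `bsz` is per GPU or global. The sketch below works out the number of sequences per optimizer step under both readings. The helper name `effective_batch_size` and the assumption of plain data parallelism are illustrative and not taken from the commit.

```python
def effective_batch_size(micro_bsz: int, grad_accum_steps: int, num_replicas: int) -> int:
    """Sequences contributing to one optimizer step.

    Assumes plain data parallelism: each replica processes `micro_bsz`
    sequences per forward/backward pass and accumulates gradients for
    `grad_accum_steps` passes before the optimizer steps.
    """
    return micro_bsz * grad_accum_steps * num_replicas


# Values from the added README line.
BSZ = 2
GRAD_ACCUM = 1024
NUM_GPUS = 4

# Reading 1 (assumption): bsz is per GPU, so all 4 replicas contribute.
per_gpu_reading = effective_batch_size(BSZ, GRAD_ACCUM, NUM_GPUS)

# Reading 2 (assumption): bsz is already the global micro-batch.
global_reading = effective_batch_size(BSZ, GRAD_ACCUM, 1)

print(f"bsz per GPU: {per_gpu_reading} sequences per optimizer step")
print(f"bsz global:  {global_reading} sequences per optimizer step")
```

The two readings differ by a factor of four, so stating which one applies would make the training description reproducible.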