DILAB-HYU
/

KoQuality-Polyglot-5.8b

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nayohan commited on Nov 5, 2023

Commit

d82bc1b

•

1 Parent(s): 270b6dd

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -59,7 +59,7 @@ We use [KoBEST benchmark](https://huggingface.co/datasets/skt/kobest_v1) dataset
 - learning_rate: 5e-5
 - train_batch_size: 4
 - seed: 42
-- distributed_type: multi-GPU (A100 80G)
 - num_devices: 4
 - gradient_accumulation_steps: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08

 - learning_rate: 5e-5
 - train_batch_size: 4
 - seed: 42
+- distributed_type: multi-GPU (A100 80G) + No offloading
 - num_devices: 4
 - gradient_accumulation_steps: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08