sambanovasystems
/

BLOOMChat-176B-v1

Text Generation

text-generation-inference

Model card Files Files and versions Community

jayr014 commited on May 10, 2023

Commit

ad5e37c

•

1 Parent(s): 4432b47

adding in End Learning Ratio

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -146,6 +146,7 @@ We trained BloomChat with SambaStudio, a platform built on SambaNova's in-house
 - Learning Rate: 1e-5
 - Learning Rate Scheduler: Cosine Schedule with Warmup
 - Warmup Steps: 0
 - Weight decay: 0.1
 **Instruction-tuned Training on Dolly 2.0 and Oasst1**
@@ -159,6 +160,7 @@ We trained BloomChat with SambaStudio, a platform built on SambaNova's in-house
 - Learning Rate: 1e-5
 - Learning Rate Scheduler: Cosine Schedule with Warmup
 - Warmup Steps: 0
 - Weight decay: 0.1

 - Learning Rate: 1e-5
 - Learning Rate Scheduler: Cosine Schedule with Warmup
 - Warmup Steps: 0
+- End Learning Ratio: 0.1
 - Weight decay: 0.1
 **Instruction-tuned Training on Dolly 2.0 and Oasst1**
 - Learning Rate: 1e-5
 - Learning Rate Scheduler: Cosine Schedule with Warmup
 - Warmup Steps: 0
+- End Learning Ratio: 0.1
 - Weight decay: 0.1