pszemraj
/

MiniLMv2-L6-H384_R-fineweb-100k

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on May 4

Commit

2602f97

•

1 Parent(s): c9099af

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -13,12 +13,11 @@ datasets:
 # MiniLMv2-L6-H384_R-fineweb-100k
-This is a MiniLMv2 model pretrained further on an MLM task with the goal improving downstream finetuning/performance:
 - activation updated to SiLU prior to further training
 - MLM @ 40% mask ratio
--
 ## Model description
 This model is a fine-tuned version of [nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large](https://huggingface.co/nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large) on the BEE-spoke-data/fineweb-100k_en-med dataset.

 # MiniLMv2-L6-H384_R-fineweb-100k
+This is a MiniLMv2 model continually pre-trained on an MLM task with the goal of improving downstream fine-tuning/performance:
 - activation updated to SiLU prior to further training
 - MLM @ 40% mask ratio
 ## Model description
 This model is a fine-tuned version of [nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large](https://huggingface.co/nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large) on the BEE-spoke-data/fineweb-100k_en-med dataset.