keras/electra_large_generator_uncased_en

Divyasreepat committed 5 days ago

Commit dbd4cd5 • 1 parent: 83256cf

Update README.md with new model card content

Files changed (1)

README.md CHANGED

@@ -1,7 +1,7 @@
 ---
 library_name: keras-hub
 ---
-### Model Overview
+## Model Overview
 ELECTRA model is a pretraining approach for language models published by Google. Two transformer models are trained, a generator and a discriminator. The generator replaces tokens in a sequence and is trained as a masked language model. The discriminator is trained to discern what tokens have been replaced. This method of pretraining is more efficient than comparable methods like masked language modeling, especially for small models.
 Weights are released under the [MIT License](https://opensource.org/license/mit). Keras model code is released under the [Apache 2 License](https://github.com/keras-team/keras-hub/blob/master/LICENSE).
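
The generator/discriminator setup described in the model card can be illustrated with a toy sketch. This is not the keras-hub implementation: `corrupt` and `discriminator_labels` are hypothetical helper names, and the "generator" here is a random sampler standing in for a trained masked language model. It only shows how the discriminator's per-token targets are derived from the corrupted sequence.

```python
import random


def corrupt(tokens, vocab, mask_prob=0.15, seed=0):
    """Toy stand-in for the generator: resample a random subset of positions.

    A real ELECTRA generator samples replacements from its masked-LM head;
    here we draw uniformly from `vocab` purely for illustration.
    """
    rng = random.Random(seed)
    corrupted = list(tokens)
    masked = []
    for i in range(len(tokens)):
        if rng.random() < mask_prob:
            masked.append(i)
            corrupted[i] = rng.choice(vocab)
    return corrupted, masked


def discriminator_labels(original, corrupted):
    """Per-token discriminator target: 1 if the token was replaced, else 0.

    A sampled token that happens to equal the original counts as
    "original" (label 0), matching the ELECTRA formulation.
    """
    return [int(o != c) for o, c in zip(original, corrupted)]


tokens = "the chef cooked the meal".split()
vocab = ["the", "chef", "cooked", "ate", "meal", "a"]
corrupted, masked = corrupt(tokens, vocab, mask_prob=0.4)
labels = discriminator_labels(tokens, corrupted)
```

Because the discriminator receives a target for every position rather than only the masked ones, each training example carries more signal, which is the efficiency argument the model card summarizes.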