Update model spec
Browse files
README.md
CHANGED
@@ -11,4 +11,6 @@ tags:
|
|
11 |
- Administrative documents
|
12 |
---
|
13 |
|
14 |
-
# AdminBERT 4GB: A Small French Language model adapted to Administrative documents
|
|
|
|
|
|
11 |
- Administrative documents
|
12 |
---
|
13 |
|
14 |
+
# AdminBERT 4GB: A Small French Language model adapted to Administrative documents
|
15 |
+
|
16 |
+
[AdminBERT-4GB](example) is a French language model adapted on a large corpus of 10 millions French administrative texts. It is a derivative of CamemBERT model, which is based on the RoBERTa architecture. AdminBERT-4GB is trained using the Masked Language Modeling (MLM) objective with 30% mask rate for 2 epochs on 8 V100 GPUs. The dataset used for training is a sample of [Adminset](https://huggingface.co/datasets/taln-ls2n/Adminset).
|