TSebbag committed on
Commit 987f8ac · verified · 1 Parent(s): 3a0b741

Update model spec

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -11,4 +11,6 @@ tags:
  - Administrative documents
  ---
 
- # AdminBERT 4GB: A Small French Language model adapted to Administrative documents
+ # AdminBERT 4GB: A Small French Language model adapted to Administrative documents
+
+ [AdminBERT-4GB](example) is a French language model adapted on a large corpus of 10 million French administrative texts. It is a derivative of the CamemBERT model, which is based on the RoBERTa architecture. AdminBERT-4GB was trained with the Masked Language Modeling (MLM) objective at a 30% mask rate for 2 epochs on 8 V100 GPUs. The training data is a sample of [Adminset](https://huggingface.co/datasets/taln-ls2n/Adminset).
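
To make the 30% mask rate concrete, here is a minimal, standard-library-only sketch of MLM-style token masking. It is not the authors' training code. The `mask_tokens` helper, the toy vocabulary, and the 80/10/10 split between mask, random-replacement, and unchanged tokens are assumptions borrowed from the usual BERT/RoBERTa recipe; the README only states the 30% rate.

```python
import random

# Assumption: CamemBERT/RoBERTa-style mask token string.
MASK_TOKEN = "<mask>"


def mask_tokens(tokens, vocab, mask_rate=0.30, seed=0):
    """Select each token with probability `mask_rate` and corrupt it.

    Returns (masked_tokens, labels). labels[i] holds the original token
    at selected positions and None elsewhere, so a loss would only be
    computed on the selected positions.
    """
    rng = random.Random(seed)
    masked = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if rng.random() >= mask_rate:
            continue  # position not selected for prediction
        labels[i] = tok
        roll = rng.random()
        if roll < 0.8:
            masked[i] = MASK_TOKEN         # assumed: 80% replaced by the mask token
        elif roll < 0.9:
            masked[i] = rng.choice(vocab)  # assumed: 10% replaced by a random token
        # assumed: remaining 10% keep the original token
    return masked, labels


if __name__ == "__main__":
    # Toy whitespace "tokenization"; a real pipeline would use subword tokens.
    sentence = "le conseil municipal adopte le budget de la commune".split()
    toy_vocab = sorted(set(sentence))
    masked, labels = mask_tokens(sentence, toy_vocab, mask_rate=0.30, seed=1)
    print(masked)
    print(labels)
```

In practice, a library data collator with a configurable masking probability would normally handle this step; the sketch only shows what a 30% rate means at the token level.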