update readme
Browse files
README.md
CHANGED
@@ -9,37 +9,33 @@ metrics:
|
|
9 |
model-index:
|
10 |
- name: xlm-ate-nobi-mul
|
11 |
results: []
|
|
|
|
|
12 |
---
|
13 |
|
14 |
-
|
15 |
-
should probably proofread and complete it, then remove this comment. -->
|
16 |
|
17 |
-
|
18 |
-
|
19 |
-
This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on an unknown dataset.
|
20 |
-
It achieves the following results on the evaluation set:
|
21 |
-
- Loss: 0.6371
|
22 |
-
- Precision: 0.0
|
23 |
-
- Recall: 0.0
|
24 |
-
- F1: 0
|
25 |
|
26 |
## Model description
|
27 |
|
28 |
-
|
|
|
29 |
|
30 |
## Intended uses & limitations
|
31 |
|
32 |
-
|
|
|
|
|
33 |
|
34 |
## Training and evaluation data
|
35 |
|
36 |
-
|
37 |
|
38 |
## Training procedure
|
39 |
|
40 |
-
### Training hyperparameters
|
41 |
-
|
42 |
The following hyperparameters were used during training:
|
|
|
43 |
- learning_rate: 2e-05
|
44 |
- train_batch_size: 32
|
45 |
- eval_batch_size: 32
|
@@ -47,22 +43,30 @@ The following hyperparameters were used during training:
|
|
47 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
48 |
- lr_scheduler_type: linear
|
49 |
- num_epochs: 20
|
|
|
50 |
|
51 |
-
|
52 |
-
|
53 |
-
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 |
|
54 |
-
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:--:|
|
55 |
-
| 0.2817 | 0.45 | 500 | 0.4949 | 0.0 | 0.0 | 0 |
|
56 |
-
| 0.1887 | 0.91 | 1000 | 0.5226 | 0.0 | 0.0 | 0 |
|
57 |
-
| 0.1493 | 1.36 | 1500 | 0.5965 | 0.0 | 0.0 | 0 |
|
58 |
-
| 0.1335 | 1.82 | 2000 | 0.6271 | 0.0 | 0.0 | 0 |
|
59 |
-
| 0.1166 | 2.27 | 2500 | 0.7660 | 0.0 | 0.0 | 0 |
|
60 |
-
| 0.1057 | 2.72 | 3000 | 0.6371 | 0.0 | 0.0 | 0 |
|
61 |
-
|
62 |
-
|
63 |
-
### Framework versions
|
64 |
-
|
65 |
- Transformers 4.26.1
|
66 |
- Pytorch 2.0.1+cu117
|
67 |
- Datasets 2.9.0
|
68 |
- Tokenizers 0.13.2
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
model-index:
|
10 |
- name: xlm-ate-nobi-mul
|
11 |
results: []
|
12 |
+
language:
|
13 |
+
- en
|
14 |
---
|
15 |
|
16 |
+
# XLMR Token Classifier for Term Extraction
|
|
|
17 |
|
18 |
+
This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) for cross-domain term extraction tasks.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
|
20 |
## Model description
|
21 |
|
22 |
+
This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) for token classification, specifically designed to identify and classify terms within text sequences. The model assigns labels such as B-Term, I-Term, BN-Term, IN-Term, and O to individual tokens, allowing for the extraction of meaningful terms from the text.
|
23 |
+
|
24 |
|
25 |
## Intended uses & limitations
|
26 |
|
27 |
+
The model is intended for term extraction tasks. It can be applied in domains like:
|
28 |
+
- Named Entity Recognition (NER)
|
29 |
+
- Information Extraction
|
30 |
|
31 |
## Training and evaluation data
|
32 |
|
33 |
+
We fine-tuned the ACTER dataset where Named Entities are excluded from the gold standard. We trained on the Corruption and Wind Energy domain of all three languages (English, French, Dutch), and the Slovenian RSDO5 corpus, validated on the Equitation domain and tested on the Heart Failure domain.
|
34 |
|
35 |
## Training procedure
|
36 |
|
|
|
|
|
37 |
The following hyperparameters were used during training:
|
38 |
+
```
|
39 |
- learning_rate: 2e-05
|
40 |
- train_batch_size: 32
|
41 |
- eval_batch_size: 32
|
|
|
43 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
44 |
- lr_scheduler_type: linear
|
45 |
- num_epochs: 20
|
46 |
+
```
|
47 |
|
48 |
+
Framework versions:
|
49 |
+
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
- Transformers 4.26.1
|
51 |
- Pytorch 2.0.1+cu117
|
52 |
- Datasets 2.9.0
|
53 |
- Tokenizers 0.13.2
|
54 |
+
```
|
55 |
+
|
56 |
+
## Evaluation
|
57 |
+
|
58 |
+
We evaluate the performance of the ATE systems by comparing the candidate list extracted from the test set with the manually annotated gold standard term list for that specific test set. We use exact string matching to compare the retrieved terms to the ones in the gold standard and calculate Precision (P), Recall (R), and F1-score (F1).
|
59 |
+
The results are reported in [Can cross-domain term extraction benefit from cross-lingual transfer and nested term labeling?](https://link.springer.com/article/10.1007/s10994-023-06506-7#Sec12).
|
60 |
+
|
61 |
+
## Citation
|
62 |
+
If you use this model in your research or application, please cite it as follows:
|
63 |
+
```
|
64 |
+
@inproceedings{tran2022can,
|
65 |
+
title={Can cross-domain term extraction benefit from cross-lingual transfer?},
|
66 |
+
author={Tran, Hanh Thi Hong and Martinc, Matej and Doucet, Antoine and Pollak, Senja},
|
67 |
+
booktitle={International Conference on Discovery Science},
|
68 |
+
pages={363--378},
|
69 |
+
year={2022},
|
70 |
+
organization={Springer}
|
71 |
+
}
|
72 |
+
```
|