Update README.md
README.md
CHANGED
@@ -24,9 +24,9 @@ One-off run using a [modified version](https://gist.github.com/bclavie/93d3b161d
 
 w/o extensive tuning, the model trains considerably faster than BERT-base, and gets **+5 Weighted F1**:
 
-
+# Results
 
-
+## ModernBERT-base-fineweb-edu-example
 
 **Weighted F1: 0.76**
 
@@ -48,7 +48,7 @@ Validation Report:
 weighted avg 0.76 0.76 0.76 46743
 ```
 
-
+## Original Classifier (https://huggingface.co/HuggingFaceFW/fineweb-edu-classifier):
 
 **Weighted F1: 0.71**
 
@@ -71,7 +71,7 @@ weighted avg 0.71 0.71 0.71 46867
 
 (for some reason, the currently available annotated dataset is identical, except that it's missing 124 of the 125 5-rated examples. These are so anecdotal they have no real impact on the weighted metrics.)
 
-
+# Params
 
 Most parameters detailed in the script. Key hparams:
 
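
A quick sanity check of the README's note about the 124 missing 5-rated examples, as a rough sketch rather than anything taken from the training script. It assumes the `weighted avg` rows come from a scikit-learn style report, where weighted F1 is the support-weighted mean of per-class F1, and that the 46867-example report includes all 125 5-rated examples. The helper name `max_class_contribution` is hypothetical.

```python
# Supports copied from the two "weighted avg" rows shown in the diff.
SUPPORT_CURRENT_DATASET = 46743   # report under the ModernBERT heading
SUPPORT_ORIGINAL_DATASET = 46867  # report under the original classifier heading

# Assumption: the original dataset holds all 125 5-rated examples,
# and the currently available one holds only 1 of them.
FIVE_RATED_ORIGINAL = 125
FIVE_RATED_CURRENT = 1


def max_class_contribution(class_support: int, total_support: int) -> float:
    """Upper bound on one class's direct share of a support-weighted F1.

    Weighted F1 = sum over classes of (class_support / total_support) * F1_class,
    and F1_class is at most 1, so the share is at most the class weight.
    """
    return class_support / total_support


# The two supports differ by exactly the number of missing examples.
missing = SUPPORT_ORIGINAL_DATASET - SUPPORT_CURRENT_DATASET

bound_original = max_class_contribution(FIVE_RATED_ORIGINAL, SUPPORT_ORIGINAL_DATASET)
bound_current = max_class_contribution(FIVE_RATED_CURRENT, SUPPORT_CURRENT_DATASET)

print(f"missing examples: {missing}")
print(f"max direct contribution of class 5 (original): {bound_original:.5f}")
print(f"max direct contribution of class 5 (current):  {bound_current:.5f}")
```

Under these assumptions the 5-rated class can directly account for well under 0.01 of the weighted F1 on either dataset, which is small next to the reported 0.71 vs 0.76 gap. This only bounds the class's own term; misclassified 5-rated examples can also nudge the precision of neighbouring classes, which the sketch does not model.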