chefkoch24 commited on
Commit
66fb80e
1 Parent(s): 401108b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md CHANGED
@@ -1,3 +1,45 @@
1
  ---
2
  license: openrail
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: openrail
3
+ language:
4
+ - de
5
+ metrics:
6
+ - f1
7
+ - accuracy
8
+ - precision
9
+ - recall
10
+ pipeline_tag: token-classification
11
+ tags:
12
+ - recipe
13
+ - cooking
14
+ - entity_recognition
15
  ---
16
+ Weakly supervised token classification model for German recipe texts based on bert-base-german-cased.
17
+
18
+ Code available: https://github.com/chefkoch24/weak-ingredient-recognition
19
+
20
+ Dataset: https://www.kaggle.com/datasets/sterby/german-recipes-dataset
21
+
22
+ Recognizes the following entities:
23
+ 'O': 0, <br>
24
+ 'B-INGREDIENT': 1,<br>
25
+ 'I-INGREDIENT': 2,<br>
26
+ 'B-UNIT': 3,<br>
27
+ 'I-UNIT': 4,<br>
28
+ 'B-QUANTITY': 5,<br>
29
+ 'I-QUANTITY': 6<br>
30
+
31
+ **Training:**
32
+ epochs: 2<br>
33
+ optimizer: Adam<br>
34
+ learning rate: 2e-5<br>
35
+ max length: 512<br>
36
+ recipes: 7801<br>
37
+
38
+ The model was trained on single Geforce RTX2080 with 11GB GPU
39
+
40
+
41
+ **Metrics on test set (weakly supervised)**:
42
+ accuracy_token 0.9965656995773315<br>
43
+ f1_token 0.9965656995773315<br>
44
+ precision_token 0.9965656995773315<br>
45
+ recall_token 0.9965656995773315<br>