Shaltiel commited on
Commit
60f8b33
โ€ข
1 Parent(s): 93a12a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +86 -0
README.md CHANGED
@@ -1,3 +1,89 @@
1
  ---
2
  license: cc-by-4.0
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-4.0
3
+ language:
4
+ - he
5
  ---
6
+ # DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
7
+
8
+ State-of-the-art language model for Hebrew, released [here](https://arxiv.org/abs/2308.16687).
9
+
10
+ This is the fine-tuned BERT-base model for the named-entity-recognition task.
11
+
12
+ For the bert-base models for other tasks, see [here](https://huggingface.co/collections/dicta-il/dictabert-6588e7cc08f83845fc42a18b).
13
+
14
+ For the bert-large models for other tasks, see [to-be-added].
15
+
16
+ Sample usage:
17
+
18
+ ```python
19
+ from transformers import pipeline
20
+
21
+ oracle = pipeline('ner', model='dicta-il/dictabert-ner')
22
+
23
+ sentence = 'ื”ื›ื™ ื“ืจืžื˜ื™ ืฉื™ืฉ: ืฉืขืจ ืฉืœ ืกื“ืจื™ืง ื”ืžื—ืœื™ืฃ ื”ืขื ื™ืง ืœื–ื™ื• ืืจื™ื” ื ื™ืฆื—ื•ืŸ ืฉื ื™ ื‘ืฉืœื•ืฉื” ืžืฉื—ืงื™ื ื•ืขืœื™ื™ื” ืžืขืœ ื”ืงื• ื”ืื“ื•ื.'
24
+ oracle(sentence)
25
+ ```
26
+
27
+ Output:
28
+ ```json
29
+ [
30
+ {
31
+ "entity": "B-PER",
32
+ "score": "0.99827814",
33
+ "index": 7,
34
+ "word": "ืกื“ืจื™ืง",
35
+ "start": 22,
36
+ "end": 27
37
+ },
38
+ {
39
+ "entity": "B-PER",
40
+ "score": "0.9995943",
41
+ "index": 10,
42
+ "word": "ืœื–ื™",
43
+ "start": 41,
44
+ "end": 44
45
+ },
46
+ {
47
+ "entity": "I-PER",
48
+ "score": "0.9995154",
49
+ "index": 12,
50
+ "word": "ืืจื™ื”",
51
+ "start": 46,
52
+ "end": 50
53
+ }
54
+ ]
55
+ ```
56
+
57
+ ## Citation
58
+
59
+ If you use DictaBERT in your research, please cite ```DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew```
60
+
61
+ **BibTeX:**
62
+
63
+ ```bibtex
64
+ @misc{shmidman2023dictabert,
65
+ title={DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew},
66
+ author={Shaltiel Shmidman and Avi Shmidman and Moshe Koppel},
67
+ year={2023},
68
+ eprint={2308.16687},
69
+ archivePrefix={arXiv},
70
+ primaryClass={cs.CL}
71
+ }
72
+ ```
73
+
74
+ ## License
75
+
76
+ Shield: [![CC BY 4.0][cc-by-shield]][cc-by]
77
+
78
+ This work is licensed under a
79
+ [Creative Commons Attribution 4.0 International License][cc-by].
80
+
81
+ [![CC BY 4.0][cc-by-image]][cc-by]
82
+
83
+ [cc-by]: http://creativecommons.org/licenses/by/4.0/
84
+ [cc-by-image]: https://i.creativecommons.org/l/by/4.0/88x31.png
85
+ [cc-by-shield]: https://img.shields.io/badge/License-CC%20BY%204.0-lightgrey.svg
86
+
87
+
88
+
89
+