Upload 6 files
scibert_scivocab_uncased/.gitattributes
ADDED
@@ -0,0 +1,10 @@
+*.bin.* filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tar.gz filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
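The attribute lines above decide which uploaded files are stored through Git LFS rather than as regular blobs. As a rough sketch of that routing, the snippet below checks file names against the same patterns. It assumes that matching the base name with `fnmatchcase` is a close enough stand-in for Git's own pattern rules for these simple `*.ext` globs; the helper names `lfs_patterns` and `is_lfs_tracked` are hypothetical.

```python
from fnmatch import fnmatchcase

# The ten attribute lines added by this commit (the *.msgpack line appears twice).
GITATTRIBUTES = """\
*.bin.* filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tar.gz filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
"""


def lfs_patterns(text):
    """Return the patterns whose attributes include filter=lfs."""
    patterns = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) >= 2 and "filter=lfs" in parts[1:]:
            patterns.append(parts[0])
    return patterns


def is_lfs_tracked(path, patterns):
    """Approximate Git's matching: compare the base name against each glob."""
    name = path.rsplit("/", 1)[-1]
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    patterns = lfs_patterns(GITATTRIBUTES)
    for path in (
        "scibert_scivocab_uncased/flax_model.msgpack",
        "scibert_scivocab_uncased/pytorch_model.bin",
        "scibert_scivocab_uncased/config.json",
        "scibert_scivocab_uncased/vocab.txt",
    ):
        print(path, is_lfs_tracked(path, patterns))
```

Under this approximation the two weight files match an LFS pattern while `config.json` and `vocab.txt` do not, which is consistent with the weight files appearing in this diff as three-line pointer files.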
scibert_scivocab_uncased/README.md
ADDED
@@ -0,0 +1,29 @@
+---
+language: en
+---
+# SciBERT
+
+This is the pretrained model presented in [SciBERT: A Pretrained Language Model for Scientific Text](https://www.aclweb.org/anthology/D19-1371/), which is a BERT model trained on scientific text.
+
+The training corpus was papers taken from [Semantic Scholar](https://www.semanticscholar.org). Corpus size is 1.14M papers, 3.1B tokens. We use the full text of the papers in training, not just abstracts.
+
+SciBERT has its own wordpiece vocabulary (scivocab) that's built to best match the training corpus. We trained cased and uncased versions.
+
+Available models include:
+* `scibert_scivocab_cased`
+* `scibert_scivocab_uncased`
+
+
+The original repo can be found [here](https://github.com/allenai/scibert).
+
+If using these models, please cite the following paper:
+```
+@inproceedings{beltagy-etal-2019-scibert,
+    title = "SciBERT: A Pretrained Language Model for Scientific Text",
+    author = "Beltagy, Iz and Lo, Kyle and Cohan, Arman",
+    booktitle = "EMNLP",
+    year = "2019",
+    publisher = "Association for Computational Linguistics",
+    url = "https://www.aclweb.org/anthology/D19-1371"
+}
+```
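The README describes the model but does not show how to load it. A minimal sketch follows, assuming the `transformers` and `torch` packages are installed and that the checkpoint is published on the Hugging Face Hub under the ID `allenai/scibert_scivocab_uncased` (the diff itself shows only the directory name `scibert_scivocab_uncased`, so the full ID is an assumption). The function name `embed_sentences` is hypothetical, and taking the `[CLS]` vector as a sentence representation is one common choice, not something the README prescribes.

```python
# Assumed Hub ID; the diff only shows the directory "scibert_scivocab_uncased".
MODEL_ID = "allenai/scibert_scivocab_uncased"


def embed_sentences(sentences, model_id=MODEL_ID):
    """Return one [CLS] vector per sentence as a (len(sentences), hidden_size) tensor."""
    # Imported lazily so that defining this function does not require the libraries.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    model.eval()

    # Pad to the longest sentence and truncate at the model's 512-position limit.
    batch = tokenizer(
        sentences,
        padding=True,
        truncation=True,
        max_length=512,
        return_tensors="pt",
    )
    with torch.no_grad():
        outputs = model(**batch)
    # Position 0 of the last hidden state corresponds to the [CLS] token.
    return outputs.last_hidden_state[:, 0, :]


if __name__ == "__main__":
    vectors = embed_sentences(["The enzyme catalyzes the hydrolysis of ATP."])
    print(vectors.shape)
```

Calling the function downloads the weights on first use, so it needs network access unless the files are already cached locally.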
scibert_scivocab_uncased/config.json
ADDED
@@ -0,0 +1,16 @@
+{
+  "attention_probs_dropout_prob": 0.1,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "bert",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 0,
+  "type_vocab_size": 2,
+  "vocab_size": 31090
+}
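The configuration above fixes the model's shape, so its parameter count can be estimated by hand. The sketch below does that arithmetic under the assumption that the architecture is the standard BERT layout (token, position, and segment embeddings with a LayerNorm, then per-layer self-attention and feed-forward blocks each followed by a LayerNorm, plus a pooler). The function name `count_bert_parameters` is hypothetical, and any pretraining heads are left out.

```python
import json

# The config.json contents added by this commit.
CONFIG_JSON = """
{
  "attention_probs_dropout_prob": 0.1,
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "initializer_range": 0.02,
  "intermediate_size": 3072,
  "layer_norm_eps": 1e-12,
  "max_position_embeddings": 512,
  "model_type": "bert",
  "num_attention_heads": 12,
  "num_hidden_layers": 12,
  "pad_token_id": 0,
  "type_vocab_size": 2,
  "vocab_size": 31090
}
"""


def count_bert_parameters(cfg):
    """Estimate encoder + pooler parameters for a standard BERT layout."""
    h = cfg["hidden_size"]
    i = cfg["intermediate_size"]
    layer_norm = 2 * h  # scale and bias

    # Token, position, and segment embedding tables, then one LayerNorm.
    embeddings = (
        cfg["vocab_size"] + cfg["max_position_embeddings"] + cfg["type_vocab_size"]
    ) * h + layer_norm

    # Query, key, value, and output projections (weights + biases), then LayerNorm.
    attention = 4 * (h * h + h) + layer_norm
    # Two dense layers (h -> i -> h) with biases, then LayerNorm.
    feed_forward = (h * i + i) + (i * h + h) + layer_norm
    encoder = cfg["num_hidden_layers"] * (attention + feed_forward)

    # Dense layer applied to the [CLS] position.
    pooler = h * h + h
    return embeddings + encoder + pooler


if __name__ == "__main__":
    config = json.loads(CONFIG_JSON)
    total = count_bert_parameters(config)
    print(f"{total:,} parameters, about {total * 4 / 1e6:.1f} MB as float32")
```

With these values the estimate is 109,918,464 parameters, or 439,673,856 bytes at 4 bytes each. That is close to the 439,681,005-byte size recorded for `flax_model.msgpack` in this commit, though reading that file as float32 encoder-plus-pooler weights is an assumption rather than something the diff states.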
scibert_scivocab_uncased/flax_model.msgpack
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:53d32c1d93bebe3fbc0a20e081d8575defc8d481989f97fb82c0f95f3b38f2c1
+size 439681005
scibert_scivocab_uncased/pytorch_model.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e492944d88ac97dee6baa547671d3c526a3d067676883efb058311f4e5882e1a
+size 442221694
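The two weight files above are committed as Git LFS pointers: a spec version, a SHA-256 object ID, and a size in bytes. One way to use a pointer is to check a downloaded file against it. The following is a sketch of that check using only the standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the usage example is a placeholder.

```python
import hashlib
from pathlib import Path

# Pointer text for pytorch_model.bin as added by this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e492944d88ac97dee6baa547671d3c526a3d067676883efb058311f4e5882e1a
size 442221694
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest the pointer records."""
    path = Path(path)
    # Cheap check first: a size mismatch makes hashing unnecessary.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in chunks so a ~440 MB checkpoint is never fully held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER)
    local_file = Path("pytorch_model.bin")  # placeholder path to a downloaded copy
    if local_file.exists():
        print("matches pointer:", matches_pointer(local_file, pointer))
    else:
        print("expected size in bytes:", pointer["size"])
```

The same two functions apply unchanged to the `flax_model.msgpack` pointer, since both use the `sha256` algorithm prefix.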
scibert_scivocab_uncased/vocab.txt
ADDED
The diff for this file is too large to render.