Update README.md
Browse files
README.md
CHANGED
@@ -56,8 +56,8 @@ At the time of release (February 2024), **MedGENIE-fid-flan-t5-base-medqa** is a
|
|
56 |
| Codex <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | R (Wikipedia) | 0-shot | 175B | 52.5 |
|
57 |
| GPT-3.5-Turbo <small>([Yang et al.](https://arxiv.org/abs/2309.02233))</small> | R (Wikipedia) | k-shot | -- | 52.3 |
|
58 |
| MEDITRON <small>([Chen et al.](https://arxiv.org/abs/2311.16079))</small> | ∅ | Fine-tuned | 7B | 52.0 |
|
59 |
-
| BioMistral DARE <small> ([Labrak et al](https://arxiv.org/abs/2402.10373)) </small> | ∅ | Fine-tuned | 7B | 51.1 |
|
60 |
-
| BioMistral <small> ([Labrak et al](https://arxiv.org/abs/2402.10373)) </small> | ∅ | Fine-tuned | 7B | 50.6 |
|
61 |
| Zephyr-β | R (MedWiki) | 2-shot | 7B | 50.4 |
|
62 |
| BioMedGPT <small>([Luo et al.](https://arxiv.org/abs/2308.09442v2))</small> | ∅ | k-shot | 10B | 50.4 |
|
63 |
| BioMedLM <small>([Singhal et al.](https://arxiv.org/abs/2212.13138))</small> | ∅ | Fine-tuned | 2.7B | 50.3 |
|
@@ -68,14 +68,14 @@ At the time of release (February 2024), **MedGENIE-fid-flan-t5-base-medqa** is a
|
|
68 |
| PMC-LLaMA <small>([Chen et al.](https://arxiv.org/abs/2311.16079))</small> | ∅ | Fine-tuned | 7B | 49.2 |
|
69 |
| DRAGON <small>([Yasunaga et al.](https://arxiv.org/abs/2210.09338))</small> | R (UMLS) | Fine-tuned | 360M | 47.5 |
|
70 |
| InstructGPT <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | R (Wikipedia) | 0-shot | 175B | 47.3 |
|
71 |
-
| BioMistral DARE <small> ([Labrak et al](https://arxiv.org/abs/2402.10373)) </small> | ∅ | 3-shot | 7B | 47.0 |
|
72 |
| Flan-PaLM <small>([Singhal et al.](https://arxiv.org/abs/2212.13138))</small> | ∅ | 5-shot | 62B | 46.1 |
|
73 |
| InstructGPT <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | ∅ | 0-shot | 175B | 46.0 |
|
74 |
| VOD <small>([Liévin et al. 2023](https://arxiv.org/abs/2210.06345))</small> | R (MedWiki) | Fine-tuned | 220M | 45.8 |
|
75 |
| Vicuna 1.3 <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | ∅ | 0-shot | 33B | 45.2 |
|
76 |
| BioLinkBERT <small>([Singhal et al.](https://arxiv.org/abs/2212.13138))</small> | ∅ | Fine-tuned | 340M | 45.1 |
|
77 |
| Mistral-Instruct | R (MedWiki) | 2-shot | 7B | 45.1 |
|
78 |
-
| BioMistral <small> ([Labrak et al](https://arxiv.org/abs/2402.10373)) </small> | ∅ | 3-shot | 7B | 44.4 |
|
79 |
| Galactica | ∅ | 0-shot | 120B | 44.4 |
|
80 |
| LLaMA-2 <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | ∅ | 0-shot | 70B | 43.4 |
|
81 |
| BioReader <small>([Frisoni et al.](https://aclanthology.org/2022.emnlp-main.390/))</small> | R (PubMed-RCT) | Fine-tuned | 230M | 43.0 |
|
|
|
56 |
| Codex <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | R (Wikipedia) | 0-shot | 175B | 52.5 |
|
57 |
| GPT-3.5-Turbo <small>([Yang et al.](https://arxiv.org/abs/2309.02233))</small> | R (Wikipedia) | k-shot | -- | 52.3 |
|
58 |
| MEDITRON <small>([Chen et al.](https://arxiv.org/abs/2311.16079))</small> | ∅ | Fine-tuned | 7B | 52.0 |
|
59 |
+
| BioMistral DARE <small> ([Labrak et al.](https://arxiv.org/abs/2402.10373)) </small> | ∅ | Fine-tuned | 7B | 51.1 |
|
60 |
+
| BioMistral <small> ([Labrak et al.](https://arxiv.org/abs/2402.10373)) </small> | ∅ | Fine-tuned | 7B | 50.6 |
|
61 |
| Zephyr-β | R (MedWiki) | 2-shot | 7B | 50.4 |
|
62 |
| BioMedGPT <small>([Luo et al.](https://arxiv.org/abs/2308.09442v2))</small> | ∅ | k-shot | 10B | 50.4 |
|
63 |
| BioMedLM <small>([Singhal et al.](https://arxiv.org/abs/2212.13138))</small> | ∅ | Fine-tuned | 2.7B | 50.3 |
|
|
|
68 |
| PMC-LLaMA <small>([Chen et al.](https://arxiv.org/abs/2311.16079))</small> | ∅ | Fine-tuned | 7B | 49.2 |
|
69 |
| DRAGON <small>([Yasunaga et al.](https://arxiv.org/abs/2210.09338))</small> | R (UMLS) | Fine-tuned | 360M | 47.5 |
|
70 |
| InstructGPT <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | R (Wikipedia) | 0-shot | 175B | 47.3 |
|
71 |
+
| BioMistral DARE <small> ([Labrak et al.](https://arxiv.org/abs/2402.10373)) </small> | ∅ | 3-shot | 7B | 47.0 |
|
72 |
| Flan-PaLM <small>([Singhal et al.](https://arxiv.org/abs/2212.13138))</small> | ∅ | 5-shot | 62B | 46.1 |
|
73 |
| InstructGPT <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | ∅ | 0-shot | 175B | 46.0 |
|
74 |
| VOD <small>([Liévin et al. 2023](https://arxiv.org/abs/2210.06345))</small> | R (MedWiki) | Fine-tuned | 220M | 45.8 |
|
75 |
| Vicuna 1.3 <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | ∅ | 0-shot | 33B | 45.2 |
|
76 |
| BioLinkBERT <small>([Singhal et al.](https://arxiv.org/abs/2212.13138))</small> | ∅ | Fine-tuned | 340M | 45.1 |
|
77 |
| Mistral-Instruct | R (MedWiki) | 2-shot | 7B | 45.1 |
|
78 |
+
| BioMistral <small> ([Labrak et al.](https://arxiv.org/abs/2402.10373)) </small> | ∅ | 3-shot | 7B | 44.4 |
|
79 |
| Galactica | ∅ | 0-shot | 120B | 44.4 |
|
80 |
| LLaMA-2 <small>([Liévin et al.](https://arxiv.org/abs/2207.08143))</small> | ∅ | 0-shot | 70B | 43.4 |
|
81 |
| BioReader <small>([Frisoni et al.](https://aclanthology.org/2022.emnlp-main.390/))</small> | R (PubMed-RCT) | Fine-tuned | 230M | 43.0 |
|