|
--- |
|
tags: |
|
- medical |
|
- mmlu |
|
- medalpaca |
|
- medmcqa |
|
datasets: |
|
- cais/mmlu |
|
- medalpaca/medical_meadow_medqa |
|
- medalpaca/medical_meadow_wikidoc |
|
- openlifescienceai/medmcqa |
|
- bigbio/med_qa |
|
- GBaker/MedQA-USMLE-4-options |
|
- medalpaca/medical_meadow_mmmlu |
|
- medalpaca/medical_meadow_wikidoc_patient_information |
|
- qiaojin/PubMedQA |
|
pipeline_tag: text-generation |
|
--- |
|
### Evaluation results |
|
|
|
| Dataset | GPT-3.5 | Tuned Llama 3 V1 | Tuned Llama 3 V2 | |
|
|:-------------:|:-----:|:----:|:----:| |
|
| MMLU Clinical Knowledge | 69.8| 74.34 | 73.20 | |
|
| MMLU College Biology | 72.2| 72.92 | 74.30 | |
|
| MMLU College Medicine | 61.3| 61.85 | 66.47 | |
|
| MMLU Medical Genetics | 70.0| 76.0 | 74.0 | |
|
| MMLU Professional Medicine| 70.2| 72.43 | 71.32 | |
|
| MMLU Anatomy | 56.3| 61.48 | 64.44 | |