HPAI-BSC
/

Llama3.1-Aloe-Beta-70B

Question Answering

Inference Endpoints

Model card Files Files and versions Community

JordiBayarri commited on about 21 hours ago

Commit

c877fb8

•

1 Parent(s): 70cd22a

Update README.md

Files changed (1) hide show

README.md +5 -8

README.md CHANGED Viewed

@@ -83,24 +83,21 @@ Aloe Beta has been tested on the most popular healthcare QA datasets, with and w
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/Ad9Rs3rh_z3LxuqdcKdpy.png)
-<!---
-The Beta model has been developed to excel in several different medical tasks. For this reason, we evaluated the model in many different medical tasks:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/ZABYUxpQRMDcrJmKhkEfz.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/2NW3im0aH2u6RKp969sjx.png)
--->
 We also compared the performance of the model in the general domain, using the OpenLLM Leaderboard benchmark. Aloe-Beta gets competitive results with the current SOTA general models in the most used general benchmarks and outperforms the medical models:
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/UKW36y9yjqn3Q5OfrCuIc.png)
-More evaluations coming soon!
 ## Uses
@@ -263,7 +260,7 @@ The training set consists of around 1.8B tokens, having 3 different types of dat
   - [HPAI-BSC/headqa-cot-llama31](https://huggingface.co/datasets/HPAI-BSC/headqa-cot-llama31)
   - [HPAI-BSC/MMLU-medical-cot-llama31](https://huggingface.co/datasets/HPAI-BSC/MMLU-medical-cot-llama31)
   - [HPAI-BSC/Polymed-QA](https://huggingface.co/datasets/HPAI-BSC/Polymed-QA)
-- General data. It includes maths, STEM, code, function calling, and instruction of very long instructions.
   - [HPAI-BSC/Aloe-Beta-General-Collection](https://huggingface.co/datasets/HPAI-BSC/Aloe-Beta-General-Collection)
 #### Training parameters

 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/Ad9Rs3rh_z3LxuqdcKdpy.png)
+The Beta model has been developed to excel in several different medical tasks. For this reason, we evaluated the model in many different medical benchmarks:
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/lPcEzQbWRq13H6tN_mEg5.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/ORkSfVkwXqefEtDnIBMOJ.png)
 We also compared the performance of the model in the general domain, using the OpenLLM Leaderboard benchmark. Aloe-Beta gets competitive results with the current SOTA general models in the most used general benchmarks and outperforms the medical models:
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6620f941eba5274b5c12f83d/UKW36y9yjqn3Q5OfrCuIc.png)
 ## Uses
   - [HPAI-BSC/headqa-cot-llama31](https://huggingface.co/datasets/HPAI-BSC/headqa-cot-llama31)
   - [HPAI-BSC/MMLU-medical-cot-llama31](https://huggingface.co/datasets/HPAI-BSC/MMLU-medical-cot-llama31)
   - [HPAI-BSC/Polymed-QA](https://huggingface.co/datasets/HPAI-BSC/Polymed-QA)
+- General data. It includes maths, STEM, code, function calling, and instruction with very long context.
   - [HPAI-BSC/Aloe-Beta-General-Collection](https://huggingface.co/datasets/HPAI-BSC/Aloe-Beta-General-Collection)
 #### Training parameters