DILAB-HYU/KoQuality-Polyglot-5.8b

@@ -10,35 +10,35 @@ KoQuality-Polyglot-5.8b is a fine-tuned version of [EleutherAI/polyglot-ko-5.8b]
 ## Overall Average accuracy score of the KoBEST datasets
 We use the [KoBEST benchmark](https://huggingface.co/datasets/skt/kobest_v1) datasets (BoolQ, COPA, HellaSwag, SentiNeg, WiC) to compare the accuracy of our best model with that of other models. Our model outperforms other models in the average accuracy score of the KoBEST datasets.
-<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/q4cCUCzRJa3m2f7oxI_FY.png style="max-width: 500px; width: 300%"/>
-| Model | 0-shot | 1-shot | 2-shot | 5-shot | 10-shot
-| --- | --- | --- | --- | --- | --- |
-| polyglot-ko-5.8b | 0.5587 | 0.5977 | 0.6138 | 0.6431 | 0.6457
-| koalpcaca-polyglot-5.8b | 0.5085 | 0.5561 | 0.5768 | 0.6097 | 0.6059
-| kullm-polyglot-5.8b | 0.5409 | 0.6072 | 0.5945 | 0.6345 | 0.6530
-| koquality-polyglot-5.8b | 0.5472 | 0.5979 | 0.6260 | 0.6486 | 0.6535
-## Evaluation results
-### COPA (F1)
-<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/7EKl1OAgKgPBFcSlGzBiW.png style="max-width: 600px; width: 400%"/>
 | Model | 0-shot | 1-shot | 2-shot | 5-shot | 10-shot
 | --- | --- | --- | --- | --- | --- |
-| polyglot-ko-5.8b | 0.5587 | 0.5977 | 0.6138 | 0.6431 | 0.6457
-| koalpcaca-polyglot-5.8b | 0.5085 | 0.5561 | 0.5768 | 0.6097 | 0.6059
-| kullm-polyglot-5.8b | 0.5409 | 0.6072 | 0.5945 | 0.6345 | 0.6530
-| koquality-polyglot-5.8b | 0.5472 | 0.5979 | 0.6260 | 0.6486 | 0.6535
 ### HellaSwag (F1)
-### BoolQ (F1)
 ### SentiNeg (F1)
 ### WiC (F1)

 ## Overall Average accuracy score of the KoBEST datasets
 We use the [KoBEST benchmark](https://huggingface.co/datasets/skt/kobest_v1) datasets (BoolQ, COPA, HellaSwag, SentiNeg, WiC) to compare the accuracy of our best model with that of other models. Our model outperforms other models in the average accuracy score of the KoBEST datasets.
+<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/t5x4PphoNb-tW3iCzXXHT.png style="max-width: 500px; width: 300%"/>
 | Model | 0-shot | 1-shot | 2-shot | 5-shot | 10-shot
 | --- | --- | --- | --- | --- | --- |
+| polyglot-ko-5.8b | 0.4734 | 0.5929 | 0.6120 | 0.6388 | 0.6295
+| koalpcaca-polyglot-5.8b | 0.4731 | 0.5284 | 0.5721 | 0.6054 | 0.6042
+| kullm-polyglot-5.8b | 0.4415 | 0.6030 | 0.5849 | 0.6252 | 0.6451
+| koquality-polyglot-5.8b | 0.4530 | 0.6050 | 0.6351 | 0.6420 | 0.6457
+## Evaluation results
+### COPA (F1)
+<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/QAie0x99S8-KEKvK0I_uZ.png style="max-width: 500px; width: 200%"/>
+### BoolQ (F1)
+<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/CtEWEQ5BBS05V9cDWA7kp.png style="max-width: 500px; width: 200%"/>
 ### HellaSwag (F1)
+<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/cHws6qWkDlTfs5GVcQvtN.png style="max-width: 500px; width: 200%"/>
 ### SentiNeg (F1)
+<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/VEG15XXOIbzJyQAusLa4B.png style="max-width: 500px; width: 200%"/>
 ### WiC (F1)
+<img src=https://cdn-uploads.huggingface.co/production/uploads/650fecfd247f564485f8fbcf/hV-uADJiydkVQOyYysej9.png style="max-width: 500px; width: 200%"/>
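To make the updated average-accuracy table easier to read, the comparison can be sketched in a few lines of Python. This is only an illustrative sketch: the scores are copied from the rows added in this diff, model names are kept exactly as they appear in the table, and the helper `best_per_shot` is a hypothetical name that is not part of the model repository.

```python
# Average KoBEST accuracy per shot setting, copied from the rows added in this diff.
SHOTS = ["0-shot", "1-shot", "2-shot", "5-shot", "10-shot"]
SCORES = {
    "polyglot-ko-5.8b": [0.4734, 0.5929, 0.6120, 0.6388, 0.6295],
    "koalpcaca-polyglot-5.8b": [0.4731, 0.5284, 0.5721, 0.6054, 0.6042],
    "kullm-polyglot-5.8b": [0.4415, 0.6030, 0.5849, 0.6252, 0.6451],
    "koquality-polyglot-5.8b": [0.4530, 0.6050, 0.6351, 0.6420, 0.6457],
}


def best_per_shot(scores, shots):
    """Return {shot: (model, score)} for the top-scoring model at each shot setting.

    Hypothetical helper for illustration; not part of the original repository.
    """
    best = {}
    for i, shot in enumerate(shots):
        # Pick the model whose score in column i is the largest.
        model = max(scores, key=lambda name: scores[name][i])
        best[shot] = (model, scores[model][i])
    return best


if __name__ == "__main__":
    for shot, (model, score) in best_per_shot(SCORES, SHOTS).items():
        print(f"{shot:>7}: {model} ({score:.4f})")
```

With the numbers above, this picks koquality-polyglot-5.8b in the 1-, 2-, 5-, and 10-shot columns and polyglot-ko-5.8b in the 0-shot column.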