Update README.md
README.md (changed)
@@ -51,17 +51,18 @@ Evaluation of the model was conducted using the PoLL (Pool of LLM) technique, as
(two per evaluator). The evaluators included GPT-4o, Gemini-1.5-pro, and Claude3.5-sonnet.

**Performance Scores (on a scale of 5):**

| Model                                 |  Score  | # params |
|--------------------------------------:|:-------:|:--------:|
| gpt-4o                                |  4.13   |   N/A    |
| mistralai/Mixtral-8x7B-Instruct-v0.1  |  3.71   |  46.7b   |
| gpt-3.5-turbo                         |  3.66   |   175b   |
| mistralai/Mistral-7B-Instruct-v0.2    |  1.98   |  7.25b   |
| cmarkea/bloomz-7b1-mt-sft-chat        |  1.69   |   7.1b   |
| cmarkea/bloomz-3b-dpo-chat            |  1.68   |    3b    |
| cmarkea/bloomz-3b-sft-chat            |  1.51   |    3b    |
| croissantllm/CroissantLLMChat-v0.1    |  1.19   |   1.3b   |
| cmarkea/bloomz-560m-sft-chat          |  1.04   |  0.56b   |
| OpenLLM-France/Claire-Mistral-7B-0.1  |  0.38   |  7.25b   |

The bloomz-3b-dpo-chat model demonstrates improved performance over its SFT counterpart, particularly in zero-shot contexts, making it a competitive choice for production environments.
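
For reference, here is a minimal sketch of how PoLL-style scores like those above could be aggregated: each of the three judge models named earlier returns two grades per answer, and the final score is the mean over all grades. The `grade_with_judge` helper and the exact prompting scheme are illustrative assumptions, not the evaluation code used for this README.

```python
from statistics import mean
from typing import Callable

# Judges taken from the evaluators listed above.
JUDGES = ["gpt-4o", "gemini-1.5-pro", "claude-3.5-sonnet"]

def poll_score(
    question: str,
    answer: str,
    grade_with_judge: Callable[[str, str, str], float],
) -> float:
    """Average PoLL score for one (question, answer) pair on a 0-5 scale.

    `grade_with_judge(judge, question, answer)` is a hypothetical helper that
    queries one judge model and returns a single grade between 0 and 5.
    """
    grades = []
    for judge in JUDGES:
        # Each evaluator contributes two grades (e.g. two grading passes).
        for _ in range(2):
            grades.append(grade_with_judge(judge, question, answer))
    return mean(grades)
```

A model's table entry would then be the mean of `poll_score` over the whole evaluation set.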