Update README.md
Browse files
README.md
CHANGED
@@ -24,8 +24,8 @@ language:
|
|
24 |
- **License(s):** [llama3.1](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B/blob/main/LICENSE)
|
25 |
- **Model Developers:** Neural Magic
|
26 |
|
27 |
-
Quantized version of [Meta-Llama-3.1-405B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct).
|
28 |
-
It achieves an average score of 78.69 on the [OpenLLM](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) benchmark (version 1), whereas the unquantized model achieves 78.67.
|
29 |
|
30 |
### Model Optimizations
|
31 |
|
|
|
24 |
- **License(s):** [llama3.1](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B/blob/main/LICENSE)
|
25 |
- **Model Developers:** Neural Magic
|
26 |
|
27 |
+
Quantized version of [Meta-Llama-3.1-405B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct). It achieves an average recovery of 99.82% on the [OpenLLM](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) benchmark (version 1), compared to the unquantized model.
|
28 |
+
<!-- It achieves an average score of 78.69 on the [OpenLLM](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) benchmark (version 1), whereas the unquantized model achieves 78.67. -->
|
29 |
|
30 |
### Model Optimizations
|
31 |
|