Update README.md
README.md
CHANGED
@@ -2,15 +2,11 @@
 
 
 ```
-vllm
-|Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
-|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
-|gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.7400|± |0.0121|
-| | |strict-match | 5|exact_match|↑ |0.7415|± |0.0121|
+lm_eval --model vllm --model_args pretrained=nm-testing/Meta-Llama-3-8B-Instruct-FP8-K-V,kv_cache_dtype=fp8,add_bos_token=True --tasks gsm8k --num_fewshot 5 --batch_size auto
 
-vllm (pretrained=nm-testing/Meta-Llama-3-8B-Instruct-FP8-K-V,
+vllm (pretrained=nm-testing/Meta-Llama-3-8B-Instruct-FP8-K-V,kv_cache_dtype=fp8,add_bos_token=True), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: auto
 |Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
 |-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
-|gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.
-| | |strict-match | 5|exact_match|↑ |0.
+|gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.7748|± |0.0115|
+| | |strict-match | 5|exact_match|↑ |0.7763|± |0.0115|
 ```
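As a rough sanity check on the updated table, the reported `Stderr` values can be compared with the standard error of a mean of 0/1 scores. The sketch below assumes the GSM8K test split has 1319 problems and that the standard error is the sample standard deviation divided by the square root of the sample size. Neither assumption is stated in the README, and the helper name `binomial_stderr` is hypothetical.

```python
import math

# Assumption: number of problems in the GSM8K test split (not stated in the README).
GSM8K_TEST_SIZE = 1319


def binomial_stderr(accuracy: float, n: int) -> float:
    """Standard error of the mean of n 0/1 scores, using the sample (n - 1) variance."""
    return math.sqrt(accuracy * (1.0 - accuracy) / (n - 1))


# (exact_match value, reported stderr) copied from the updated results table.
reported = {
    "flexible-extract": (0.7748, 0.0115),
    "strict-match": (0.7763, 0.0115),
}

for name, (acc, stderr) in reported.items():
    estimate = binomial_stderr(acc, GSM8K_TEST_SIZE)
    print(f"{name}: reported {stderr:.4f}, estimated {estimate:.4f}")
```

If the estimates agree with the table to four decimal places, the reported accuracy and stderr are at least mutually consistent under these assumptions. It says nothing about whether the evaluation settings themselves are correct.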