hf-causal-experimental (pretrained=openaccess-ai-collective/mighty-llama-1b,use_accelerate=True,dtype=bfloat16,trust_remote_code=True), limit: None, provide_description: False, num_fewshot: 0, batch_size: 32
|    Task     |Version| Metric |Value |   |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.2355|±  |0.0124|
|             |       |acc_norm|0.2671|±  |0.0129|
|arc_easy     |      0|acc     |0.4444|±  |0.0102|
|             |       |acc_norm|0.4276|±  |0.0102|
|boolq        |      1|acc     |0.5358|±  |0.0087|
|hellaswag    |      0|acc     |0.3784|±  |0.0048|
|             |       |acc_norm|0.5034|±  |0.0050|
|openbookqa   |      0|acc     |0.1580|±  |0.0163|
|             |       |acc_norm|0.2840|±  |0.0202|
|piqa         |      0|acc     |0.6518|±  |0.0111|
|             |       |acc_norm|0.6464|±  |0.0112|
|winogrande   |      0|acc     |0.5422|±  |0.0140|
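
As a reading aid for the Stderr column, the sketch below turns a few of the reported scores into approximate 95% intervals. It assumes a normal approximation (value ± 1.96 × stderr), which is an assumption of this example and not something the evaluation run itself reports; the `results` dictionary and the `confidence_interval` helper are hypothetical names, and the numbers are copied from the table.

```python
# Approximate 95% intervals from reported accuracy and standard error.
# Assumption: a normal approximation (value +/- 1.96 * stderr) is adequate.
Z_95 = 1.96

# (task, metric) -> (value, stderr), copied from the results table.
results = {
    ("arc_challenge", "acc_norm"): (0.2671, 0.0129),
    ("hellaswag", "acc_norm"): (0.5034, 0.0050),
    ("boolq", "acc"): (0.5358, 0.0087),
    ("winogrande", "acc"): (0.5422, 0.0140),
}


def confidence_interval(value, stderr, z=Z_95):
    """Return (low, high) bounds of the interval value +/- z * stderr."""
    half_width = z * stderr
    return value - half_width, value + half_width


intervals = {key: confidence_interval(v, se) for key, (v, se) in results.items()}

for (task, metric), (low, high) in intervals.items():
    print(f"{task:<14}{metric:<9}[{low:.4f}, {high:.4f}]")
```

The same helper can be applied to any other row by adding its value and stderr to `results`.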