abhshkp/litm-benchmark-suite-v4

lost-in-the-middle

162 kB

1 contributor

History: 84 commits

Fix scoring bug in Exp 4: check all numbers for expected answer, not just first number

3e790f1 verified 7 days ago
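
The commit message above describes a scoring change: look for the expected answer among all numbers in a response rather than only the first. The repository's actual scoring code is not shown here, so the following is only a minimal sketch of that idea. The function names, the regular expression, and the restriction to non-negative decimal numbers are all assumptions made for illustration.

```python
import re

# Hypothetical pattern: matches non-negative integers and decimals only.
# The real benchmark may parse numbers differently.
NUMBER_RE = re.compile(r"\d+(?:\.\d+)?")


def extract_numbers(text):
    """Return every number found in the text, in order of appearance."""
    return [float(m) for m in NUMBER_RE.findall(text)]


def score_first_number(response, expected):
    """Earlier behavior as described: only the first number is compared."""
    numbers = extract_numbers(response)
    return bool(numbers) and numbers[0] == float(expected)


def score_any_number(response, expected):
    """Fixed behavior as described: any number may match the expected answer."""
    target = float(expected)
    return any(n == target for n in extract_numbers(response))


if __name__ == "__main__":
    reply = "There are 3 documents; the answer is 42."
    print(score_first_number(reply, 42))
    print(score_any_number(reply, 42))
```

With a response that mentions an unrelated number before the answer, the first-number check rejects a correct reply while the all-numbers check accepts it. The looser check can also accept a reply that merely mentions the expected value in passing, which is a trade-off of this approach.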