Benchmarks for WRN-2?
#1
by
Tonic
- opened
๐๐ปโโ๏ธare there any benchmarks, common on standard for benchmarking this model (idea?)
Hey Tonic,
We did HumanEval, but it is not the best fit for this type of a model. Weโre in the process of creating our own internal evaluation right now.