Benchmarks for WRN-2?

#1
by Tonic - opened

๐Ÿ™‹๐Ÿปโ€โ™‚๏ธare there any benchmarks, common on standard for benchmarking this model (idea?)

WhiteRabbitNeo org

Hey Tonic,
We did HumanEval, but it is not the best fit for this type of a model. Weโ€™re in the process of creating our own internal evaluation right now.

Sign up or log in to comment