About LLaVA Bench
#52
by
Pistachioo
- opened
I am a Korean developer, and I am interested in Korean benchmarks.
While I was surprised by the model's Korean score compared with other models, I got two curious question
I couldn't find the same score on other models' card. So did you get that score by your own test?
LLAVA Benchmark actually do not provide Korean dataset. So did I wonder how you made it and its step by step details
Also thank you for releasing such a fancy model~