Running 20 Mezura 🥇 20 Compare and evaluate large language model performance across multiple benchmarks
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • 8B • Updated Apr 10, 2025 • 306 • • 354