onekq
posted an update 2 days ago
Mistral Small 3 is SUPER fast and has the highest score for a 20B+ model, but it is still 11 points below Qwen 2.5 Coder 32B.

I believe specialty models are the future. The better you know what you want to do with a model, the more bang you get for your buck. If Mistral scoped this small model to coding only, I'm confident it could beat Qwen.

One day my leaderboard will be dominated by smol models that are excellent at one thing, not monolithic ones costing $$$. I'm looking forward to that.

onekq-ai/WebApp1K-models-leaderboard

To me, what matters most in models:

  1. That they are free software by definition;

  2. That they follow instructions well;

  3. That they fit on a consumer-grade GPU and CPU, something like 24 GB of GPU memory and 64 GB of RAM.
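
As a rough illustration of the third point, here is a minimal sketch of the usual back-of-the-envelope estimate: memory for the weights is about parameter count times bits per weight, divided by 8. The function names are hypothetical, the 24B and 32B sizes are taken from the model names, and the estimate covers weights only (no KV cache, activations, or runtime overhead), so real usage will be higher.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 10^9 bytes)."""
    # billions of parameters * bits per weight / 8 bits per byte = GB
    return params_billions * bits_per_weight / 8


def fits(params_billions: float, bits_per_weight: int, budget_gb: float) -> bool:
    """True if the weights alone fit in the given memory budget."""
    return weight_memory_gb(params_billions, bits_per_weight) <= budget_gb


if __name__ == "__main__":
    # Assumed sizes: a 24B model and a 32B model, at 16-bit and 4-bit weights.
    for name, size in [("24B model", 24), ("32B model", 32)]:
        for bits in (16, 4):
            gb = weight_memory_gb(size, bits)
            print(f"{name} @ {bits}-bit: ~{gb:.0f} GB, "
                  f"fits 24 GB GPU: {fits(size, bits, 24)}, "
                  f"fits 64 GB RAM: {fits(size, bits, 64)}")
```

Under these assumptions, both sizes only squeeze into 24 GB of GPU memory when quantized to around 4 bits per weight, which is why quantized builds are the usual choice on consumer hardware.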

Accuracy doesn't matter too much: when I need accuracy, I can use DeepSeek or another larger, truly free-software model.

Supporting companies that deceitfully claim to support "Open Source" is useless.
