@mike-ravkine on Hugging Face: "Let's talk about one of the hidden gems in the ReasonScape evaluation results…"

mike-ravkine

posted an update 13 days ago

Let's talk about one of the hidden gems in the ReasonScape evaluation results, lucky #13: aquif-ai/aquif-3.5-8B-Think

Built on top of the solid Qwen3-8B foundation, aquif-3.5-8B-Think successfully preserves the high performance of the original model while consuming 30-50% fewer reasoning tokens.

The most notable regression vs the base model is in arithmetic: if your workload is math-heavy, this model shows an unfortunate collapse in performance as complexity grows.

The interesting combination of awesome overall performance on simple SVG shape identification with a total inability to recognize more complex shapes like 'House' or 'Arrow' is inherited directly from the base model (but with a ~20% improvement in token utilization).

If you like your reasoning models token-efficient, aquif-3.5-8B-Think is well worth a spin.

Higher resolution, more detailed, interactive plots are available at the m12X explorer: https://reasonscape.com/m12x/explorer/

aquiffoo

12 days ago

Thanks for testing our model on your eval! It's an honor, really. It's also the smallest model in the top 10 for Hard mode.
