@DavidAU on Hugging Face: "The "ERNIE" 21B MOE Distill High Reasoning Fine Tune Invasion: 3 Ernie…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

DavidAU

posted an update 8 days ago

Post

4670

The "ERNIE" 21B MOE Distill High Reasoning Fine Tune Invasion:

3 Ernie 21B-A3B MOE Models (64 experts) fine tuned with Unsloth using Gemini Pro 3, Claude 4.5 Opus, and GLM 4.7 Flash high reasoning datasets.

All benched, all exceeding org model specs too.

https://huggingface.co/DavidAU/models?search=ernie

Enjoy the freedom and added power.

sometimesanotion

8 days ago

Neat! I'd love to see one of these fine-tuned on a tool-calling dataset.

In this post

DavidAU David Belton
sometimesanotion sometimesanotion