Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
DavidAU 
posted an update 8 days ago
Post
4670
The "ERNIE" 21B MOE Distill High Reasoning Fine Tune Invasion:

3 Ernie 21B-A3B MOE Models (64 experts) fine tuned with Unsloth using Gemini Pro 3, Claude 4.5 Opus, and GLM 4.7 Flash high reasoning datasets.

All benched, all exceeding org model specs too.

https://huggingface.co/DavidAU/models?search=ernie

Enjoy the freedom and added power.

Neat! I'd love to see one of these fine-tuned on a tool-calling dataset.