gpt-oss-20b-Malaysian-Reasoning-SFT-v0.1
LoRA SFT of openai/gpt-oss-20b on the initial mesolitica/Malaysian-Reasoning dataset.
- Use kernels-community/vllm-flash-attn3 for Flash Attention 3 with Sink.
- All linear layers with rank 16, alpha 32.
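To make the rank and alpha values concrete, here is a minimal sketch of the arithmetic they imply under standard LoRA, where the adapter update is scaled by alpha / rank and each adapted linear layer gains rank * (in_features + out_features) trainable parameters. The `LoraSettings` class and the 4096 x 4096 layer shape are hypothetical and only for illustration; they are not taken from the training script or from the model's actual dimensions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Hypothetical container for the LoRA hyperparameters listed above."""

    rank: int = 16   # LoRA rank from the model card
    alpha: int = 32  # LoRA alpha from the model card

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / rank.
        return self.alpha / self.rank

    def adapter_params(self, in_features: int, out_features: int) -> int:
        # A is (rank x in_features) and B is (out_features x rank),
        # so one adapted linear layer adds rank * (in + out) parameters.
        return self.rank * (in_features + out_features)


settings = LoraSettings()

# Illustrative layer shape only (assumption, not the real gpt-oss-20b size).
example_params = settings.adapter_params(4096, 4096)

print(f"scaling = {settings.scaling}")
print(f"adapter params for a 4096x4096 layer = {example_params}")
```

In the PEFT library these two values correspond to the `r` and `lora_alpha` fields of `LoraConfig`; the linked training script remains the reference for the exact configuration used.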
Source code
Source code at https://github.com/Scicom-AI-Enterprise-Organization/small-ablation/blob/main/malaysian-reasoning/20b.sh
Acknowledgement
Special thanks to https://www.scitix.ai/ for the H100 node!