umangkaushik
ubermenchh
AI & ML interests
None yet
Organizations
models
33

ubermenchh/Qwen2.5-3B-open-r1-math
Text Generation
โข
3B
โข
Updated

ubermenchh/Qwen2.5-3B-open-r1-math-lora
Updated

ubermenchh/Qwen2.5-3B-openr1-math
Text Generation
โข
Updated

ubermenchh/Qwen2.5-0.5B-openr1-math
Updated

ubermenchh/llama3.1-8B-gsm8k-grpo
8B
โข
Updated

ubermenchh/SmolLM2-SFT-sarvam-samvaad
Text Generation
โข
0.2B
โข
Updated

ubermenchh/SmolLM2-360M-r1-grpo-countdown
Updated

ubermenchh/SmolLM2-DPO-ultrafeedback-binarized-preferences
Text Generation
โข
0.1B
โข
Updated
โข
1

ubermenchh/SmolLM2-DPO
Text Generation
โข
0.1B
โข
Updated

ubermenchh/SmolLM2-FT-the-smol-stack
Text Generation
โข
0.1B
โข
Updated