nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation β’ 32B β’ Updated about 1 month ago β’ 1.57M β’ 712
naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation β’ 33B β’ Updated Jan 6 β’ 60.6k β’ 396
naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-0.5B Text Generation β’ 0.6B β’ Updated Jul 21, 2025 β’ 3.1k β’ 83
naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B Text Generation β’ Updated Sep 16, 2025 β’ 60.4k β’ 219
naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-1.5B Text Generation β’ 2B β’ Updated Oct 2, 2025 β’ 2.65k β’ 154
nvidia/Llama-3.1-Nemotron-Nano-8B-v1 Text Generation β’ 8B β’ Updated Oct 15, 2025 β’ 74.3k β’ β’ 221
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated Aug 27, 2025 β’ 31.9k β’ 112
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated Aug 27, 2025 β’ 31.9k β’ 112
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7, 2025 β’ 286