AXONVERTEX-AI-RESEARCH/Orchestrator-8B-Q8_0-GGUF • Reinforcement Learning • 8B • Updated 5 days ago • 269 • 6
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 • Reinforcement Learning • 8B • Updated Mar 28 • 4.44k • 187
emiliodavola/french-solitaire-dqn-single-solution • Reinforcement Learning • Updated 21 days ago • 63 • 2