QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search Paper • 2602.09901 • Published 5 days ago • 6
Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training Paper • 2602.00747 • Published 15 days ago • 9
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published Nov 10, 2025 • 20
Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors Paper • 2601.15625 • Published 24 days ago • 8
nvidia/Nemotron-Instruction-Following-Chat-v1 Viewer • Updated Dec 15, 2025 • 288k • 1.34k • 119