S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models Paper β’ 2604.01168 β’ Published 20 days ago β’ 7
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper β’ 2604.01591 β’ Published 20 days ago β’ 41
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper β’ 2604.04323 β’ Published 16 days ago β’ 41
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper β’ 2604.10098 β’ Published 11 days ago β’ 75
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper β’ 2604.12627 β’ Published 8 days ago β’ 98
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper β’ 2604.06132 β’ Published 15 days ago β’ 115
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper β’ 2604.08377 β’ Published 13 days ago β’ 282
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 6 days ago β’ 59