Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM Paper • 2512.21580 • Published Dec 25, 2025 • 8 • 4
Knowing Isn't Understanding: Re-grounding Generative Proactivity with Epistemic and Behavioral Insight Paper • 2602.15259 • Published 3 days ago • 2
The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems Paper • 2602.15382 • Published 2 days ago • 2 • 2
Visual Persuasion: What Influences Decisions of Vision-Language Models? Paper • 2602.15278 • Published 3 days ago • 3 • 2
Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models Paper • 2602.15772 • Published 2 days ago • 6 • 2
Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? Paper • 2602.14111 • Published 4 days ago • 54 • 3
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published 2 days ago • 11 • 2
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities Paper • 2602.15327 • Published 3 days ago • 2 • 2
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published 6 days ago • 44 • 4
HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam Paper • 2602.13964 • Published 5 days ago • 2 • 2
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 4 days ago • 24 • 4
Causal-JEPA: Learning World Models through Object-Level Latent Interventions Paper • 2602.11389 • Published 8 days ago • 4 • 2
TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models Paper • 2602.15449 • Published 2 days ago • 5 • 3
STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens Paper • 2602.15620 • Published 2 days ago • 3 • 2
A Trajectory-Based Safety Audit of Clawdbot (OpenClaw) Paper • 2602.14364 • Published 4 days ago • 16 • 2
Geometry-Aware Rotary Position Embedding for Consistent Video World Model Paper • 2602.07854 • Published 11 days ago • 2 • 2
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers Paper • 2602.15322 • Published 3 days ago • 7 • 2