pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated about 21 hours ago • 46
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 3 days ago • 83
Query-focused and Memory-aware Reranker for Long Context Processing Paper • 2602.12192 • Published 15 days ago • 47
SkillOrchestra: Learning to Route Agents via Skill Transfer Paper • 2602.19672 • Published 4 days ago • 51
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 18 days ago • 211
Creative Writing Datasets Collection High-quality creative writing and storytelling data. • 35 items • Updated 3 days ago • 3
Instruction & Reasoning Collection Datasets for instruction following, code, and reasoning. • 13 items • Updated about 23 hours ago • 7
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 13 days ago • 43
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 27 items • Updated 3 days ago • 31
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 16 days ago • 52
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published 14 days ago • 146
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published 17 days ago • 197
OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments Article • 15 days ago • 30