Guided Self-Evolving LLMs with Minimal Human Supervision • Paper • 2512.02472 • Published 3 days ago • 46
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models • Paper • 2512.02556 • Published 3 days ago • 161
AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models • Paper • 2511.10017 • Published 23 days ago • 6
MuSc-V2: Zero-Shot Multimodal Industrial Anomaly Classification and Segmentation with Mutual Scoring of Unlabeled Samples • Paper • 2511.10047 • Published 22 days ago • 1
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents • Paper • 2511.07685 • Published 25 days ago • 9
Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation • Paper • 2511.10547 • Published 22 days ago • 4
Music Flamingo: Scaling Music Understanding in Audio Language Models • Paper • 2511.10289 • Published 22 days ago • 10
Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following • Paper • 2511.10507 • Published 22 days ago • 5
AlphaResearch: Accelerating New Algorithm Discovery with Language Models • Paper • 2511.08522 • Published 24 days ago • 15
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training • Paper • 2511.01918 • Published Nov 1 • 11
Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO • Paper • 2511.09780 • Published 23 days ago • 26
Black-Box On-Policy Distillation of Large Language Models • Paper • 2511.10643 • Published 22 days ago • 46
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation • Paper • 2511.09057 • Published 23 days ago • 75
Stemming Hallucination in Language Models Using a Licensing Oracle • Paper • 2511.06073 • Published 27 days ago • 1
Agentic Refactoring: An Empirical Study of AI Coding Agents • Paper • 2511.04824 • Published 29 days ago • 4
Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance • Paper • 2511.07499 • Published 25 days ago • 5
WMPO: World Model-based Policy Optimization for Vision-Language-Action Models • Paper • 2511.09515 • Published 23 days ago • 17