A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports Paper • 2510.02190 • Published 11 days ago • 18
UniVideo: Unified Understanding, Generation, and Editing for Videos Paper • 2510.08377 • Published 4 days ago • 62
Modeling All-Atom Glycan Structures via Hierarchical Message Passing and Multi-Scale Pre-training Paper • 2506.01376 • Published Jun 2
VideoScore2: Think before You Score in Generative Video Evaluation Paper • 2509.22799 • Published 17 days ago • 24
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published 17 days ago • 66
Hallucination Score: Towards Mitigating Hallucinations in Generative Image Super-Resolution Paper • 2507.14367 • Published Jul 18
PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models Paper • 2505.22523 • Published May 28 • 7
IEMaster/OmniEdit-Filtered-1.2M-Step1X-Distill-20250711-Data115500 Viewer • Updated Jul 11 • 111k • 8
IEMaster/OmniEdit-Filtered-1.2M-Step1X-Distill-20250711-Data115500 Viewer • Updated Jul 11 • 111k • 8
IEMaster/WorldEditorDataset-test4-Filtered-threshold_10_10_total5729-20250708 Viewer • Updated Jul 7 • 5.73k • 5
IEMaster/WorldEditorDataset-test4-Filtered-threshold_10_10_total5729-20250708 Viewer • Updated Jul 7 • 5.73k • 5
IEMaster/WorldEditorDataset-test4-Filtered-threshold_9_9_total36938-20250705 Viewer • Updated Jul 5 • 36.9k • 6
IEMaster/WorldEditorDataset-test4-Filtered-threshold_8_8_total52974-20250705 Viewer • Updated Jul 5 • 53k • 6
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published May 21 • 53
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation Paper • 2506.03930 • Published Jun 4 • 26
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Paper • 2506.03295 • Published Jun 3 • 17
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Paper • 2505.20139 • Published May 26 • 19
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation Paper • 2505.14640 • Published May 20 • 16
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations Paper • 2504.00824 • Published Apr 1 • 43