OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 1 day ago • 29
Phi-4 (All Versions) Collection Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 4 items • Updated 5 days ago • 28
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Paper • 2501.03226 • Published 12 days ago • 35
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 16 days ago • 95
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Paper • 2501.01895 • Published 15 days ago • 48
Executable Code Actions Elicit Better LLM Agents Paper • 2402.01030 • Published Feb 1, 2024 • 43
Article 🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 16 days ago • 37
How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation Paper • 2412.18573 • Published 25 days ago • 1
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines Paper • 2310.03714 • Published Oct 5, 2023 • 33
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 30 days ago • 85
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 24 days ago • 94
YuLan-Mini: An Open Data-efficient Language Model Paper • 2412.17743 • Published 26 days ago • 64