PhD: A Prompted Visual Hallucination Evaluation Dataset Paper • 2403.11116 • Published Mar 17 • 1
HMoE: Heterogeneous Mixture of Experts for Language Modeling Paper • 2408.10681 • Published Aug 20 • 8
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2 • 44
PhD: A Prompted Visual Hallucination Evaluation Dataset Paper • 2403.11116 • Published Mar 17 • 1
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication Paper • 2402.18439 • Published Feb 28
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors Paper • 2308.10848 • Published Aug 21, 2023 • 1
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs Paper • 2307.16789 • Published Jul 31, 2023 • 98
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models Paper • 2403.08281 • Published Mar 13
Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models Paper • 2310.12818 • Published Oct 19, 2023
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published Nov 4 • 24
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published Nov 4 • 24
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Paper • 2407.07061 • Published Jul 9 • 26
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Paper • 2407.07061 • Published Jul 9 • 26
AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability Paper • 2405.14129 • Published May 23 • 12
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper • 2405.12981 • Published May 21 • 28