purplelightning 's Collections QuantAgents
updated
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world
Markets?
Paper
• 2510.02209
• Published • 57
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial
Trading
Paper
• 2509.05080
• Published
TradingGroup: A Multi-Agent Trading System with Self-Reflection and
Data-Synthesis
Paper
• 2508.17565
• Published • 1
QTMRL: An Agent for Quantitative Trading Decision-Making Based on
Multi-Indicator Guided Reinforcement Learning
Paper
• 2508.20467
• Published
AlphaAgents: Large Language Model based Multi-Agents for Equity
Portfolio Constructions
Paper
• 2508.11152
• Published • 1
Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated
Alphas in Quant Trading
Paper
• 2509.01393
• Published
QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading
Paper
• 2509.09995
• Published • 16
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Paper
• 2512.15687
• Published • 22
LongCat-Flash-Thinking-2601 Technical Report
Paper
• 2601.16725
• Published • 180
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Paper
• 2601.18778
• Published • 42
Self-Distillation Enables Continual Learning
Paper
• 2601.19897
• Published • 36
Agentic Reasoning for Large Language Models
Paper
• 2601.12538
• Published • 204
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper
• 2601.16206
• Published • 87
Learning to Discover at Test Time
Paper
• 2601.16175
• Published • 45
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
Image-Text-to-Text
• 28B • Updated • 195k
• • 2.85k
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery
Paper
• 2601.19325
• Published • 81
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization
Paper
• 2601.21358
• Published • 7
Text Generation
• 2B • Updated • 39
• • 2
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling
Paper
• 2603.04791
• Published • 20
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs
Paper
• 2603.09906
• Published • 75
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2
Image-Text-to-Text
• 10B • Updated • 4.51k
• 167
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
Paper
• 2603.22117
• Published • 29
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
Paper
• 2603.21065
• Published • 77
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper
• 2603.13398
• Published • 155
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
Paper
• 2603.08262
• Published • 42
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper
• 2604.01658
• Published • 55
QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies
Paper
• 2604.15151
• Published • 16
Generative Recursive Reasoning
Paper
• 2605.19376
• Published • 25
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration
Paper
• 2605.20025
• Published • 132
NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized Research Automation
Paper
• 2605.10813
• Published • 13
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
Paper
• 2605.08083
• Published • 66