- No More Adam: Learning Rate Scaling at Initialization is All You Need
  Paper • 2412.11768 • Published • 41
- TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
  Paper • 2412.14161 • Published • 44
- HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments
  Paper • 2408.10945 • Published • 9
- PDFTriage: Question Answering over Long, Structured Documents
  Paper • 2309.08872 • Published • 53
Ron Wolf
ron-wolf
AI & ML interests
None yet
Recent Activity
upvoted a collection 3 days ago
EVA Gen 0.0
liked a model 3 days ago
EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2
upvoted a collection 3 days ago
Recommended large models
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet