timm/vit_tiny_patch16_dinov3_qkvb.eupe_lvd1689m Image Feature Extraction • 5.49M • Updated 12 days ago • 59 • 2
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 16 days ago • 35
Draft-OPD: On-Policy Distillation for Speculative Draft Models Paper • 2605.29343 • Published 12 days ago • 32