Align-TI Collection This is the official set of weights for the paper “Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions.” • 4 items • Updated 1 day ago • 1
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning Paper • 2508.21113 • Published Aug 28 • 109