TFPI Collection: Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners • 14 items • Updated Nov 7