周 昊天
zhqi6m
AI & ML interests
None yet
Recent Activity
upvoted a paper about 11 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards
liked a model about 19 hours ago
tencent/Hy-MT2-1.8B
Organizations
None yet