Deqing Fu PRO
deqing
AI & ML interests
None yet
Recent Activity
published a model 24 minutes ago
deqing/vanilla-llama-3.2-1B-dclm-100BT-v1
updated a model about 2 hours ago
deqing/vanilla-llama-3.2-1B-fineweb-sample-100BT-v4
upvoted a paper about 2 hours ago
Convergent Evolution: How Different Language Models Learn Similar Number Representations