Weijing Huang's picture

3 7 40

Weijing Huang

waleking

AI & ML interests

Language Models

Recent Activity

liked a dataset 11 days ago

OpenStellarTeam/Chinese-SimpleQA

liked a dataset 18 days ago

allenai/olmOCR-mix-0225

upvoted a paper 29 days ago

TransMLA: Multi-head Latent Attention Is All You Need

View all activity

Organizations

None yet

waleking's activity

upvoted a paper 29 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 47

upvoted a paper about 1 month ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5 • 15

upvoted an article about 1 month ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

Jan 31

• 39

upvoted a paper about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted an article 3 months ago

Article

Deriving DPO's Loss

By

•

Dec 24, 2024

• 26