Hu

Alexhu1999

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 9 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

published a dataset 3 months ago

Alexhu1999/qwen3_embedings

upvoted 2 papers 9 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 13 days ago • 141

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 12 days ago • 82

published a dataset 3 months ago

Alexhu1999/qwen3_embedings

Updated Nov 2, 2025

updated a Space 3 months ago

Trackio

🚀

Track and visualize project metrics

published a Space 3 months ago

Trackio

🚀

Track and visualize project metrics

updated a model 4 months ago

Alexhu1999/cerebras_kto_iseuiuc

Updated Sep 21, 2025

published a model 4 months ago

Alexhu1999/cerebras_kto_iseuiuc

Updated Sep 21, 2025

updated a dataset 4 months ago

Alexhu1999/maicrl

Updated Sep 19, 2025

published a dataset 4 months ago

Alexhu1999/maicrl

Updated Sep 19, 2025

liked 5 datasets 4 months ago

updated a model 5 months ago

Alexhu1999/lfm2_vl

1B • Updated Sep 1, 2025 • 1

liked a model 5 months ago

NexaAI/OmniNeural-4B

Any-to-Any • Updated Nov 7, 2025 • 26 • 160

updated a model 5 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever

4B • Updated Aug 15, 2025

updated a model 6 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever-120steps

4B • Updated Aug 14, 2025

published a model 6 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever-120steps

4B • Updated Aug 14, 2025

updated a model 6 months ago

Alexhu1999/Qwen3-4B-DAPO-email-retriever

4B • Updated Aug 13, 2025
