Zeyu Qin

qqqzzzyyy

https://alan-qin.github.io/

Alan-Qin

AI & ML interests

Trustworthy ML, AI safety

Recent Activity

liked a model 11 days ago

LLM-LAT/robust-llama3-8b-instruct

Organizations

None yet

qqqzzzyyy's activity

upvoted a paper 5 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 9 days ago • 97

upvoted a paper 17 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 18 days ago • 43

upvoted a paper about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

upvoted 2 papers 3 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77

upvoted a collection 5 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated 27 days ago • 70

upvoted a paper 5 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 48

upvoted 2 articles 6 months ago

Let's talk about LLM evaluation

Article • May 23, 2024 • 150

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 310

upvoted 3 collections 7 months ago

upvoted a paper 8 months ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 68

upvoted 2 papers 10 months ago

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21, 2024 • 29

Localizing Paragraph Memorization in Language Models

Paper • 2403.19851 • Published Mar 28, 2024 • 15

upvoted a paper 11 months ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

upvoted a collection 11 months ago

Handbook v0.1 models and datasets

Collection

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24

upvoted a paper 11 months ago

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 17

upvoted 2 papers 12 months ago

Beyond Training Objectives: Interpreting Reward Model Divergence in Large Language Models

Paper • 2310.08164 • Published Oct 12, 2023 • 4

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

Paper • 2402.07610 • Published Feb 12, 2024 • 8