Jiayi Yuan

jy-yuan

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Transformers backend integration in SGLang

upvoted a paper about 2 months ago

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

authored a paper 3 months ago

Robust Tickets Can Transfer Better: Drawing More Transferable Subnetworks in Transfer Learning

Organizations

None yet

upvoted an article about 1 month ago

Article

Transformers backend integration in SGLang

and 4 others • Jun 23 • 53

upvoted a paper about 2 months ago

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Paper • 2402.02750 • Published Feb 5, 2024 • 4

authored 7 papers 3 months ago

Robust Tickets Can Transfer Better: Drawing More Transferable Subnetworks in Transfer Learning

Paper • 2304.11834 • Published Apr 24, 2023 • 1

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Paper • 2402.02750 • Published Feb 5, 2024 • 4

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches

Paper • 2407.01527 • Published Jul 1, 2024

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Paper • 2506.09501 • Published Jun 11 • 18

upvoted 2 papers 3 months ago

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models

Paper • 2505.22662 • Published May 28 • 6

The Science of Evaluating Foundation Models

Paper • 2502.09670 • Published Feb 12 • 1

commented a paper 3 months ago

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Paper • 2506.09501 • Published Jun 11 • 18

upvoted a paper 3 months ago

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Paper • 2506.09501 • Published Jun 11 • 18

authored a paper 6 months ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published Mar 20 • 76

upvoted a paper 6 months ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published Mar 20 • 76
