Guangxuan Xiao

Guangxuan-Xiao

http://guangxuanx.com

AI & ML interests

Efficient Machine Learning

Recent Activity

upvoted a collection 6 days ago

🧠 Reasoning datasets

authored a paper 22 days ago

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

authored a paper 5 months ago

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Guangxuan-Xiao's activity

commented a paper 5 months ago

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Paper • 2410.10819 • Published Oct 14, 2024 • 7

New activity in mit-han-lab/opt-13b-smoothquant over 2 years ago

how to load and use model?

#1 opened over 2 years ago by