arxiv:2605.16839
Dongwon Jo
dongwonjo
AI & ML interests
Efficient AI, Model Compression, Sparse Attention, Quantization, Pruning, Generative Model, Large Language Model, Diffusion
Recent Activity
upvoted a paper about 13 hours ago
CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection authored a paper about 13 hours ago
CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection upvoted a paper 3 months ago
Squeezing Large-Scale Diffusion Models for Mobile