arxiv:2411.19574
xumingyu
xumingyu16
·
AI & ML interests
None yet
Recent Activity
updated
a model
17 days ago
xumingyu16/KV_shifting_2.9B
upvoted
a
collection
20 days ago
Attention
commented
a paper
24 days ago
KV Shifting Attention Enhances Language Modeling
Organizations
datasets
None public yet