Peter
Tempo14
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
updated a collection 6 days ago: 3D
updated a collection 11 days ago: new architecture
Organizations
None yet
Collections
56
- Selective Attention Improves Transformer
  Paper • 2410.02703 • Published • 23
- Differential Transformer
  Paper • 2410.05258 • Published • 166
- TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
  Paper • 2410.05076 • Published • 6
- SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs
  Paper • 2410.13276 • Published • 25
models
5
datasets
None public yet