Optimizing Large Language Model Training Using FP4 Quantization • arXiv:2501.17116 • Published Jan 28, 2025
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing • arXiv:2501.00658 • Published Dec 31, 2024
Nested Attention: Semantic-aware Attention Values for Concept Personalization • arXiv:2501.01407 • Published Jan 2, 2025
Byte Latent Transformer: Patches Scale Better Than Tokens • arXiv:2412.09871 • Published Dec 13, 2024