Felix Fuentes
ffuhu
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 18 hours ago
The Curse of Depth in Large Language Models
upvoted
a
paper
about 18 hours ago
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and
Post-LN
upvoted
a
paper
about 18 hours ago
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Organizations
models
None public yet
datasets
None public yet