Doge family of small language models.
Loser Cheems
JingzeShi
AI & ML interests
I like training small languge models.
Recent Activity
posted
an
update
about 1 month ago
Is it time to start developing sparse attention again?
https://github.com/SmallDoges/flash-sparse-attention
upvoted
a
paper
about 2 months ago
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
upvoted
an
article
about 2 months ago
From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels