Article Model statistics of the 50 most downloaded entities on Hugging Face By lbourdois • 3 days ago • 21
BERT Hash Nano Models Collection Set of BERT models with a modified embeddings layer • 3 items • Updated 9 days ago • 8
Article ModernVBERT: Towards Smaller Visual Document Retrievers By paultltc and 4 others • 13 days ago • 39
Article Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 14 days ago • 8
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT Paper • 2509.19284 • Published 23 days ago • 22
Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation Sep 2 • 66
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation? Paper • 2508.19827 • Published Aug 27 • 33
Splade Models Collection The collection includes Splade models from different authors that can be loaded thanks to the Sparse Encoder modules of Sentence Transformers • 16 items • Updated Jul 30 • 8
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 Jul 1 • 123
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published Jul 14 • 88
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning Paper • 2506.24119 • Published Jun 30 • 50
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 215
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12 • 82