7 9 20

Louis Ulmer

lulmer

lulmer

AI & ML interests

NLP (semantic search, topic generation) Computer vision (object detection) Diffusion Models

Recent Activity

upvoted a paper about 2 months ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

upvoted an article 2 months ago

Bringing Fusion Down to Earth: ML for Stellarator Optimization

liked a model 2 months ago

black-forest-labs/FLUX.1-dev

View all activity

Organizations

upvoted a paper about 2 months ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Paper • 2507.10524 • Published Jul 14 • 69

upvoted an article 2 months ago

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

•

Jul 2

• 73

liked a model 2 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 1.39M • • 11.4k

New activity in Qwen/Qwen2.5-VL-7B-Instruct 3 months ago

Exception: Could not find the transformer layer class to wrap in the model.

👍 3

#2 opened 7 months ago by

atishay-scribe

upvoted an article 3 months ago

Article

🐯 Liger GRPO meets TRL

and 5 others •

May 25

• 49

liked a Space 3 months ago

3.15k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 4 months ago

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

and 5 others •

May 21

• 34

liked a dataset 4 months ago

MaziyarPanahi/OpenMathReasoning_ShareGPT

Viewer • Updated Apr 24 • 2.4M • 341 • 2

liked 4 datasets 5 months ago

liked a model 6 months ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12 • 138k • • 1.93k

upvoted a paper 6 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 139

liked 2 datasets 6 months ago

Josephgflowers/Finance-Instruct-500k

Viewer • Updated Mar 1 • 518k • 1.89k • 155

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated Mar 25 • 251k • 6.11k • 199

liked a model 8 months ago

lightblue/lb-reranker-0.5B-v1.0

Text Generation • 0.5B • Updated Jan 21 • 1.22k • 72

upvoted a collection 10 months ago

Hymba

Collection

A series of Hybrid Small Language Models. • 3 items • Updated 3 days ago • 31

liked a model 10 months ago

chentong00/propositionizer-wiki-flan-t5-large

0.8B • Updated Dec 13, 2023 • 4.97k • 47

upvoted a paper 11 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 152

Louis Ulmer

AI & ML interests

Recent Activity

Organizations

lulmer's activity

Bringing Fusion Down to Earth: ML for Stellarator Optimization

Exception: Could not find the transformer layer class to wrap in the model.

🐯 Liger GRPO meets TRL

The Ultra-Scale Playbook

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance