Lee Junbum PRO

beomi

https://junbuml.ee

AI & ML interests

AI/ML GDE. Advancing Low-Resource Language Open Access LLM

beomi's activity

upvoted 2 papers about 13 hours ago

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published 5 days ago • 35

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 4 days ago • 166

upvoted 3 papers 3 days ago

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

Paper • 2502.06533 • Published 7 days ago • 16

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 6 days ago • 38

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 5 days ago • 125

liked a dataset 6 days ago

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15 • 25k • 4.58k • 15

liked a dataset 7 days ago

exp-models/korean-reasoning-mixture-20250203-preview

Viewer • Updated 14 days ago • 62.9k • 61 • 6

liked 2 datasets 8 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 3 days ago • 228k • 52.3k • 522

ANTEGRAL/korean-persona-chat-v1

Viewer • Updated 10 days ago • 991 • 21 • 8

upvoted a paper 10 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 12 days ago • 12

upvoted a paper 11 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 13 days ago • 175

liked a dataset 13 days ago

simplescaling/s1K

Viewer • Updated 7 days ago • 1k • 3.06k • 170

upvoted a paper 14 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 17 days ago • 102

liked a model 15 days ago

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated 15 days ago • 661k • 771

upvoted an article 15 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 21 days ago • 371

upvoted a paper 17 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 18 days ago • 53

upvoted a paper 18 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 19 days ago • 54

liked a model 19 days ago

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • Updated 19 days ago • 29.3k • 244

liked a Space 19 days ago

Qwen2.5 Max Demo 🐢

Space • Send messages for chatbot responses • 526

updated a collection 24 days ago

Korean Instruction Dataset

Collection • 5 items • Updated 24 days ago • 7