124 645 1

Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

commented on a paper 3 days ago

Why Language Models Hallucinate

upvoted a paper 4 days ago

Why Language Models Hallucinate

View all activity

Organizations

None yet

commented a paper 3 days ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 7 days ago • 151 •

commented a paper 4 days ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 7 days ago • 151 •

commented 2 papers 21 days ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published 28 days ago • 8 •

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published 28 days ago • 8 •

commented a paper 22 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 23 days ago • 36 •

commented a paper about 1 month ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 235 •

commented 5 papers about 2 months ago

commented 2 papers 4 months ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 23 •

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 23 •

commented 5 papers 5 months ago

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Paper • 2504.05262 • Published Apr 7 • 11 •

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Paper • 2504.05262 • Published Apr 7 • 11 •

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 130 •

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 87 •

Generative Evaluation of Complex Reasoning in Large Language Models

Paper • 2504.02810 • Published Apr 3 • 14 •

commented 2 papers 6 months ago

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 25 •

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 25 •

Michael Barry

AI & ML interests

Recent Activity

Organizations

MichaelBarryUK's activity