124 644 1

Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Recent Activity

commented on a paper about 21 hours ago

Why Language Models Hallucinate

upvoted a paper 2 days ago

Why Language Models Hallucinate

commented on a paper 2 days ago

Why Language Models Hallucinate

View all activity

Organizations

None yet

commented a paper about 21 hours ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 5 days ago • 129 •

upvoted a paper 2 days ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 5 days ago • 129

commented a paper 2 days ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 5 days ago • 129 •

upvoted a paper 9 days ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published 16 days ago • 14

commented 2 papers 19 days ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published 26 days ago • 8 •

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published 26 days ago • 8 •

upvoted a paper 20 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 21 days ago • 36

commented a paper 20 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 21 days ago • 36 •

commented a paper about 1 month ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 235 •

upvoted 8 papers about 1 month ago

Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe

Paper • 2508.01691 • Published Aug 3 • 9

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3 • 13

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published Aug 4 • 17

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

Paper • 2508.01059 • Published Aug 1 • 33

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 243

commented 3 papers about 2 months ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 119 •

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 119 •

Favicon Trojans: Executable Steganography Via Ico Alpha Channel Exploitation

Paper • 2507.09074 • Published Jul 11 • 6 •

Michael Barry

AI & ML interests

Recent Activity

Organizations

MichaelBarryUK's activity