14 49

Matt Barr

mattbarr

marr75

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

commented a paper about 1 month ago

Top-$nσ$: Not All Logits Are You Need

commented a paper about 1 month ago

Drowning in Documents: Consequences of Scaling Reranker Inference

View all activity

Organizations

mattbarr's activity

upvoted a paper 11 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 15 days ago • 77

commented 2 papers about 1 month ago

Top-$nσ$: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12 • 18 •

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18 • 17 •

upvoted 2 papers about 1 month ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12 • 18

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18 • 17

upvoted a paper 2 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22 • 89

upvoted a paper 4 months ago

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3 • 35

commented a paper 5 months ago

Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 82 •

upvoted a paper 6 months ago

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

Paper • 2406.08451 • Published Jun 12 • 23

upvoted an article 7 months ago

Article

A Complete Guide to Audio Datasets

Dec 15, 2022

• 21

commented a paper 7 months ago

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39 •

upvoted a paper 7 months ago

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

commented a paper 7 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 149 •

upvoted a paper 7 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 149

upvoted a paper 8 months ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 59

upvoted a paper 9 months ago

Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4 • 21

commented a paper 9 months ago

Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2 • 35 •

upvoted a paper 9 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 104

upvoted 2 papers 10 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 604

Sora Generates Videos with Stunning Geometrical Consistency

Paper • 2402.17403 • Published Feb 27 • 16