14 49

Matt Barr

mattbarr

marr75

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

commented a paper about 1 month ago

Top-$nσ$: Not All Logits Are You Need

commented a paper about 1 month ago

Drowning in Documents: Consequences of Scaling Reranker Inference

View all activity

Organizations

mattbarr's activity

upvoted a paper 11 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 15 days ago • 77

upvoted 2 papers about 1 month ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12 • 18

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18 • 17

upvoted a paper 2 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22 • 89

upvoted a paper 4 months ago

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3 • 35

upvoted a paper 6 months ago

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

Paper • 2406.08451 • Published Jun 12 • 23

upvoted an article 7 months ago

Article

A Complete Guide to Audio Datasets

Dec 15, 2022

• 21

upvoted 2 papers 7 months ago

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 149

upvoted a paper 8 months ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 59

upvoted 2 papers 9 months ago

Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4 • 21

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 104

upvoted 5 papers 10 months ago

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27 • 88

Ring Attention with Blockwise Transformers for Near-Infinite Context

Paper • 2310.01889 • Published Oct 3, 2023 • 10

upvoted a paper 12 months ago

LLaMA Pro: Progressive LLaMA with Block Expansion

Paper • 2401.02415 • Published Jan 4 • 53

upvoted 2 papers about 1 year ago

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator

Paper • 2312.04474 • Published Dec 7, 2023 • 30

MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer

Paper • 2311.12052 • Published Nov 18, 2023 • 31