WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models Paper • 2510.22276 • Published 20 days ago • 3
WAON Collection • WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models • 5 items • Updated 17 days ago • 1
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens Paper • 2509.14882 • Published Sep 18 • 1