Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper โข 2410.17243 โข Published Oct 22, 2024 โข 90
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages Paper โข 2407.19672 โข Published Jul 29, 2024 โข 56