Unifying Multimodal Retrieval via Document Screenshot Embedding Paper • 2406.11251 • Published Jun 17 • 9
Vietnamese speech dataset Collection for speech-related tasks: speech-to-text & text-to-speech • 25 items • Updated Oct 6 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper • 2309.14316 • Published Sep 25, 2023 • 7
⚓️ Sailor Language Models Collection Sailor: Open Language Models tailored for South-East Asia (SEA) released by Sea AI Lab. • 18 items • Updated Jul 26 • 16
Optimized Network Architectures for Large Language Model Training with Billions of Parameters Paper • 2307.12169 • Published Jul 22, 2023 • 9
PolyLM: An Open Source Polyglot Large Language Model Paper • 2307.06018 • Published Jul 12, 2023 • 25