Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation Paper • 2501.15907 • Published 4 days ago • 14
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation Paper • 2407.05361 • Published Jul 7, 2024 • 2
SpMis: An Investigation of Synthetic Spoken Misinformation Detection Paper • 2409.11308 • Published Sep 17, 2024
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 56
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications Paper • 2408.11878 • Published Aug 20, 2024 • 54
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 27, 2024 • 62
Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation Paper • 2106.15078 • Published Jun 29, 2021
Pandora: Towards General World Model with Natural Language Actions and Video States Paper • 2406.09455 • Published Jun 12, 2024 • 15
VDC: Versatile Data Cleanser for Detecting Dirty Samples via Visual-Linguistic Inconsistency Paper • 2309.16211 • Published Sep 28, 2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 54