Yuxuan Wang's picture

Yuxuan Wang

ColorfulAI

·

https://patrick-tssn.github.io/

patrick-tssn

AI & ML interests

Multimodal Learning

Organizations

Collections 2

Papers 22

arxiv:2511.21631

arxiv:2510.12720

arxiv:2510.10689

arxiv:2509.25773

models 10

ColorfulAI/M4-Audio-LongVA-7B-Qwen2

Video-Text-to-Text • 9B • Updated Apr 3, 2025 • 7

ColorfulAI/M4-LongVA-7B-Qwen2

Video-Text-to-Text • 8B • Updated Apr 3, 2025 • 15

ColorfulAI/OpenOmni-8B-Llama3-Omni

9B • Updated Apr 2, 2025 • 5 • 1

ColorfulAI/OpenOmni-7B-Qwen2-Omni

9B • Updated Apr 2, 2025 • 7 • 1

ColorfulAI/LongVA-7B-Qwen2-Audio

9B • Updated Apr 1, 2025 • 3

ColorfulAI/LongVA-7B-Qwen2-VoiceAssistant

9B • Updated Apr 1, 2025 • 2

ColorfulAI/Llama-3.1-8B-S2S-Omni

9B • Updated Apr 1, 2025 • 2

ColorfulAI/videollamb-llava-1.5-7b

Video-Text-to-Text • 7B • Updated Sep 9, 2024 • 36 • 4

ColorfulAI/videollamb-mem-llava-1.5-7b

7B • Updated Aug 12, 2024 • 3

ColorfulAI/LSTP-Chat

Image-Text-to-Text • Updated Aug 2, 2024 • 4

datasets 7

ColorfulAI/MoviePuzzle

Viewer • Updated May 14, 2025 • 1 • 12

ColorfulAI/M4-IT

Updated Apr 3, 2025 • 93 • 1

ColorfulAI/VoiceAssistant_units

Viewer • Updated Apr 2, 2025 • 428k • 11

ColorfulAI/LLaVA-NeXT-Speech

Updated Apr 1, 2025 • 40

ColorfulAI/NeedleInAVideoHaystack

Viewer • Updated Jan 22, 2025 • 21 • 63

ColorfulAI/EgoPlan_test

Viewer • Updated Sep 15, 2024 • 923 • 87

ColorfulAI/VideoLLaMB-IT

Viewer • Updated Aug 12, 2024 • 1.03M • 24