- An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
  Paper • 2309.09958 • Published • 18
- TextBind: Multi-turn Interleaved Multimodal Instruction-following
  Paper • 2309.08637 • Published • 7
- AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
  Paper • 2309.16058 • Published • 55
- Qwen Technical Report
  Paper • 2309.16609 • Published • 34
WANG Jiong
wjwow
AI & ML interests
None yet
Recent Activity
updated a dataset 20 days ago
wjwow/FreeMan
upvoted a paper about 1 month ago
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Organizations
None yet
Collections
2
Models
None public yet