JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization Paper β’ 2503.23377 β’ Published Mar 30 β’ 57