This model is the stage 1 checkpoint of one of the thirteen settings, DiT, used in the Law of Vision Representation in MLLMs.

Downloads last month
10
Inference API
Inference API (serverless) does not yet support transformers models for this pipeline type.