Fix vision_config.model_type

#21

Fixed the incorrect model_type in vision_config. With this change, the visual model is loaded correctly on its own instead of pulling in the entire GLM-4V model.
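For context, AutoModel.from_config picks the class to instantiate from config.model_type, so a wrong value in the nested vision_config makes the vision tower load as the full model. A minimal sketch of the dispatch, using the same checkpoint as the script below (the printed value is what the unfixed config reports):

from transformers import AutoConfig

config = AutoConfig.from_pretrained("THUDM/GLM-4.1V-9B-Thinking")
# Before this PR, the nested vision config carries the top-level model_type,
# so anything dispatching on it resolves to the full Glm4vModel.
print(config.vision_config.model_type)  # "glm4v" on the unfixed checkpoint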

A simple reproduction script is as follows:

from transformers import Glm4vForConditionalGeneration


def main():
    # Load the full GLM-4.1V checkpoint; dtype and device placement follow the config.
    model = Glm4vForConditionalGeneration.from_pretrained(
        "THUDM/GLM-4.1V-9B-Thinking",
        dtype="auto",
        device_map="auto",
    )

    visual = model.model.visual
    language = model.model.language_model

    print(f"type(model.model.visual): {type(visual)}")
    print(f"type(model.model.language_model): {type(language)}")

    assert visual.__class__.__name__ == "Glm4vVisionModel", (
        "vision_config mistakenly sets model_type='glm4v', so AutoModel.from_config "
        "instantiates the full Glm4vModel (visual + language) instead of the pure "
        "vision backbone Glm4vVisionModel."
    )


if __name__ == "__main__":
    main()
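On the unfixed checkpoint this script fails with the AssertionError above; once vision_config.model_type is corrected, the assertion passes and model.model.visual is the pure vision backbone.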

Hi @merve, please help review. Thanks!

This issue was also fixed in the PR for GLM-4.6, which renames the model_type as shown there. The field will therefore be updated when transformers 5.0.0 is released; in transformers 4.57.1, glm4_vision does not yet exist, so this will not be merged for now.
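To see why the rename cannot land yet, one can check whether the installed transformers knows the new model_type; a minimal check (CONFIG_MAPPING is the internal registry that AutoConfig dispatches on, so this relies on implementation details rather than public API):

from transformers.models.auto.configuration_auto import CONFIG_MAPPING

# "glm4v" is registered in transformers 4.57.1, while "glm4_vision" is
# expected only from transformers 5.0.0 onward (per the comment above).
print("glm4v" in CONFIG_MAPPING)        # True on 4.57.1
print("glm4_vision" in CONFIG_MAPPING)  # False on 4.57.1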

Understood! Thanks for the clarification. I'll close this PR.

YangKai0616 changed pull request status to closed
