Fix AutoModel not loading model correctly due to config_class inconsistency

#26

by liamclarkza - opened May 27

base: refs/heads/main

←

from: refs/pr/26

Discussion Files changed

-1

liamclarkza

May 27

This fixes an issue when using AutoModel to instantiate the model where the config class instantiated with the model is from the transformers library instead of the model's module. This causes the instantiation to fail with the error below. See this Github issue for more details.

Traceback (most recent call last):
    model = AutoModel.from_pretrained("zhihan1996/DNABERT-2-117M", trust_remote_code=True)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File ".../lib/python3.11/site-packages/transformers/models/auto/auto_factory.py", line 560, in from_pretrained
    cls.register(config.__class__, model_class, exist_ok=True)
  File ".../lib/python3.11/site-packages/transformers/models/auto/auto_factory.py", line 586, in register
    raise ValueError(
ValueError: The model class you are passing has a `config_class` attribute that is not consistent with the config class you passed (model has <class 'transformers.models.bert.configuration_bert.BertConfig'> and you passed <class 'transformers_modules.zhihan1996.DNABERT-2-117M.d064dece8a8b41d9fb8729fbe3435278786931f1.configuration_bert.BertConfig'>. Fix one of those so they match!

Fix AutoModel not loading model correctly due to config_class inconsistency6617c7e3

josecar24

May 30

Same problem encountered. But It does not happen 1 month ago when I use it first time.

GCabas

Jun 12

Same problem for me

Charliep97

Jul 1

I'm having the same issue.

josecar24

Jul 3

I'm having the same issue.

This issue could be fixed following https://huggingface.co/zhihan1996/DNABERT-2-117M/commit/6617c7e3829423fddd80ba03c7c7dc4f8aab4d19

mmokoatle

Aug 29

having the same issue, what is the solution?

GCabas

Aug 29

@mmokoatle This worked for me:
tokenizer = AutoTokenizer.from_pretrained("zhihan1996/DNABERT-2-117M", trust_remote_code=True)
config = BertConfig.from_pretrained("zhihan1996/DNABERT-2-117M")
model = AutoModel.from_pretrained("zhihan1996/DNABERT-2-117M", trust_remote_code=True, config=config)

mmokoatle

Aug 29

@GCabas thanks, that worked...but now I have another error caused by :
hidden_states = model(inputs)[0]

error: "AssertionError: "

GCabas

Aug 29

@mmokoatle Maybe you should show more code :)

mmokoatle

Aug 29

@GCabas , apologies, see full code and error below

import torch
from transformers import AutoTokenizer, AutoModel, BertConfig

tokenizer = AutoTokenizer.from_pretrained("zhihan1996/DNABERT-2-117M", trust_remote_code=True)
config = BertConfig.from_pretrained("zhihan1996/DNABERT-2-117M")
model = AutoModel.from_pretrained("zhihan1996/DNABERT-2-117M", trust_remote_code=True, config=config)

dna = "ACGTAGCATCGGATCTATCTATCGACACTTGGTTATCGATCTACGAGCATCTCGTTAGC"
inputs = tokenizer(dna, return_tensors = 'pt')["input_ids"]
hidden_states = model(inputs)[0] # [1, sequence_length, 768] ###error from this line of code

error: "AssertionError: "

GCabas

Aug 30

@mmokoatle I did an example notebook using this model, maybe you could watch it and resolve your doubts :)
https://www.kaggle.com/code/gabrielcabas/dnabert-for-classification

mmokoatle

Aug 30

@GCabas thank you, i will go through your notebook. Which transformer version are you using?

GCabas

Aug 30

@mmokoatle transformers 4.42.3

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment