load model error

#1
by wdkwdkwdk - opened

RuntimeError: Error(s) in loading state_dict for OpenAIGPTLMHeadModel:
size mismatch for transformer.h.0.attn.bias: copying a param with shape torch.Size([1, 1, 512, 512]) from checkpoint, the shape in current model is torch.Size([1, 1, 513, 513]).
size mismatch for transformer.h.1.attn.bias: copying a param with shape torch.Size([1, 1, 512, 512]) from checkpoint, the shape in current model is torch.Size([1, 1, 513, 513]).
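
One possible reading of this error, offered as a guess rather than a confirmed diagnosis: in the Transformers GPT implementation, `attn.bias` appears to be the causal attention mask buffer, and its last two dimensions follow the number of positions in the model config. If so, the checkpoint was saved with 512 positions while the config being loaded asks for 513. The sketch below only compares the shapes quoted in the error. The helper `find_shape_mismatches` and the two dictionaries are hypothetical names made up for illustration, not part of any library. With PyTorch the same dictionaries could be built from a state dict with `{k: tuple(v.shape) for k, v in state_dict.items()}`.

```python
def find_shape_mismatches(checkpoint_shapes, model_shapes):
    """Return (name, checkpoint_shape, model_shape) for each shared key whose shapes differ.

    Hypothetical helper for illustration only; not a Transformers or PyTorch API.
    """
    mismatches = []
    for name, ckpt_shape in checkpoint_shapes.items():
        model_shape = model_shapes.get(name)
        # Skip keys the model does not have; report only real shape conflicts.
        if model_shape is not None and tuple(model_shape) != tuple(ckpt_shape):
            mismatches.append((name, tuple(ckpt_shape), tuple(model_shape)))
    return mismatches


# Shapes copied from the error message above.
checkpoint_shapes = {
    "transformer.h.0.attn.bias": (1, 1, 512, 512),
    "transformer.h.1.attn.bias": (1, 1, 512, 512),
}
model_shapes = {
    "transformer.h.0.attn.bias": (1, 1, 513, 513),
    "transformer.h.1.attn.bias": (1, 1, 513, 513),
}

mismatches = find_shape_mismatches(checkpoint_shapes, model_shapes)
for name, ckpt_shape, model_shape in mismatches:
    # The last dimension is assumed to be the number of positions.
    print(f"{name}: checkpoint positions={ckpt_shape[-1]}, model positions={model_shape[-1]}")
```

If that reading is right, the thing to check is whether the position count in the config matches the one the checkpoint was trained with, and whether the installed Transformers version matches the one used to save the model.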