Purge duplicate "decoder.weight", rely on tied weights instead c0e4443 Tom Aarsen committed 27 days ago
Update the arch: ModernBertModel to ModernBertForMaskedLM 290243f Tom Aarsen committed on Dec 11, 2024