Add `"add_prefix_space": true,`; this allows for much stronger token-level performance (e.g. NER, ColBERT) (#48) b7cc329 bclavie tomaarsen committed 1 day ago
Purge duplicate "decoder.weight", rely on tied weights instead c0e4443 Tom Aarsen committed 26 days ago
Update the arch: ModernBertModel to ModernBertForMaskedLM 290243f Tom Aarsen committed on Dec 11, 2024
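
For context on the `add_prefix_space` commit above, here is a minimal sketch of why the flag matters for token-level tasks. It is a toy stand-in written for illustration, not the repository's tokenizer: `apply_prefix_space` and `word_forms` are hypothetical helpers, and the assumption is that the byte-level pre-tokenization step prepends a space to input that does not already start with one. Under that assumption, words that are passed in one at a time (as in word-level NER) get the same leading-space form they would have in the middle of a sentence.

```python
def apply_prefix_space(text: str, add_prefix_space: bool) -> str:
    """Toy stand-in for a byte-level pre-tokenization step (hypothetical helper)."""
    # Prepend a space only when the flag is on and the text does not
    # already begin with whitespace.
    if add_prefix_space and text and not text[0].isspace():
        return " " + text
    return text


def word_forms(words: list[str], add_prefix_space: bool) -> list[str]:
    """Return the surface form each pre-split word would be tokenized from."""
    # Each word is handled on its own, as in word-level NER pipelines.
    return [apply_prefix_space(w, add_prefix_space) for w in words]


words = ["Paris", "is", "nice"]
without_flag = word_forms(words, add_prefix_space=False)
with_flag = word_forms(words, add_prefix_space=True)

print(without_flag)
print(with_flag)
```

With the flag off, every pre-split word is treated as if it started the text, which is a different form from the space-prefixed one it has inside running text. With the flag on, the two forms match.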