Model
This model is based on nicoladecao/msmarco-word2vec256000-distilbert-base-uncased, which uses a 256k-token vocabulary initialized with word2vec.
It was trained with masked language modeling (MLM) on the MS MARCO corpus for 445k steps on 2x V100 GPUs. See train_mlm.py for the training script.
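As a rough illustration of this kind of MLM training (this is a minimal sketch, not the actual train_mlm.py; the dataset file, batch size, sequence length, and output path are assumptions):

```python
# Minimal MLM training sketch with Hugging Face Transformers.
# Illustrative only: data file, batch size, and output dir are placeholders.
from datasets import load_dataset
from transformers import (
    AutoModelForMaskedLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

base = "nicoladecao/msmarco-word2vec256000-distilbert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForMaskedLM.from_pretrained(base)

# MS MARCO passages as plain text, one passage per line (placeholder file name)
dataset = load_dataset("text", data_files={"train": "msmarco_passages.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=256)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

# Randomly mask tokens for the MLM objective
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="mlm-msmarco",
    per_device_train_batch_size=64,
    max_steps=445_000,  # step count from the description above
    fp16=True,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=collator,
)
trainer.train()
```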
Note: Token embeddings were updated during training!
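The trained checkpoint can be used for masked-token prediction, for example (the model path below is a placeholder for wherever this checkpoint is stored or published):

```python
# Load the trained checkpoint for masked-token prediction.
# "path/to/this-model" is a placeholder, not the actual repository ID.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="path/to/this-model")
print(fill_mask("The capital of France is [MASK]."))
```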