DeBERTaV3-small-ST-AdaptiveLayer-3L-ep2 / sentence_bert_config.json
bobox: all layers trained at every step, 2 epochs, 50% warmup
6302c55 verified
53 Bytes
{
  "max_seq_length": 512,
  "do_lower_case": false
}
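
As a rough illustration of how these two settings might be consumed, the sketch below parses the same JSON with Python's standard library and runs basic type checks. The helper name `load_sentence_bert_config` and the inline `CONFIG_TEXT` string are assumptions made for this example; they are not part of the repository, and a real loader would read the file from disk.

```python
import json

# Inline copy of sentence_bert_config.json (assumption: a real loader
# would read this from the model directory instead).
CONFIG_TEXT = """
{
  "max_seq_length": 512,
  "do_lower_case": false
}
"""


def load_sentence_bert_config(text: str) -> dict:
    """Hypothetical helper: parse the config and sanity-check its fields."""
    config = json.loads(text)

    # max_seq_length caps the number of tokens per input; longer inputs
    # are expected to be truncated by the tokenizer.
    max_len = config.get("max_seq_length")
    if not isinstance(max_len, int) or max_len <= 0:
        raise ValueError("max_seq_length must be a positive integer")

    # do_lower_case controls whether text is lowercased before tokenization.
    if not isinstance(config.get("do_lower_case"), bool):
        raise ValueError("do_lower_case must be a boolean")

    return config


config = load_sentence_bert_config(CONFIG_TEXT)
print(config["max_seq_length"], config["do_lower_case"])
```

With `do_lower_case` set to `false`, input casing is preserved, and `max_seq_length` of 512 means inputs longer than 512 tokens would be cut off rather than encoded in full.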