Model Description
This is a fine-tune of mirth/chonky_distilbert_base_uncased_1, with the goal of seeing whether the model can be improved further by training it on more data.
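Like its base model, this is a token-classification model that predicts chunk boundaries in text. The sketch below shows only the splitting step, kept self-contained by passing in boundary character offsets directly in place of real model predictions; the helper name `split_into_chunks` and the offset-based interface are illustrative, not the model's actual API.

```python
from typing import List

def split_into_chunks(text: str, boundary_ends: List[int]) -> List[str]:
    """Split `text` at the character offsets where a boundary was
    predicted (end offset of each boundary token), dropping empty chunks."""
    chunks = []
    start = 0
    for end in boundary_ends:
        chunk = text[start:end].strip()
        if chunk:
            chunks.append(chunk)
        start = end
    tail = text[start:].strip()
    if tail:
        chunks.append(tail)
    return chunks

# Hand-made boundary offsets standing in for model output.
text = "First paragraph about one topic. Second paragraph about another."
print(split_into_chunks(text, [32]))
# → ['First paragraph about one topic.', 'Second paragraph about another.']
```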
Comparison with chonky_distilbert_base_uncased_1 and chonky_modernbert_base_1
The following diffs were produced by comparing the chunks that each pair of models generated from the same example text. In the diffs, each chunk is delimited by the string -------------------- on a separate line.
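For reference, the diff inputs in that format can be produced with a few lines of Python (a minimal sketch; the function name is illustrative):

```python
DELIM = "-" * 20  # the "--------------------" delimiter used in the diffs

def format_chunks(chunks):
    """Render a list of chunks in the diff-friendly format:
    each chunk followed by the delimiter on its own line."""
    lines = []
    for chunk in chunks:
        lines.append(chunk)
        lines.append(DELIM)
    return "\n".join(lines)

print(format_chunks(["chunk one", "chunk two"]))
```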
mamei16/chonky_distilbert_base_uncased_1.1 vs mirth/chonky_distilbert_base_uncased_1: https://www.diffchecker.com/PQ1EOMQ7/
mamei16/chonky_distilbert_base_uncased_1.1 vs mirth/chonky_modernbert_base_1: https://www.diffchecker.com/iO0l2Ox8/
Training Data, Code and Hardware
The model was fine-tuned for one epoch on mamei16/wikipedia_paragraphs. The training code can be found here. Fine-tuning ran on an RTX 5090 for about 3 hours and 45 minutes.
Model tree for mamei16/chonky_distilbert_base_uncased_1.1
Base model
distilbert/distilbert-base-uncased