Pramodith/bert-sparse-sliding-window-attention
Tags: Fill-Mask, Transformers, Safetensors, bert, Inference Endpoints
1 contributor
History: 3 commits
Latest commit: Pramodith, "Training in progress, step 200" (2340dc7), 12 months ago
File                     Size       Storage  Last commit                     Updated
.gitattributes           1.52 kB    -        initial commit                  12 months ago
config.json              670 Bytes  -        Training in progress, step 10   12 months ago
model.safetensors        440 MB     LFS      Training in progress, step 200  12 months ago
special_tokens_map.json  125 Bytes  -        Training in progress, step 10   12 months ago
tokenizer.json           711 kB     -        Training in progress, step 10   12 months ago
tokenizer_config.json    1.19 kB    -        Training in progress, step 10   12 months ago
training_args.bin        4.54 kB    LFS      Training in progress, step 200  12 months ago
vocab.txt                232 kB     -        Training in progress, step 10   12 months ago

training_args.bin is a pickle file. Detected Pickle imports (8):
    accelerate.utils.dataclasses.DistributedType
    accelerate.state.PartialState
    transformers.trainer_utils.SchedulerType
    transformers.training_args.OptimizerNames
    torch.device
    transformers.training_args.TrainingArguments
    transformers.trainer_utils.IntervalStrategy
    transformers.trainer_utils.HubStrategy
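
The Pickle imports reported for training_args.bin are the modules and classes that the file would import when unpickled. A similar list can be produced locally without executing the pickle by walking its opcode stream with the standard-library pickletools module. The sketch below is an illustration under stated assumptions, not the Hub's actual scanner: the function name list_pickle_imports is hypothetical, it only resolves the common GLOBAL and STACK_GLOBAL patterns, and it is demonstrated on a small in-memory pickle rather than on training_args.bin itself.

```python
import pickle
import pickletools
from fractions import Fraction

# Opcode names that push a text string, store into the memo, or read from it.
STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}
PUT_OPS = {"PUT", "BINPUT", "LONG_BINPUT"}
GET_OPS = {"GET", "BINGET", "LONG_BINGET"}


def list_pickle_imports(data: bytes) -> list[str]:
    """Return sorted 'module.name' references found in a pickle stream.

    Hypothetical helper: it reads opcodes only and never unpickles, so no
    code from the file is executed.
    """
    found = set()
    memo = {}      # memo index -> string stored there (None for non-strings)
    recent = []    # strings pushed so far, in order
    last = None    # string pushed by the previous opcode, if any

    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        if name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, attr = arg.split(" ", 1)
            found.add(f"{module}.{attr}")
            last = None
        elif name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two strings pushed just before.
            if len(recent) >= 2:
                found.add(f"{recent[-2]}.{recent[-1]}")
            last = None
        elif name in STRING_OPS:
            recent.append(arg)
            last = arg
        elif name == "MEMOIZE":
            # MEMOIZE stores the top of the stack at the next free memo index.
            memo[len(memo)] = last
        elif name in PUT_OPS:
            memo[arg] = last
        elif name in GET_OPS:
            # A memoized string (for example a repeated module name) is pushed again.
            last = memo.get(arg)
            if isinstance(last, str):
                recent.append(last)
        else:
            last = None

    return sorted(found)


# Demonstration on a harmless standard-library object.
print(list_pickle_imports(pickle.dumps(Fraction(1, 3))))
```

To inspect a downloaded file, read it in binary mode and pass its bytes to the function. Files written by torch.save may be zip archives that wrap the pickle, in which case the inner pickle member has to be extracted first.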