Pipeline output skips spaces between words

by chancar - opened Aug 29, 2022

Hi there! I have been using this model, and it turns out that when I run it through a pipeline after fine-tuning, the output ignores blank spaces and returns all the words joined together, as in:

[{'entity_group': 'LABEL_0',
'score': 0.4824247,
'word': 'Thedogandthecatwenttothehouse',
'start': 0,
'end': 325}]

I have tried add_prefix_space=True in the tokenizer, but it does not seem to be working. Could someone give me a little push on this? Many thanks in advance.
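One possible workaround, sketched here under assumptions rather than as a confirmed fix: if the 'start' and 'end' values in the pipeline output are character offsets into the original input string, the spaced text can be sliced back out of the input instead of relying on the 'word' field. The helper name restore_entity_text and the sample sentence are hypothetical, used only for illustration.

```python
from typing import Any, Dict, List


def restore_entity_text(text: str, entities: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return copies of the entities with 'word' rebuilt from the input text.

    Assumption: 'start' and 'end' are character offsets into `text`.
    Entities without usable offsets are returned unchanged.
    """
    restored = []
    for entity in entities:
        fixed = dict(entity)  # copy so the pipeline output is not mutated
        start, end = entity.get("start"), entity.get("end")
        if start is not None and end is not None:
            # Slice the original string, which still contains the spaces.
            fixed["word"] = text[start:end]
        restored.append(fixed)
    return restored


# Hypothetical input, shaped like the pipeline output shown above.
sample_text = "The dog and the cat went to the house"
sample_entities = [
    {
        "entity_group": "LABEL_0",
        "score": 0.4824247,
        "word": "Thedogandthecatwenttothehouse",
        "start": 0,
        "end": len(sample_text),
    }
]

restored = restore_entity_text(sample_text, sample_entities)
print(restored[0]["word"])
```

This only post-processes the output; it does not explain why the tokenizer drops the spaces in the first place.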

chancar changed discussion status to closed Aug 30, 2022

chancar changed discussion status to open Aug 30, 2022

chancar changed discussion status to closed Sep 1, 2022

chancar changed discussion status to open Sep 1, 2022

chancar changed discussion status to closed Sep 3, 2022
