Occupying-Mars's picture
Upload folder using huggingface_hub
0ade4d4 verified
raw
history blame contribute delete
350 Bytes
added arbritrary changes that might lead to be faster inference and hence
will be used for all further changes
changes are ::
--remove_input_padding \
this is simple change to get contextual embeddings and patches them together
or something these don't slow it might be better to increase speed
--use_inflight_batching \
increases latency period