added arbritrary changes that might lead to be faster inference and hence will be used for all further changes changes are :: --remove_input_padding \ this is simple change to get contextual embeddings and patches them together or something these don't slow it might be better to increase speed --use_inflight_batching \ increases latency period