A collection of Arabic tokenizers of different sizes, based on SentencePiece and BPE encodings, suitable for training LLMs.