torch numpy dataclasses tiktoken