view article Article There is no such thing as a tokenizer-free lunch By catherinearnett • 30 days ago • 83
Detecting Pretraining Data from Large Language Models Paper • 2310.16789 • Published Oct 25, 2023 • 11