arxiv:2410.22587
Prof. Ivan Yamshchikov
ivan-the-bearable
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
19 days ago
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer
Training
upvoted
a
paper
23 days ago
Toxicity of the Commons: Curating Open-Source Pre-Training Data
upvoted
an
article
23 days ago
Detoxifying the Commons
Organizations
None yet