Richard Diehl Martinez

rdiehlmartinez

AI & ML interests

NLP, Multilingual Language Modeling, Efficient Pre-Training

Recent Activity

updated a dataset about 6 hours ago
pico-lm/pretokenized-dolma
updated a Space about 6 hours ago
pico-lm/README
updated a dataset about 7 hours ago
pico-lm/pretokenized-dolma-tinsy

rdiehlmartinez's activity

New activity in allenai/dolma about 1 month ago

Gzip size of v1.7 is 4.1TB?

2
#47 opened about 1 month ago by rdiehlmartinez
