Tom Aarsen

tomaarsen

AI & ML interests

NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification

Recent Activity

liked a model about 2 hours ago
jinaai/jina-clip-v2
liked a dataset about 3 hours ago
Babelscape/cner
New activity about 3 hours ago
sentence-transformers/all-mpnet-base-v2

Articles

Organizations

tomaarsen's activity

upvoted an article about 7 hours ago
view article
Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

10
upvoted an article 1 day ago
upvoted an article 8 days ago
view article
Article

Releasing the largest multilingual open pretraining dataset

94
upvoted an article 17 days ago
view article
Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

17
upvoted an article 23 days ago
upvoted an article 25 days ago
view article
Article

Visually Multilingual: Introducing mcdse-2b

By marco
37
upvoted an article 30 days ago
view article
Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

41
upvoted an article about 1 month ago
view article
Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

63
upvoted an article about 1 month ago
upvoted 2 articles about 1 month ago
view article
Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By abhinand
30