- Attention Is All You Need
  Paper • 1706.03762 • Published • 122
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 27
- Universal Language Model Fine-tuning for Text Classification
  Paper • 1801.06146 • Published • 8
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 20
Effi PRO
itseffi
AI & ML interests
None yet
Recent Activity
liked a model about 4 hours ago
mlx-community/DeepSeek-V4-Flash-2bit-DQ
liked a Space 5 days ago
smolagents/ml-intern
updated a Space 5 days ago
itseffi/reachy-mini-chat