Building a Custom Arabic Semantic Search Model with Arabic Matryoshka Embeddings for RAG Using Sentence Transformers Sep 25, 2024 • 4
Arabic ModernBERT Collection This collection highlights efforts to enhance Arabic NLP tasks using the latest ModernBERT models. NAMAA-Space/AraModernBert-Topic-Classifier Text Classification • Updated 18 days ago • 131 • 4
Huggingface FineWeb2 Arabic Dataset Portions Collection of a comprehensive dataset of Arabic text sourced from the FineWeb2 project, representing diverse content across Arabic MSA and Dialect. HuggingFaceFW/fineweb-2 Viewer • Updated 21 days ago • 12.5B • 71.4k • 398 Omartificial-Intelligence-Space/FineWeb2-MSA Viewer • Updated Dec 15, 2024 • 907M • 3.15k • 1 Omartificial-Intelligence-Space/FineWeb2-Egyptian-Arabic Viewer • Updated Dec 12, 2024 • 23.9M • 109 • 1 Omartificial-Intelligence-Space/FineWeb2-Moroccan-Arabic Viewer • Updated Dec 12, 2024 • 69.6M • 102 • 1
Omartificial-Intelligence-Space/FineWeb2-Egyptian-Arabic Viewer • Updated Dec 12, 2024 • 23.9M • 109 • 1
Omartificial-Intelligence-Space/FineWeb2-Moroccan-Arabic Viewer • Updated Dec 12, 2024 • 69.6M • 102 • 1
Omartificial-Intelligence-Space/Arabic-Triplet-Matryoshka-V2 Sentence Similarity • Updated 6 days ago • 2.01k • 10
Omartificial-Intelligence-Space/Arabic-mpnet-base-all-nli-triplet Sentence Similarity • Updated 6 days ago • 1.18k • 10
Omartificial-Intelligence-Space/Arabic-all-nli-triplet-Matryoshka Sentence Similarity • Updated 6 days ago • 545 • 2
Omartificial-Intelligence-Space/Arabert-all-nli-triplet-Matryoshka Sentence Similarity • Updated 6 days ago • 1.89k • 10
Omartificial-Intelligence-Space/Marbert-all-nli-triplet-Matryoshka Sentence Similarity • Updated 19 days ago • 492 • 1
Omartificial-Intelligence-Space/Arabic-MiniLM-L12-v2-all-nli-triplet Sentence Similarity • Updated 19 days ago • 519 • 4
Omartificial-Intelligence-Space/Arabic-labse-Matryoshka Sentence Similarity • Updated 19 days ago • 508 • 2
Omartificial-Intelligence-Space/E5-all-nli-triplet-Matryoshka Sentence Similarity • Updated Dec 28, 2024 • 9 • 1
Omartificial-Intelligence-Space/FineWeb2-Najdi-Arabic Viewer • Updated Dec 12, 2024 • 48.4M • 125 • 1
Omartificial-Intelligence-Space/FineWeb2-North-Levantine-Arabic Viewer • Updated Dec 12, 2024 • 223k • 87 • 1
Omartificial-Intelligence-Space/FineWeb2-Moroccan-Arabic Viewer • Updated Dec 12, 2024 • 69.6M • 102 • 1
Omartificial-Intelligence-Space/FineWeb2-Egyptian-Arabic Viewer • Updated Dec 12, 2024 • 23.9M • 109 • 1
Omartificial-Intelligence-Space/ILMAAM-Arabic-Culturally-Aligned-MMLU Viewer • Updated Dec 11, 2024 • 12.5k • 63 • 1
Omartificial-Intelligence-Space/Arabic_Reasoning_Dataset Viewer • Updated Dec 1, 2024 • 9.21k • 70 • 2
Omartificial-Intelligence-Space/Arabic-finanical-rag-embedding-dataset Viewer • Updated Oct 9, 2024 • 7k • 103 • 6