Sentence Similarity
English
txtai
File size: 643 Bytes
30c7935
 
 
 
 
 
 
 
 
 
ffa284b
30c7935
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
inference: false
language: en
license:
- cc-by-sa-3.0
- gfdl
library_name: txtai
tags:
- sentence-similarity
datasets:
- NeuML/wikipedia-20240901
---

# Wikipedia txtai embeddings slim

This is a [txtai](https://github.com/neuml/txtai) embeddings index for the [English edition of Wikipedia](https://en.wikipedia.org/).

The slim version has the `100K most popular` Wikipedia pages ranked by page views. This embeddings index also has graph indexing enabled, which enables using this as a source for GraphRAG.

See the [txtai-wikipedia](https://hf.co/models/neuml/txtai-wikipedia) model page for additional information on this datasource.