---
inference: false
language: en
license:
- cc-by-sa-3.0
- gfdl
library_name: txtai
tags:
- sentence-similarity
datasets:
- NeuML/wikipedia-20240901
---
# Wikipedia txtai embeddings slim
This is a [txtai](https://github.com/neuml/txtai) embeddings index for the [English edition of Wikipedia](https://en.wikipedia.org/).
The slim version contains the 100K most popular Wikipedia pages, ranked by page views. Graph indexing is also enabled on this embeddings index, so it can be used as a source for GraphRAG.
See the [txtai-wikipedia](https://hf.co/models/neuml/txtai-wikipedia) model page for additional information on this data source.
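
As a rough illustration of how an index like this might be loaded and queried, here is a minimal sketch using txtai's Hugging Face Hub loader. The repository id `neuml/txtai-wikipedia-slim` is an assumption inferred from the title of this page, and the sketch also assumes that content storage is enabled so that search results come back as dictionaries with `id`, `text` and `score` keys. `load_index`, `format_results` and `main` are hypothetical helper names, not part of txtai.

```python
def load_index(container="neuml/txtai-wikipedia-slim"):
    """Load an embeddings index from the Hugging Face Hub.

    The default container name is an assumption, not confirmed by this page.
    """
    # Imported lazily so format_results can be used without txtai installed
    from txtai import Embeddings

    embeddings = Embeddings()
    embeddings.load(provider="huggingface-hub", container=container)
    return embeddings


def format_results(results):
    """Render search results as 'score  id' lines, best match first.

    Assumes each result is a dict with at least 'id' and 'score' keys.
    """
    ordered = sorted(results, key=lambda row: row["score"], reverse=True)
    return [f"{row['score']:.4f}  {row['id']}" for row in ordered]


def main():
    # Downloads the index on first use, then runs a similarity search
    embeddings = load_index()
    for line in format_results(embeddings.search("Roman Empire", 3)):
        print(line)
```

Calling `main()` downloads the index and prints the top three matches for the query. The txtai import is kept inside `load_index` so the formatting helper stays usable on its own.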