Sentence Similarity
English
txtai
davidmezzetti's picture
September 2024 data update
ffa284b
|
raw
history blame
643 Bytes
metadata
inference: false
language: en
license:
  - cc-by-sa-3.0
  - gfdl
library_name: txtai
tags:
  - sentence-similarity
datasets:
  - NeuML/wikipedia-20240901

Wikipedia txtai embeddings slim

This is a txtai embeddings index for the English edition of Wikipedia.

The slim version has the 100K most popular Wikipedia pages ranked by page views. This embeddings index also has graph indexing enabled, which enables using this as a source for GraphRAG.

See the txtai-wikipedia model page for additional information on this datasource.