title: README
emoji: π
colorFrom: purple
colorTo: green
sdk: static
pinned: false
<div align="center">
<img src="https://github.com/yixuantt/picx-images-hosting/raw/master/bar.231u8j8ajg.webp" alt="Logo" width="100%" />
</div>
## FinMTEB: Finance Massive Text Embedding Benchmark
The Finance Massive Text Embedding Benchmark (FinMTEB) is an embedding benchmark consisting of **64 financial domain-specific text datasets** in **English and Chinese**, spanning **seven different tasks**. Every dataset in FinMTEB is specific to the finance domain: each was either used in previous financial NLP research or newly developed by the authors.

---
* Paper: [Do We Need Domain-Specific Embedding Models? An Empirical Investigation](https://arxiv.org/pdf/2409.18511v1)
* GitHub: [FinMTEB](https://github.com/yixuantt/FinMTEB/blob/main/README.md)