Update README.md

88bd34b verified 5 months ago

4 kB

	---
	license: mit
	language:
	- en
	library_name: transformers
	tags:
	- feature-extraction
	- marqo
	- retrieval
	---

	<h1 align="center">Marqo's Chimera arctic-bge-m</h1>
	<h4 align="center">
	<p>
	<a href=#this-model>This Model</a> \|
	<a href=#usage>Usage</a> \|
	<a href="#faq">FAQ</a> \|
	<a href="#about-marqo">About Marqo</a> \|
	<a href="#acknowledgement">Acknowledgement</a>
	<p>
	</h4>

	## This Model

	This is a chimera model which concatenates embeddings from [Snowflake/snowflake-arctic-embed-m](https://huggingface.co/Snowflake/snowflake-arctic-embed-m) and [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5). This model produces an embedding with 1536 dimensions (768+768) and has a total of 218M parameters (109+109). Embeddings from each model are unit normalized prior to concatenation.

	## Usage

	```python
	import torch
	from torch.nn.functional import normalize
	from transformers import AutoModel, AutoTokenizer


	# Load the model and tokenizer.
	tokenizer = AutoTokenizer.from_pretrained("Marqo/marqo-chimera-arctic-bge-m")
	model = AutoModel.from_pretrained("Marqo/marqo-chimera-arctic-bge-m", trust_remote_code=True)
	model.eval()

	# Model constants.
	query_prefix = 'Represent this sentence for searching relevant passages: '

	# Your queries and docs.
	queries = [
	"What is vector search?",
	"Where can I get the best pizza?"
	]

	documents = [
	"Marqo is an end-to-end platform for embedding training and retrieval.",
	"Definitely Naples! The birthplace of pizza, and it’s as authentic as it gets."
	]


	# Add query prefix and tokenize queries and docs.
	queries_with_prefix = [f"{query_prefix}{q}" for q in queries]
	query_tokens = tokenizer(queries_with_prefix, padding=True, truncation=True, return_tensors='pt', max_length=512)
	document_tokens = tokenizer(documents, padding=True, truncation=True, return_tensors='pt', max_length=512)

	# Use the model to generate text embeddings.
	with torch.inference_mode():
	query_embeddings = model(**query_tokens)
	document_embeddings = model(**document_tokens)

	# Remember to normalize embeddings.
	query_embeddings = normalize(query_embeddings)
	document_embeddings = normalize(document_embeddings)

	# Scores via dotproduct.
	scores = query_embeddings @ document_embeddings.T

	# Pretty-print the results.
	for query, query_scores in zip(queries, scores):
	doc_score_pairs = list(zip(documents, query_scores))
	doc_score_pairs = sorted(doc_score_pairs, key=lambda x: x[1], reverse=True)
	print(f'Query: "{query}"')
	for document, score in doc_score_pairs:
	print(f'Score: {score:.4f} \| Document: "{document}"')
	print()


	# Query: "What is vector search?"
	# Score: 0.4997 \| Document: "Marqo is an end-to-end platform for embedding training and retrieval."
	# Score: 0.2509 \| Document: "Definitely Naples! The birthplace of pizza, and it’s as authentic as it gets."

	# Query: "Where can I get the best pizza?"
	# Score: 0.7444 \| Document: "Definitely Naples! The birthplace of pizza, and it’s as authentic as it gets."
	# Score: 0.3303 \| Document: "Marqo is an end-to-end platform for embedding training and retrieval."
	```

	## FAQ

	__Q: Do I need to prefix queries?__

	__A:__ Yes, this model has the same rules for prefixing as its constituent models. Queries in asymmetric retrieval should be prefixed with `"Represent this sentence for searching relevant passages: "`.

	## About Marqo

	[Marqo](https://www.marqo.ai/) is an end-to-end platform for training embeddings models and building vector search. Marqo is available as an open-source offering on our [GitHub](https://github.com/marqo-ai/marqo) or as a managed cloud service on [Marqo Cloud](https://cloud.marqo.ai).

	## Acknowledgement

	We want to acknowledge the original creators of the [Snowflake/snowflake-arctic-embed-m](https://huggingface.co/Snowflake/snowflake-arctic-embed-m) and [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5) models which are used to create this model.