MoMonir/gte-Qwen1.5-7B-instruct-GGUF

This model was converted to GGUF format from Alibaba-NLP/gte-Qwen1.5-7B-instruct using llama.cpp.
Refer to the original model card for more details on the model.

Note: This is an embedding model.

For more information about embeddings, see the OpenAI embeddings documentation.
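
An embedding model maps each input text to a numeric vector, and texts are usually compared by the cosine similarity of their vectors. The following is a minimal sketch of that comparison step only. It does not load this model: the function name `cosine_similarity` and the toy 3-dimensional vectors are assumptions made up for illustration, standing in for whatever vectors your GGUF runtime returns.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for real embeddings (hypothetical values).
query = [1.0, 0.0, 1.0]
doc_close = [2.0, 0.0, 2.0]   # same direction as the query
doc_far = [0.0, 1.0, 0.0]     # orthogonal to the query

print(cosine_similarity(query, doc_close))  # close to 1: same direction
print(cosine_similarity(query, doc_far))    # 0: no overlap
```

Scores near 1 indicate vectors pointing in nearly the same direction, and scores near 0 indicate unrelated directions. With real embeddings, the same function would be applied to the vectors produced for a query and for each candidate text.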


GGUF

Model size: 7.72B params
Architecture: qwen2
Quantizations: 4-bit, 5-bit, 6-bit

