lightonai
/

MonoQwen2-VL-v0.1

Model card Files Files and versions Community

NohTow commited on 15 days ago

Commit

6d48017

•

1 Parent(s): 78b0fe8

Update readme

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -8,9 +8,11 @@ tags:
 # MonoQwen2-VL-2B-LoRA-Reranker
 ## Model Overview
-The **MonoQwen2-VL-2B-LoRA-Reranker** is a LoRA fine-tuned version of the Qwen2-VL-2B model, optimized for reranking image-query relevance.
-It was train using [ColPali train set](https://huggingface.co/datasets/vidore/colpali_train_set)
 ## How to Use the Model
 Below is a quick example to rerank a single image against a user query using this model:

 # MonoQwen2-VL-2B-LoRA-Reranker
 ## Model Overview
+The **MonoQwen2-VL-v0.1** is a LoRA of the Qwen2-VL-2B model, optimized for reranking (i.e, asserting pointwise image-query relevance) using the [MonoT5](https://arxiv.org/pdf/2101.05667) objective.
+That is, given a couple of image and query fed into the prompt of the VLM, the model is tasked to generate "True" if the image is relevant to the query and "False" otherwise.
+During inference, a relevancy score can then be obtained by comparing the logits of the two tokens and this score can effectively be used to rerank the candidates generated by a first-stage retriever (such as DSE or ColPali) or filter them using a threshold.
+The [ColPali train set](https://huggingface.co/datasets/vidore/colpali_train_set) was used to train this model with negatives mined using DSE.
 ## How to Use the Model
 Below is a quick example to rerank a single image against a user query using this model: