openbmb/MiniCPM-Llama3-V-2_5-int4

Visual Question Answering

feature-extraction

4-bit precision


finalf0 committed on May 20

Commit d6b3c68 • 1 Parent(s): 205b025

Update README.md

Files changed (1)

README.md +40 -1

@@ -1,3 +1,42 @@
 ## MiniCPM-Llama3-V 2.5
-See [here](https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5) for more detail.

+---
+pipeline_tag: visual-question-answering
+---
 ## MiniCPM-Llama3-V 2.5
+For more details, see [MiniCPM-Llama3-V 2.5](https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5).
+## Usage
+Run inference with Hugging Face Transformers on NVIDIA GPUs. The following requirements were tested on Python 3.10:
+```
+Pillow==10.1.0
+torch==2.1.2
+torchvision==0.16.2
+transformers==4.40.0
+sentencepiece==0.1.99
+accelerate==0.30.1
+bitsandbytes==0.43.1
+```
+```python
+# test.py
+import torch
+from PIL import Image
+from transformers import AutoModel, AutoTokenizer
+model = AutoModel.from_pretrained('openbmb/MiniCPM-Llama3-V-2_5-int4', trust_remote_code=True)
+tokenizer = AutoTokenizer.from_pretrained('openbmb/MiniCPM-Llama3-V-2_5-int4', trust_remote_code=True)
+model.eval()
+image = Image.open('xx.jpg').convert('RGB')
+question = 'What is in the image?'
+msgs = [{'role': 'user', 'content': question}]
+res = model.chat(
+    image=image,
+    msgs=msgs,
+    tokenizer=tokenizer,
+    sampling=True,
+    temperature=0.7
+)
+print(res)
+```
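
The README added in this commit pins exact package versions, so it can help to confirm that the local environment matches them before running `test.py`. The following is a minimal sketch of such a check, not part of the commit: the names `PINNED`, `installed_version`, and `find_mismatches` are hypothetical, and ignoring a local build suffix such as `+cu121` when comparing versions is an assumption about what counts as a match.

```python
from importlib import metadata

# Pins copied from the requirements list in the README diff above.
PINNED = {
    "Pillow": "10.1.0",
    "torch": "2.1.2",
    "torchvision": "0.16.2",
    "transformers": "4.40.0",
    "sentencepiece": "0.1.99",
    "accelerate": "0.30.1",
    "bitsandbytes": "0.43.1",
}


def installed_version(name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Return {package: (expected, found)} for every pin that is not met exactly."""
    mismatches = {}
    for name, expected in pinned.items():
        found = lookup(name)
        # Assumption: drop a local build suffix such as "+cu121" before comparing.
        base = found.split("+", 1)[0] if found else None
        if base != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in find_mismatches(PINNED).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

Run as a script, it prints one line per package whose installed version differs from the pin and prints nothing when everything matches. The `lookup` parameter exists only so the comparison can be exercised without touching the real environment.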