部署好的vl1.5多模态大模型，目前只接受图片或文本输入吗？是否接受PDF/ppt/excel等文件输入？这种怎么实现

#26

by wqw0806 - opened 14 days ago

14 days ago

部署好的vl1.5多模态大模型，目前只接受图片或文本输入吗？是否接受PDF/ppt/excel等文件输入？这种怎么实现 messages = [
{"role": "system", "content": sys_prompt},
{
'role': 'user',
'content': [
{'type': 'text', 'text': prompt},
{'type': 'image_url', 'image_url': {'url': f"data:image/png;base64,{image}"}}
],
}
]

czczup

OpenGVLab org 12 days ago

可以用一些开源的python包把别的类型的文件转换成图文再输入模型

czczup changed discussion status to closed 6 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment