SliME Model Card

image/jpeg

Model details

Model type:

SliME is an open-source chatbot trained by fine-tuning LLM on multimodal instruction-following data. It is an auto-regressive language model, based on the transformer architecture. Base LLM: meta-llama/Meta-Llama-3-8B-Instruct

image/png

Paper or resources for more information: https://github.com/yfzhang114/SliME

License

Llama 3 is licensed under the LLAMA 3 Community License, Copyright (c) Meta Platforms, Inc. All Rights Reserved.

Where to send questions or comments about the model: https://github.com/yfzhang114/SliME/issues

Intended use

Primary intended uses: The primary use of SliME is research on large multimodal models and chatbots.

Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

Training dataset

  • SharedGPT4v sft data
  • SMR data

Evaluation dataset

A collection of 15 benchmarks, including 5 academic VQA benchmarks and 10 recent benchmarks specifically proposed for instruction-following LMMs.

image/png

Downloads last month
12
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Dataset used to train yifanzhang114/SliME-Llama3-8B-lora

Collection including yifanzhang114/SliME-Llama3-8B-lora