Instructions to use Lin-Chen/ShareGPT4V-7B_Pretrained_vit-large336-l12 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Lin-Chen/ShareGPT4V-7B_Pretrained_vit-large336-l12 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-feature-extraction", model="Lin-Chen/ShareGPT4V-7B_Pretrained_vit-large336-l12")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Lin-Chen/ShareGPT4V-7B_Pretrained_vit-large336-l12", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Configuration Parsing Warning:Config file config.json cannot be fetched (too big)
ShareGPT4V Model Card
Model details
Model type: This is the vision tower of ShareGPT4V-7B fine-tuned with our ShareGPT4V dataset.
Model date: This vision tower was trained in Nov 2023.
Paper or resources for more information: [Project] [Paper] [Code]
License
Llama 2 is licensed under the LLAMA 2 Community License, Copyright (c) Meta Platforms, Inc. All Rights Reserved.
Intended use
Primary intended uses: The primary use of this vision tower is research on large multimodal models and chatbots.
Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
Training dataset
- 1.2M high-quality image-text pairs
- Downloads last month
- 19,198