BytedanceDouyinContent
/

SAIL-VL-2B

Model card Files Files and versions Community

zijian.kang commited on Dec 20, 2024

Commit

5b88afc

·

1 Parent(s): b59cd22

slight adjust readme

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -20,13 +20,13 @@ In a word, SAIL-VL is a foundational VLM for vision-language applications. Welco
 ## Model Card
-Model Architecture:
 | Architecture | ViT | LLM | Adapter | Token Merge | Resolution |
 | --- | --- | --- | --- | --- | --- |
 | SAIL-VL-2B | [🤗InternViT-300M](https://huggingface.co/OpenGVLab/InternViT-300M-448px) | [🤗Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) | 2-layer MLP | 2x2 | 448x448xN |
-Training recipes Overview:
 Sail-VL benefits from high-quality data and carefully curated training recipes. We find the data quality, quantity and the design of curriculum training pipeline is crucial for model performance. With the proper design and data, the model's capacity scales effectively with data expansion at all stages, leading to enhanced performance. More details will be released soon.
@@ -89,14 +89,14 @@ We visualize some of examples from LLaVA-Bench to show the capabilities of our m
 ## How to Use
-The basic usage and dynamic crop strategy of SAIL-VL follows InternVL2, you can easily switch Intern-VL series models to our model. Here is a simple example of using our model:
-Requirements:
 ```
 pip3 install einops transformers timm
 ```
-Code:
 ```Python
 import numpy as np

 ## Model Card
+### Model Architecture:
 | Architecture | ViT | LLM | Adapter | Token Merge | Resolution |
 | --- | --- | --- | --- | --- | --- |
 | SAIL-VL-2B | [🤗InternViT-300M](https://huggingface.co/OpenGVLab/InternViT-300M-448px) | [🤗Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) | 2-layer MLP | 2x2 | 448x448xN |
+### Training Recipes Overview:
 Sail-VL benefits from high-quality data and carefully curated training recipes. We find the data quality, quantity and the design of curriculum training pipeline is crucial for model performance. With the proper design and data, the model's capacity scales effectively with data expansion at all stages, leading to enhanced performance. More details will be released soon.
 ## How to Use
+The basic usage and dynamic crop strategy of SAIL-VL follows InternVL2, you can easily switch Intern-VL series of models to our model. Here is a simple example of using our model:
+### Requirements:
 ```
 pip3 install einops transformers timm
 ```
+### Code:
 ```Python
 import numpy as np