Update README.md
Browse files
README.md
CHANGED
@@ -12,15 +12,18 @@ pipeline_tag: image-feature-extraction
|
|
12 |
|
13 |
# Model Card for InternViT-6B-224px
|
14 |
|
15 |
-
|
16 |
|
17 |
-
\[[Paper](https://arxiv.org/abs/2312.14238)\] \[[GitHub](https://github.com/OpenGVLab/InternVL)\]
|
18 |
|
19 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
20 |
|
21 |
-
It is _**the largest open-source vision/vision-language foundation model (14B)**_ to date, achieving _**32 state-of-the-art**_ performances on a wide range of tasks such as visual perception, cross-modal retrieval, multimodal dialogue, etc.
|
22 |
-
|
23 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/k5UATwX5W2b5KJBN5C58x.png)
|
24 |
|
25 |
## Model Details
|
26 |
- **Model Type:** vision foundation model, feature backbone
|
|
|
12 |
|
13 |
# Model Card for InternViT-6B-224px
|
14 |
|
15 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/dhm1rXFgszG05TBIgY6tS.webp" alt="Image Description" width="300" height="300">
|
16 |
|
17 |
+
\[[Paper](https://arxiv.org/abs/2312.14238)\] \[[GitHub](https://github.com/OpenGVLab/InternVL)\] \[[Chat Demo](https://internvl.opengvlab.com/)\] \[[中文解读](https://zhuanlan.zhihu.com/p/675877376)]
|
18 |
|
19 |
+
| Model | Date | Download | Note |
|
20 |
+
| ----------------------- | ---------- | ---------------------------------------------------------------------- | -------------------------------- |
|
21 |
+
| InternViT-6B-448px-V1.5 | 2024.04.20 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-5) | support dynamic resolution, super strong OCR (🔥new) |
|
22 |
+
| InternViT-6B-448px-V1.2 | 2024.02.11 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-2) | 448 resolution |
|
23 |
+
| InternViT-6B-448px-V1.0 | 2024.01.30 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-0) | 448 resolution |
|
24 |
+
| InternViT-6B-224px | 2023.12.22 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-224px) | vision foundation model |
|
25 |
+
| InternVL-14B-224px | 2023.12.22 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-14B-224px) | vision-language foundation model |
|
26 |
|
|
|
|
|
|
|
27 |
|
28 |
## Model Details
|
29 |
- **Model Type:** vision foundation model, feature backbone
|