Update README.md
README.md
@@ -39,7 +39,7 @@ Türkçe görsel dil modelini deneyimlemeye hazır mısınız? Hadi başlayalım
 
 ## English
 This model is a multimodal large language model that combines [SigLIP](https://huggingface.co/docs/transformers/en/model_doc/siglip) as its vision encoder with [GPT2-large](https://huggingface.co/docs/transformers/en/model_doc/gpt2) as its language model. The vision projector connects the two modalities together.
-Its architecture closely resembles [PaliGemma](https://
+Its architecture closely resembles [PaliGemma](https://huggingface.co/docs/transformers/v4.44.0/model_doc/paligemma), with some refined adjustments to the vision projector and the causal language modeling.
 
 Here's the summary of the development process:
 
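
The README text in this hunk says a vision projector connects the SigLIP encoder to the GPT2-large language model. As a rough illustration of that idea only, the sketch below maps image patch embeddings into the language model's embedding space with a single linear projection and places them in front of the text token embeddings. Everything here is an assumption made for illustration: the function names (`make_projector`, `project`, `build_input_sequence`), the toy dimensions, and the use of one linear layer with image-first ordering are hypothetical and are not taken from the model's code. The README itself notes that the real projector has its own adjustments.

```python
import random


def make_projector(vision_dim, text_dim, seed=0):
    """Return a random vision_dim x text_dim matrix (stand-in for learned weights)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(text_dim)]
            for _ in range(vision_dim)]


def project(patch_embeddings, weights):
    """Map each patch embedding into the language model's embedding space."""
    text_dim = len(weights[0])
    return [
        [sum(patch[i] * weights[i][j] for i in range(len(patch)))
         for j in range(text_dim)]
        for patch in patch_embeddings
    ]


def build_input_sequence(patch_embeddings, token_embeddings, weights):
    """Place projected image patches before the text token embeddings (assumed order)."""
    return project(patch_embeddings, weights) + token_embeddings


# Toy sizes, chosen only to keep the example readable.
weights = make_projector(vision_dim=4, text_dim=3)
patches = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]  # 2 image patches
tokens = [[0.5, 0.5, 0.5] for _ in range(3)]             # 3 text tokens
sequence = build_input_sequence(patches, tokens, weights)
print(len(sequence), len(sequence[0]))
```

After projection every element of the combined sequence has the language model's embedding width, so a causal language model could consume image and text positions through the same input path.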