mistral-community
/

pixtral-12b

Image-Text-to-Text

Inference Endpoints

Model card Files Files and versions Community

Rocketknight1 HF staff commited on Sep 24, 2024

Commit

460ee72

·

verified ·

1 Parent(s): 7f5b217

Update README.md

Files changed (1) hide show

README.md +5 -4

README.md CHANGED Viewed

@@ -53,9 +53,10 @@ Each image captures a different scene, from a close-up of a dog to expansive nat
 """
 ```
-You can also use a chat template to format your chat history for Pixtral. Here's an example - note how you can interleave text and multiple images in the same message!
-Make sure that the `images` argument to the `processor` contains the images in the order that they appear in the chat, so that the model understands where
-each image is supposed to go.
 ```python
 from PIL import Image
@@ -105,6 +106,6 @@ If you're asking whether the dog can "live here," referring to the snowy landsca
 Would you like more information on any specific aspect?
 ```
-Note that while it may appear that spacing in the input is disrupted, this is caused by us skipping special tokens for display, and actually "Can this animal" and "live here" are
 correctly separated by image tokens. Try decoding with special tokens included to see exactly what the model sees!

 """
 ```
+You can also use a chat template to format your chat history for Pixtral. Make sure that the `images` argument to the `processor` contains the images in the order
+that they appear in the chat, so that the model understands where each image is supposed to go.
+Here's an example with text and multiple images interleaved in the same message:
 ```python
 from PIL import Image
 Would you like more information on any specific aspect?
 ```
+While it may appear that spacing in the input is disrupted, this is caused by us skipping special tokens for display, and actually "Can this animal" and "live here" are
 correctly separated by image tokens. Try decoding with special tokens included to see exactly what the model sees!