Commit 3af71f9
Parent(s): 0aabc4c
Update README.md

README.md CHANGED
@@ -22,6 +22,14 @@ This is a copy of the original [Molmo 7B-D model card](https://huggingface.co/al
```diff
 
 **Note: The following implementation is a community-contributed endpoint handler and is not an official implementation. For the official model and its usage, please refer to the [official Molmo 7B-D model page](https://huggingface.co/allenai/Molmo-7B-D-0924).**
 
+You should see a `Deploy` via Inference Endpoints option at the top of this model card.
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/60107b385ac3e86b3ea4fc34/kHR0wO_GchczmsmHtjJ1u.png)
+
+Currently, this handler uses `bfloat16` for inference. The original authors found some differences in results compared with using `float32` weights.
+I didn't find that results degraded much in my initial experiments, but I may change this implementation in the future.
+
+
 If you've deployed the model using Hugging Face's Inference Endpoints with a community-contributed handler, you can use it with the following code:
 
 ```python
```
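The Python snippet itself is truncated in this diff. As a rough sketch only (not the handler's actual contract), a deployed endpoint can be queried along these lines, assuming the handler accepts a JSON body with a base64-encoded image and a text prompt; `ENDPOINT_URL`, `HF_TOKEN`, and the payload shape are placeholders and assumptions, not confirmed by the source:

```python
# Hedged sketch: the exact request schema depends on the community handler.
# ENDPOINT_URL and HF_TOKEN below are placeholders, not real values.
import base64

import requests

ENDPOINT_URL = "https://your-endpoint.endpoints.huggingface.cloud"  # placeholder
HF_TOKEN = "hf_xxx"  # placeholder: your Hugging Face access token


def build_payload(image_path: str, prompt: str) -> dict:
    """Base64-encode the image and pair it with the text prompt."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")
    return {"inputs": {"image": image_b64, "text": prompt}}


def query_endpoint(image_path: str, prompt: str) -> dict:
    """POST the payload to the deployed endpoint and return its JSON reply."""
    response = requests.post(
        ENDPOINT_URL,
        headers={"Authorization": f"Bearer {HF_TOKEN}"},
        json=build_payload(image_path, prompt),
    )
    response.raise_for_status()
    return response.json()


# Example (requires a running endpoint):
# query_endpoint("image.png", "Describe this image.")
```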