Model Summary

This model is an FP8 quantization of the microsoft/Phi-3-medium-128k-instruct model, stored in safetensors format.
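
As a rough illustration of why an FP8 quantization can be useful, the sketch below estimates the storage needed for the weights alone at 2 bytes per parameter (BF16) versus 1 byte per parameter (FP8). It assumes a rounded count of 14 billion parameters and that every weight is stored in FP8. Both are simplifications: a quantized checkpoint typically keeps some tensors in BF16, so the real file size will be somewhat larger than the FP8 figure. The helper name `weight_footprint_gib` is hypothetical and not part of any library.

```python
def weight_footprint_gib(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of the weights in GiB (weights only, no activations or KV cache)."""
    return num_params * bytes_per_param / 2**30


# Assumption: rounded parameter count of about 14 billion.
NUM_PARAMS = 14e9

bf16_gib = weight_footprint_gib(NUM_PARAMS, 2)  # BF16 uses 2 bytes per parameter
fp8_gib = weight_footprint_gib(NUM_PARAMS, 1)   # FP8 uses 1 byte per parameter

print(f"BF16 weights: ~{bf16_gib:.1f} GiB")
print(f"FP8 weights:  ~{fp8_gib:.1f} GiB")
```

Under these assumptions the FP8 weights take about half the space of the BF16 weights.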

License

The model is licensed under the MIT license.

Safetensors

Model size: 14B params
Tensor types: BF16, F8_E4M3