open_llama_3b_v2-ds
https://huggingface.co/openlm-research/open_llama_3b_v2
Inference
import deepsparse
model = deepsparse.TextGeneration(model="hf:mgoin/open_llama_3b_v2-ds")
model("Tell me a joke.")
Export
git clone https://huggingface.co/openlm-research/open_llama_3b_v2
sparseml.transformers.export_onnx --model_path ./open_llama_3b_v2 --task text-generation --sequence_length 512
- Downloads last month
- 10
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.