Load with: --max_seq_len 8192 --compress_pos_emb 4 --loader exllama_hf
Check the monkeypatch in ooba (text-generation-webui).
For a 16384 context, use compress_pos_emb 8.
Works just fine on two A6000 GPUs.
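
The two settings above share the same ratio: 8192 / 4 and 16384 / 8 both give 2048. A minimal sketch of that arithmetic follows, assuming the model's native context is 2048 tokens. That figure is inferred from the ratios, not stated in this card, and the helper name compress_pos_emb_for is hypothetical.

```python
def compress_pos_emb_for(max_seq_len: int, native_ctx: int = 2048) -> int:
    """Return the position-compression factor for a target context length.

    native_ctx=2048 is an assumption inferred from the card's two examples
    (8192 -> 4 and 16384 -> 8); it is not stated by the model author.
    """
    if max_seq_len <= 0 or max_seq_len % native_ctx != 0:
        raise ValueError("max_seq_len must be a positive multiple of native_ctx")
    return max_seq_len // native_ctx


if __name__ == "__main__":
    # Print the loader flags for the two context lengths mentioned in the card.
    for target in (8192, 16384):
        factor = compress_pos_emb_for(target)
        print(f"--max_seq_len {target} --compress_pos_emb {factor} --loader exllama_hf")
```

Other context lengths are untested here; the card only reports the 8192 and 16384 settings.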