Spaces:

binqiangliu
/

Zephyr7BAlpha

Runtime error

binqiangliu commited on Oct 23, 2023

Commit

dbb6c49

•

1 Parent(s): ab440a4

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -35,7 +35,7 @@ def load_quantized_model(model_name: str):
         #bnb_4bit_use_double_quant=True,
         bnb_4bit_use_double_quant=False,
         bnb_4bit_quant_type="nf4",
-        bnb_4bit_compute_dtype=torch.bfloat16
     )
     model = AutoModelForCausalLM.from_pretrained(

         #bnb_4bit_use_double_quant=True,
         bnb_4bit_use_double_quant=False,
         bnb_4bit_quant_type="nf4",
+        #bnb_4bit_compute_dtype=torch.bfloat16
     )
     model = AutoModelForCausalLM.from_pretrained(