cortecs
/

Meta-Llama-3-70B-Instruct-GPTQ

@@ -1,10 +1,15 @@
 This is a quantized model of [Llama-3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) using GPTQ developed by [IST Austria](https://ist.ac.at/en/research/alistarh-group/)
  using the following configuration:
  - 4bit (8bit will follow)
 - Act order: True
  - Group size: 128
  - Seq. length: 4096
- - Dataset: [Wikitext2](https://huggingface.co/datasets/wikitext)
 ## Usage
 Install **vLLM** and
     run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):

+---
+datasets: wikitext
+license: apache-2.0
+license_link: https://llama.meta.com/llama3/license/
+---
 This is a quantized model of [Llama-3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) using GPTQ developed by [IST Austria](https://ist.ac.at/en/research/alistarh-group/)
  using the following configuration:
  - 4bit (8bit will follow)
 - Act order: True
  - Group size: 128
  - Seq. length: 4096
 ## Usage
 Install **vLLM** and
     run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):