uf-aice-lab committed
Commit 1ea118a • Parent: 1fe67bf
Rename README (1).md to README.md
README (1).md → README.md (renamed)
@@ -4,19 +4,19 @@ language:
 - en
 pipeline_tag: question-answering
 ---
-# Llama-
+# Llama-2-Qlora
 
 <!-- Provide a quick summary of what the model is/does. -->
 
-This model is fine-tuned from LLaMA on 8 Nvidia A100-80G GPUs using 3,000,000 groups of conversations in the context of mathematics between students and facilitators on Algebra Nation (https://www.mathnation.com/). Llama-mt-lora consists of 32 layers and over 7 billion parameters, consuming up to 13.5 gigabytes of disk space. Researchers can experiment with and fine-tune the model to help construct math conversational AI that can effectively generate responses in a mathematical context.
+This model is fine-tuned from LLaMA-2 on 8 Nvidia A100-80G GPUs using 3,000,000 groups of conversations in the context of mathematics between students and facilitators on Algebra Nation (https://www.mathnation.com/). Llama-mt-lora consists of 32 layers and over 7 billion parameters, consuming up to 13.5 gigabytes of disk space. Researchers can experiment with and fine-tune the model to help construct math conversational AI that can effectively generate responses in a mathematical context.
 ### How to use it with text in Hugging Face
 ```python
 import torch
 import transformers
 from transformers import LlamaTokenizer, LlamaForCausalLM
-tokenizer = LlamaTokenizer.from_pretrained("
+tokenizer = LlamaTokenizer.from_pretrained("uf-aice-lab/Llama-2-QLoRA")
 model = LlamaForCausalLM.from_pretrained(
-    "
+    "uf-aice-lab/Llama-2-QLoRA",
     load_in_8bit=False,
     torch_dtype=torch.float16,
     device_map="auto",
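For convenience, here is a minimal, self-contained sketch of the usage snippet from the README above. The repo id uf-aice-lab/Llama-2-QLoRA is taken from the diff; the closing parenthesis, the example prompt, and the generate call are assumptions added for illustration, since the snippet shown in the diff is cut off mid-call.

```python
import torch
from transformers import LlamaTokenizer, LlamaForCausalLM

repo_id = "uf-aice-lab/Llama-2-QLoRA"  # repo id as shown in the diff

tokenizer = LlamaTokenizer.from_pretrained(repo_id)
model = LlamaForCausalLM.from_pretrained(
    repo_id,
    load_in_8bit=False,         # True would quantize weights (requires bitsandbytes)
    torch_dtype=torch.float16,  # half precision, in line with the ~13.5 GB figure above
    device_map="auto",          # place layers on available GPUs automatically
)  # assumed closing of the call that is truncated in the diff

# Hypothetical prompt in the Algebra Nation tutoring style.
prompt = "How do I solve 2x + 3 = 11?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Loading in torch.float16 with device_map="auto" keeps the memory footprint near the 13.5 GB quoted in the model card; setting load_in_8bit=True would roughly halve that at some cost in output quality.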