Ariel Lee committed
Commit 8cb3d80
1 Parent(s): feb7b4c

Update README.md

Files changed (1)
  1. README.md +15 -2
README.md CHANGED
@@ -13,7 +13,7 @@ metrics:
 
 # 🥳 Platypus-30B has arrived!
 
-Platypus-30B is an instruction fine-tuned model based on the LLaMA-30B transformer architecture and takes advantage of [LoRA](https://arxiv.org/pdf/2106.09685.pdf).
+Platypus-30B is an instruction fine-tuned model based on the LLaMA-30B transformer architecture and takes advantage of LoRA.
 
 | Metric | Value |
 |-----------------------|-------|
@@ -47,7 +47,7 @@ Dataset of highly filtered and curated question and answer pairs. Release TBD.
 
 ## Limitations and bias
 
-The base LLaMA model is trained on various data, some of which may contain offensive, harmful, and biased content that can lead to toxic behavior. See Section 5.1 of the LLaMA [paper](https://arxiv.org/abs/2302.13971). We have not performed any studies to determine how fine-tuning on the aforementioned datasets affects the model's behavior and toxicity. Do not treat chat responses from this model as a substitute for human judgment or as a source of truth. Please use responsibly.
+The base LLaMA model is trained on various data, some of which may contain offensive, harmful, and biased content that can lead to toxic behavior. See Section 5.1 of the LLaMA paper. We have not performed any studies to determine how fine-tuning on the aforementioned datasets affects the model's behavior and toxicity. Do not treat chat responses from this model as a substitute for human judgment or as a source of truth. Please use responsibly.
 
 ## Citations
 
@@ -58,4 +58,17 @@ The base LLaMA model is trained on various data, some of which may contain offen
   journal={arXiv preprint arXiv:2302.13971},
   year={2023}
 }
+@article{DBLP:journals/corr/abs-2106-09685,
+  author  = {Edward J. Hu and
+             Yelong Shen and
+             Phillip Wallis and
+             Zeyuan Allen{-}Zhu and
+             Yuanzhi Li and
+             Shean Wang and
+             Weizhu Chen},
+  title   = {LoRA: Low-Rank Adaptation of Large Language Models},
+  journal = {CoRR},
+  year    = {2021},
+  url     = {https://arxiv.org/abs/2106.09685},
+}
 ```
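
For context on the LoRA reference in the updated description above: the sketch below shows how a LoRA instruction fine-tune of a LLaMA-30B-style base model is typically set up with the Hugging Face `peft` library. This is a minimal illustration, not the Platypus-30B training recipe; the base checkpoint ID, target modules, and hyperparameters are all assumptions.

```python
# Minimal sketch of a LoRA setup in the style the README describes, using
# the Hugging Face `peft` library. All IDs and hyperparameters here are
# illustrative assumptions, not the actual Platypus-30B configuration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

BASE_ID = "huggyllama/llama-30b"  # assumed LLaMA-30B checkpoint ID

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
model = AutoModelForCausalLM.from_pretrained(
    BASE_ID,
    torch_dtype=torch.float16,
    device_map="auto",
)

# LoRA (Hu et al., 2021) freezes the base weights and trains low-rank
# update matrices injected into selected projection layers.
lora_config = LoraConfig(
    r=16,                                 # rank of the low-rank update (assumed)
    lora_alpha=32,                        # scaling factor (assumed)
    target_modules=["q_proj", "v_proj"],  # attention projections (assumed)
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```

Training only the adapter matrices keeps the trainable parameter count at a small fraction of the 30B base weights, which is what makes instruction tuning at this scale practical on modest hardware.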