cleanup: final edits
README.md
@@ -60,8 +60,8 @@ print(tokenizer.decode(tokens[0], skip_special_tokens=True))
 * **Language(s)**: English
 * **Library**: [trlX](https://github.com/CarperAI/trlx)
 * **License for delta weights**: [CC-BY-NC-SA-4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/)
-  * *Note*: License for the base LLaMA model's weights is Meta's [non-commercial bespoke license](https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md).
-* **Contact**: For questions and comments about the model, visit the [
+  * *Note*: License for the base LLaMA model's weights is Meta's [non-commercial bespoke license](https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md).
+* **Contact**: For questions and comments about the model, visit the [CarperAI](https://discord.com/invite/KgfkCVYHdu) and [StableFoundation](https://discord.gg/stablediffusion) Discord servers.
 
 | Hyperparameter | Value |
 |---------------------------|-------|
@@ -81,7 +81,7 @@ The reward model used during RLHF was also trained on [OpenAssistant Conversatio
 
 ### Training Procedure
 
-`CarperAI/
+`CarperAI/stable-vicuna-13b-delta` was trained using PPO as implemented in [`trlX`](https://github.com/CarperAI/trlx/blob/main/trlx/trainer/accelerate_ppo_trainer.py) with the following configuration:
 
 | Hyperparameter | Value |
 |-------------------|---------|
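For context on the "delta weights" scheme the diff above refers to: the published checkpoint stores only the difference from the base LLaMA weights, and the full model is recovered by adding the two together tensor by tensor. A minimal sketch of that idea, using NumPy arrays as stand-ins for real checkpoint tensors (`apply_delta` is an illustrative helper, not a function from the CarperAI repo):

```python
import numpy as np

def apply_delta(base_weights: dict, delta_weights: dict) -> dict:
    """Recover full model weights by adding the delta to the base, tensor by tensor."""
    if base_weights.keys() != delta_weights.keys():
        raise ValueError("base and delta checkpoints must contain the same tensors")
    return {name: base + delta_weights[name] for name, base in base_weights.items()}

# Toy stand-ins for two checkpoint tensors
base = {"embed": np.array([1.0, 2.0]), "lm_head": np.array([[0.5]])}
delta = {"embed": np.array([0.1, -0.2]), "lm_head": np.array([[0.25]])}

full = apply_delta(base, delta)
print(full["embed"])  # [1.1 1.8]
```

In a real merge the dictionaries would be state dicts loaded from the base LLaMA checkpoint and the delta checkpoint; the licensing note above applies because the merged result incorporates Meta's base weights.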