Update README.md

results: []
---

# Bloom-1b7-creative-writing

This model is a fine-tuned version of [bigscience/bloom-1b7](https://huggingface.co/bigscience/bloom-1b7) on the [adambjorn/UnrelatedForgettingOverhead](https://huggingface.co/datasets/adambjorn/UnrelatedForgettingOverhead) creative writing dataset.
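
A minimal usage sketch with the `transformers` library is shown below. The repository id is a placeholder, since this card does not state where the checkpoint is published; substitute the actual Hub id or a local checkpoint path.

```python
# Minimal sketch: load the fine-tuned model and generate a short sample.
# "your-username/Bloom-1b7-creative-writing" is a placeholder Hub id, not
# the confirmed location of this checkpoint; a local path also works.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-username/Bloom-1b7-creative-writing"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "Write the opening paragraph of a short story about a lighthouse keeper."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=80, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```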

## Model description

More information needed

## Intended uses & limitations

Intended for use in a student group project for Portland State University's Winter 2024 LLMs course.

## Training and evaluation data
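
A hedged sketch of loading the dataset linked above with the `datasets` library; the configuration and split names below are assumptions, so check the dataset card for the actual ones.

```python
# Hedged sketch: load the dataset referenced in this card.
# The "creative" configuration name is an assumption; inspect the
# dataset card for the real configuration and split names.
from datasets import load_dataset

ds = load_dataset("adambjorn/UnrelatedForgettingOverhead", "creative")
print(ds)              # shows available splits and features
print(ds["train"][0])  # assumes a "train" split exists
```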

## Training procedure

Trained on a single NVIDIA RTX 3090 GPU.

### Training hyperparameters

The following hyperparameters were used during training:
- mixed_precision_training: Native AMP
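
Only the mixed-precision setting is recorded above, so the sketch below shows just how that one value maps onto `TrainingArguments`; every other argument is a placeholder rather than the run's actual configuration.

```python
# Hedged sketch of the Trainer setup. Only fp16=True (the card's
# "Native AMP" mixed-precision setting) is taken from this card; all
# other values are placeholders, not the actual training configuration.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="bloom-1b7-creative-writing",
    fp16=True,                      # mixed_precision_training: Native AMP
    num_train_epochs=5,             # placeholder; the log below reports epoch 4.95
    learning_rate=3e-5,             # placeholder
    per_device_train_batch_size=1,  # placeholder, sized for one RTX 3090
    gradient_accumulation_steps=4,  # placeholder
)
```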

### Training results

After the final epoch:

{'loss': 0.0472, 'learning_rate': 1.4893617021276598e-06, 'epoch': 4.95}

After full completion:

{'train_runtime': 563.2707,
 'train_samples_per_second': 1.687,
 'train_steps_per_second': 0.417,
 'train_loss': 0.8475136074614018,
 'epoch': 4.95}
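
As a rough sanity check, the runtime and throughput figures above imply the approximate size of the run:

```python
# Back-of-the-envelope check using the metrics reported above.
train_runtime = 563.2707  # seconds
train_samples_per_second = 1.687
train_steps_per_second = 0.417

print(f"~{train_runtime * train_samples_per_second:.0f} samples processed")  # ~950
print(f"~{train_runtime * train_steps_per_second:.0f} optimizer steps")      # ~235
```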

### Framework versions