alonzogarbanzo
/

Bloom-1b7-creative-writing-IT-baseline

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

alonzogarbanzo commited on Feb 27, 2024

Commit

f149d38

·

verified ·

1 Parent(s): 94fdf3e

Update README.md

Files changed (1) hide show

README.md +22 -4

README.md CHANGED Viewed

@@ -8,12 +8,12 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # Bloom-1b7-creative-writing-IT
-This model is a fine-tuned version of [bigscience/bloom-1b7](https://huggingface.co/bigscience/bloom-1b7) on an unknown dataset.
 ## Model description
@@ -25,10 +25,26 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
@@ -45,7 +61,9 @@ The following hyperparameters were used during training:
 ### Training results
 ### Framework versions

   results: []
 ---
 # Bloom-1b7-creative-writing-IT
+This model is a fine-tuned version of [bigscience/bloom-1b7](https://huggingface.co/bigscience/bloom-1b7) on an a creative writing - short story dataset.
+https://huggingface.co/datasets/adambjorn/UnrelatedForgettingOverhead/viewer/creative
 ## Model description
 ## Training and evaluation data
+Training and evaluation data here: https://huggingface.co/datasets/adambjorn/UnrelatedForgettingOverhead/viewer/creative
 ## Training procedure
+The model was instruction tuned on the dataset in the following way:
+Given the set of promts:
+prompts = [
+    "Write a creative short story based on the following title:",
+    "Here is a title for a story. Craft a short narrative around it:",
+    "Using the title given, develop a short story:",
+    "Imagine a short story that starts with this title:",
+    "Create a brief story with the following title:"
+],
+each training example is generated by concatenating one of the prompts with the 'title' and 'selftext' in the following way:
+    concatenated_texts = [random.choice(prompts) + " " + title + "</s>" + "Story: " + selftext for title, selftext in zip(titles, selftexts)]
 ### Training hyperparameters
 The following hyperparameters were used during training:
 ### Training results
+Final reported loss: {'loss': 0.0135, 'grad_norm': 0.6041152477264404, 'learning_rate': 7.446808510638299e-07, 'epoch': 9.89}
+Average over tuning: {'train_runtime': 1111.4187, 'train_samples_per_second': 1.71, 'train_steps_per_second': 0.423, 'train_loss': 0.4682149670225509, 'epoch': 9.89}
 ### Framework versions