alonzogarbanzo
/

Bloom-1b7-creative-writing

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

alonzogarbanzo commited on Mar 3, 2024

Commit

dfec861

·

verified ·

1 Parent(s): 01995b2

Update README.md

Files changed (1) hide show

README.md +21 -3

README.md CHANGED Viewed

@@ -22,12 +22,30 @@ Intended for use on a student group project for Portland State University's Wint
 ## Training and evaluation data
-More information needed
 ## Training procedure
 Trained on a single RTX 3090 card.
 ### Training hyperparameters
 The following hyperparameters were used during training:
@@ -43,10 +61,10 @@ The following hyperparameters were used during training:
 - mixed_precision_training: Native AMP
 ### Training results
-After final epoch:
 {'loss': 0.0472, 'learning_rate': 1.4893617021276598e-06, 'epoch': 4.95}
-After full completion
 {'train_runtime': 563.2707,
 'train_samples_per_second': 1.687,
 'train_steps_per_second': 0.417,

 ## Training and evaluation data
+Instruction Tuned on the creative writing dataset here: https://huggingface.co/datasets/adambjorn/UnrelatedForgettingOverhead/viewer/creative
 ## Training procedure
 Trained on a single RTX 3090 card.
+Given a set of prompts:
+```python
+prompts = [
+    "Write a creative short story based on the following title:",
+    "Here is a title for a story. Craft a short narrative around it:",
+    "Using the title given, develop a short story:",
+    "Imagine a short story that starts with this title:",
+    "Create a brief story with the following title:"
+]
+```
+Concatenate the prompt, the title and the story like so:
+```python
+concatenated_texts = [random.choice(prompts) + " " + title + "</s>" + "Story: " + selftext for title, selftext in zip(titles, selftexts)]
+```
 ### Training hyperparameters
 The following hyperparameters were used during training:
 - mixed_precision_training: Native AMP
 ### Training results
+Final results:
 {'loss': 0.0472, 'learning_rate': 1.4893617021276598e-06, 'epoch': 4.95}
+Average results:
 {'train_runtime': 563.2707,
 'train_samples_per_second': 1.687,
 'train_steps_per_second': 0.417,