pszemraj/flan-t5-xl-summary-map-reduce-1024

pszemraj committed on Dec 5, 2024

Commit f30c6d6 · verified · 1 parent: 135eaf0

Update README.md

Files changed (1): README.md (+4 -32)

@@ -4,11 +4,12 @@ language:
 - en
 license: apache-2.0
 base_model: google/flan-t5-xl
-tags:
-- generated_from_trainer
 model-index:
 - name: flan-t5-xl-summary-map-reduce-1024
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,22 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 # flan-t5-xl-summary-map-reduce-1024
-This model is a fine-tuned version of [google/flan-t5-xl](https://huggingface.co/google/flan-t5-xl) on the pszemraj/summary-map-reduce dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6039
 - Num Input Tokens Seen: 7138765
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -48,21 +38,3 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
 - num_epochs: 2.0
-### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Input Tokens Seen |
-|:-------------:|:------:|:----:|:---------------:|:-----------------:|
-| 0.8172        | 0.3851 | 100  | 0.6644          | 1364870           |
-| 0.7664        | 0.7702 | 200  | 0.6271          | 2744502           |
-| 0.6584        | 1.1552 | 300  | 0.6146          | 4137699           |
-| 0.6348        | 1.5403 | 400  | 0.6049          | 5518719           |
-| 0.6372        | 1.9254 | 500  | 0.6038          | 6895203           |
-### Framework versions
-- Transformers 4.46.0.dev0
-- Pytorch 2.5.1+cu124
-- Datasets 3.1.0
-- Tokenizers 0.20.2

 - en
 license: apache-2.0
 base_model: google/flan-t5-xl
 model-index:
 - name: flan-t5-xl-summary-map-reduce-1024
   results: []
+datasets:
+- pszemraj/summary-map-reduce-v1
+pipeline_tag: text2text-generation
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # flan-t5-xl-summary-map-reduce-1024
+This model is a fine-tuned version of [google/flan-t5-xl](https://huggingface.co/google/flan-t5-xl) on the pszemraj/summary-map-reduce-v1 dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6039
 - Num Input Tokens Seen: 7138765
 ## Training procedure
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
 - num_epochs: 2.0
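
The scheduler settings kept in the README (`lr_scheduler_type: cosine`, `lr_scheduler_warmup_ratio: 0.05`) can be illustrated with a minimal sketch. It assumes the common convention of a linear warmup followed by a half-cosine decay to zero, and it is not taken from the author's training code. The function name `cosine_lr_factor` and the `total_steps` value in the usage lines are hypothetical. The function returns only a multiplier, because the base learning rate is not visible in this diff.

```python
import math


def cosine_lr_factor(step: int, total_steps: int, warmup_ratio: float = 0.05) -> float:
    """Return the learning-rate multiplier in [0, 1] for a given optimizer step.

    Assumed convention (not confirmed by the source): linear warmup from 0 to 1
    over ceil(total_steps * warmup_ratio) steps, then half-cosine decay to 0.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp during warmup.
        return step / max(1, warmup_steps)
    # Fraction of the post-warmup phase that has elapsed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    total_steps = 100  # hypothetical; the real step count depends on the run
    for step in (0, 5, 50, 100):
        print(step, round(cosine_lr_factor(step, total_steps), 4))
```

Multiplying the returned factor by the run's base learning rate gives the effective rate at each step under these assumptions.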