Forecast-ing/modernBERT-content-regression

@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6211
-- Mse: 3.6211
-- Rmse: 1.9029
-- Mae: 1.2271
-- R2: 0.0086
-- Smape: 46.4502
 ## Model description
@@ -40,19 +40,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5.939762584910739e-07
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Mse    | Rmse   | Mae    | R2     | Smape   |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:-------:|
-| 0.0949        | 1.0   | 124  | 3.6211          | 3.6211 | 1.9029 | 1.2271 | 0.0086 | 46.4502 |
 ### Framework versions

 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4624
+- Mse: 2.4624
+- Rmse: 1.5692
+- Mae: 1.1822
+- R2: 0.3258
+- Smape: 56.6145
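
The reported numbers are internally consistent with a mean-squared-error objective: Loss equals Mse on both sides of the diff, and Rmse is the square root of Mse (sqrt(2.4624) ≈ 1.5692). The card does not show how the metrics were computed, so the following is only a minimal sketch of one common set of definitions. The function name `regression_metrics` is hypothetical, and the SMAPE form used here (percentage scale, with |y| + |ŷ| in the denominator) is an assumption; other SMAPE variants exist and would give different values.

```python
import math


def regression_metrics(y_true, y_pred):
    """Compute MSE, RMSE, MAE, R2 and SMAPE for two equal-length sequences.

    Hypothetical helper for illustration; not taken from the model's training code.
    """
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]

    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n

    # R2 = 1 - SS_res / SS_tot; it is negative when the model does worse than
    # always predicting the mean of y_true.
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    # Assumed SMAPE variant: 100 * mean(2|y - yhat| / (|y| + |yhat|)).
    # Pairs where both values are zero contribute 0 to avoid division by zero.
    terms = []
    for t, p in zip(y_true, y_pred):
        denom = abs(t) + abs(p)
        terms.append(2.0 * abs(p - t) / denom if denom > 0 else 0.0)
    smape = 100.0 * sum(terms) / n

    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae, "r2": r2, "smape": smape}
```

Under these definitions, a model that always predicts the mean of the targets scores an R2 of exactly 0, which is a useful baseline when reading the R2 values in this card.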
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2.479942619764035e-05
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Mse    | Rmse   | Mae    | R2      | Smape   |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:-------:|:-------:|
+| 0.1152        | 1.0   | 124  | 4.0842          | 4.0842 | 2.0209 | 1.2199 | -0.1182 | 49.0235 |
+| 1.239         | 2.0   | 248  | 3.8036          | 3.8036 | 1.9503 | 1.2892 | -0.0414 | 52.7754 |
+| 27.8256       | 3.0   | 372  | 3.2460          | 3.2460 | 1.8017 | 1.1022 | 0.1113  | 51.7470 |
+| 0.0001        | 4.0   | 496  | 2.4134          | 2.4134 | 1.5535 | 1.0811 | 0.3392  | 52.2215 |
+| 0.1666        | 5.0   | 620  | 2.4624          | 2.4624 | 1.5692 | 1.1822 | 0.3258  | 56.6145 |
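
A note on reading this table: the headline metrics in the updated card are those of the final epoch (step 620), but epoch 4 (step 496) has both the lowest validation loss (2.4134) and the highest R2 (0.3392). The sketch below copies the rows from the table, picks the best epoch by validation loss, and checks that each Rmse matches the square root of its Mse within rounding. The names `ROWS`, `best_epoch`, and `rmse_is_consistent` are hypothetical helpers introduced for illustration.

```python
import math

# Rows copied from the "Training results" table on the "+" side of the diff.
# Validation Loss and Mse are identical in every row, so only one is stored.
ROWS = [
    {"epoch": 1, "step": 124, "val_loss": 4.0842, "rmse": 2.0209, "mae": 1.2199, "r2": -0.1182, "smape": 49.0235},
    {"epoch": 2, "step": 248, "val_loss": 3.8036, "rmse": 1.9503, "mae": 1.2892, "r2": -0.0414, "smape": 52.7754},
    {"epoch": 3, "step": 372, "val_loss": 3.2460, "rmse": 1.8017, "mae": 1.1022, "r2": 0.1113, "smape": 51.7470},
    {"epoch": 4, "step": 496, "val_loss": 2.4134, "rmse": 1.5535, "mae": 1.0811, "r2": 0.3392, "smape": 52.2215},
    {"epoch": 5, "step": 620, "val_loss": 2.4624, "rmse": 1.5692, "mae": 1.1822, "r2": 0.3258, "smape": 56.6145},
]


def best_epoch(rows, key="val_loss"):
    """Return the row with the smallest value of `key` (lower is better)."""
    return min(rows, key=lambda row: row[key])


def rmse_is_consistent(rows, tol=1e-4):
    """Check that each reported RMSE equals sqrt(MSE) up to rounding."""
    return all(abs(math.sqrt(row["val_loss"]) - row["rmse"]) < tol for row in rows)


if __name__ == "__main__":
    best = best_epoch(ROWS)
    print(f"best epoch by validation loss: {best['epoch']} (step {best['step']})")
    print(f"RMSE consistent with sqrt(MSE): {rmse_is_consistent(ROWS)}")
```

If the run used the Hugging Face `Trainer`, setting `load_best_model_at_end=True` with a matching `metric_for_best_model` in `TrainingArguments` would keep the epoch-4 checkpoint instead of the final one. The diff does not show whether that option was set.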
 ### Framework versions

runs/Jan09_12-34-06_bazzite/events.out.tfevents.1736455080.bazzite ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0d0636266d00650e5be7bd27c33c6d4d168136d3ee992deb06112e978c8769c9
+size 40