Epoch | Training Loss | Validation Loss | Perplexity |
---|---|---|---|
1 | 3.450200 | 3.380510 | |
2 | 3.309700 | 3.281852 | |
3 | 3.238600 | 3.230116 | |
4 | 3.200200 | 3.195386 | |
5 | 3.148400 | 3.170542 | |
6 | 3.125000 | 3.151122 | 23.36 |
7 | 3.102700 | 3.136005 | |
8 | 3.091500 | 3.123388 | |
9 | 3.067200 | 3.112496 | |
10 | 3.057800 | 3.103361 | |
11 | 3.039200 | 3.095544 | |
12 | 3.023500 | 3.088650 | 21.95 |
13 | 3.012200 | 3.082917 | |
14 | 3.009600 | 3.077587 | |
15 | 2.998100 | 3.073186 | |
16 | 2.987500 | 3.069203 | |
17 | 2.975100 | 3.065609 | |
18 | 2.974400 | 3.062548 | 21.38 |
19 | 2.969300 | 3.059562 | |
20 | 2.956800 | 3.057292 | |
21 | 2.950900 | 3.054723 | |
22 | 2.952200 | 3.052954 | |
23 | 2.944300 | 3.051039 | |
24 | 2.939600 | 3.049278 | 21.10 |
25 | 2.923400 | 3.047985 | |
26 | 2.919100 | 3.046863 | |
27 | 2.932800 | 3.045910 | |
28 | 2.922700 | 3.045190 | |
29 | 2.917000 | 3.044068 | |
30 | 2.922100 | 3.043669 | 20.98 |
31 | 2.910200 | 3.043278 | |
32 | 2.911400 | 3.042759 | |
33 | 2.913500 | 3.042451 | |
34 | 2.902300 | 3.042186 | |
35 | 2.914200 | 3.042054 | |
36 | 2.905900 | 3.042003 | 20.95 |
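
The reported perplexity values appear to be the exponential of the validation loss at the same epoch. The card does not state this, so treat it as an assumption: it holds if the loss is a mean token-level cross-entropy measured in nats. A minimal sketch that checks the relationship against the table (the `perplexity` helper and the `reported` dictionary are illustrative names, not part of the training code):

```python
import math

# (validation loss, reported perplexity) copied from the table above,
# for the epochs where a perplexity value is listed.
reported = {
    6: (3.151122, 23.36),
    12: (3.088650, 21.95),
    18: (3.062548, 21.38),
    24: (3.049278, 21.10),
    30: (3.043669, 20.98),
    36: (3.042003, 20.95),
}


def perplexity(loss_nats: float) -> float:
    """Perplexity under the assumption that the loss is mean cross-entropy in nats."""
    return math.exp(loss_nats)


for epoch, (val_loss, ppl) in reported.items():
    computed = perplexity(val_loss)
    print(f"epoch {epoch:2d}: exp({val_loss:.6f}) = {computed:.2f} (reported {ppl:.2f})")
```

Under the same assumption, the blank perplexity cells could be filled in by applying `perplexity` to the validation loss of the corresponding epoch.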