raygx committed on
Commit
2b4d48f
1 Parent(s): 1feb500

Upload model

Files changed (3)
  1. README.md +5 -5
  2. config.json +1 -1
  3. tf_model.h5 +1 -1
README.md CHANGED
@@ -11,10 +11,10 @@ probably proofread and complete it, then remove this comment. -->
 
 # Nepali-GPT2-CausalLM
 
-This model was trained from scratch on an unknown dataset.
+This model is a fine-tuned version of [raygx/Nepali-GPT2-CausalLM](https://huggingface.co/raygx/Nepali-GPT2-CausalLM) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 4.6517
-- Validation Loss: 4.5987
+- Train Loss: 4.7022
+- Validation Loss: 4.6237
 - Epoch: 1
 
 ## Model description
@@ -41,8 +41,8 @@ The following hyperparameters were used during training:
 
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 4.7603 | 4.6462 | 0 |
-| 4.6517 | 4.5987 | 1 |
+| 4.8141 | 4.6678 | 0 |
+| 4.7022 | 4.6237 | 1 |
 
 
 ### Framework versions
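The README change moves the epoch 1 validation loss from 4.5987 to 4.6237. As a rough aid for reading those numbers, the sketch below converts a loss to perplexity. It assumes the reported values are mean per-token cross-entropy in nats, which the model card does not state; the `perplexity` helper and the `reported` labels are illustrative names, not part of the repository.

```python
import math


def perplexity(mean_cross_entropy_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy_nats)


# Epoch 1 validation losses taken from the two sides of the README diff.
# Assumption: both are mean per-token cross-entropy measured in nats.
reported = {"before this commit": 4.5987, "after this commit": 4.6237}

for label, loss in reported.items():
    print(f"{label}: loss={loss:.4f}, perplexity={perplexity(loss):.1f}")
```

Under that assumption, the slightly higher loss after this commit corresponds to a slightly higher perplexity, so the two checkpoints are close but the newer one is not strictly better on this metric.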
config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "/kaggle/input/nepali-gpt2-model/gpt2NepaliCasualLM",
+  "_name_or_path": "raygx/Nepali-GPT2-CausalLM",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5427633d9bcf2c60a6610fc61f37b2659c9cc97f1ac171f2a450a8ba5a1f8233
+oid sha256:be9b84d76a37a4ce63144769fc23dac53785d3699829917a6a0e281fe29e835e
 size 497145936
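
The `tf_model.h5` entry in the diff is a Git LFS pointer, not the weights themselves: only the `oid` changes while the size stays at 497145936 bytes. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the parsing assumes the simple three-line `key value` layout shown in the diff.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer made of 'key value' lines into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so a ~500 MB weights file is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the "+" side of the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:be9b84d76a37a4ce63144769fc23dac53785d3699829917a6a0e281fe29e835e
size 497145936
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("tf_model.h5", pointer)` on a local copy would then indicate whether that copy corresponds to this commit rather than its parent; the local path is an assumption about where the file was saved.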