add model

Files changed (7)

README.md CHANGED

@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 # gpt-neo-125M-finetuned-pgt
-This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6140
 ## Model description
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 26   | 1.6726          |
-| No log        | 2.0   | 52   | 1.6232          |
-| No log        | 3.0   | 78   | 1.6140          |
 ### Framework versions

 # gpt-neo-125M-finetuned-pgt
+This model is a fine-tuned version of [pritoms/gpt-neo-125M-finetuned-pgt](https://huggingface.co/pritoms/gpt-neo-125M-finetuned-pgt) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6026
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 26   | 1.5947          |
+| No log        | 2.0   | 52   | 1.5963          |
+| No log        | 3.0   | 78   | 1.6026          |
 ### Framework versions
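
Review note on the two result tables above: a minimal sketch for comparing the runs, assuming the reported validation loss is a mean per-token cross-entropy in nats (the usual case for a causal LM, but not stated in the card), so that perplexity is `exp(loss)`. The dictionary names and helper functions are hypothetical; the loss values are copied from the diff.

```python
import math

# Validation loss per epoch, copied from the removed (-) and added (+) table rows.
old_run = {1: 1.6726, 2: 1.6232, 3: 1.6140}
new_run = {1: 1.5947, 2: 1.5963, 3: 1.6026}


def perplexity(loss: float) -> float:
    """Perplexity, assuming `loss` is mean token-level cross-entropy in nats."""
    return math.exp(loss)


def best_epoch(run: dict) -> int:
    """Epoch with the lowest validation loss."""
    return min(run, key=run.get)


for name, run in (("old", old_run), ("new", new_run)):
    for epoch, loss in sorted(run.items()):
        print(f"{name} run, epoch {epoch}: loss={loss:.4f}, perplexity={perplexity(loss):.3f}")
    print(f"{name} run: lowest validation loss at epoch {best_epoch(run)}")
```

In the new run the validation loss rises at every epoch, so the reported final value (1.6026) is the highest of the three, while epoch 1 (1.5947) is the lowest.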

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "EleutherAI/gpt-neo-125M",
   "activation_function": "gelu_new",
   "architectures": [
     "GPTNeoForCausalLM"

 {
+  "_name_or_path": "pritoms/gpt-neo-125M-finetuned-pgt",
   "activation_function": "gelu_new",
   "architectures": [
     "GPTNeoForCausalLM"

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c5a3af5d31c1957d639fff20bfb3a5622267bc1f945999713ec1f336a4fb6c12
 size 526017245

 version https://git-lfs.github.com/spec/v1
+oid sha256:1b0e9f93c6d6ca7e1b53ca0e3c88bc5234e68c0ae0a9026435c16d0862039bb9
 size 526017245
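
Review note: the hunks above are Git LFS pointer files, not the weights themselves. The `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes, which is why the size can stay at 526017245 while the hash changes. Below is a minimal sketch for checking a downloaded file against a pointer; the function names and the local path are hypothetical and not part of this commit.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so a ~500 MB checkpoint is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer for the new pytorch_model.bin, copied from the added (+) side of the diff.
new_pointer = parse_lfs_pointer(
    """version https://git-lfs.github.com/spec/v1
oid sha256:1b0e9f93c6d6ca7e1b53ca0e3c88bc5234e68c0ae0a9026435c16d0862039bb9
size 526017245"""
)
```

Usage, assuming the weights were downloaded to a hypothetical local path: `matches_pointer("pytorch_model.bin", new_pointer)`.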

runs/Sep07_08-19-22_e8666d15b86d/1631002766.9698904/events.out.tfevents.1631002766.e8666d15b86d.91.4 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:efb957f758ed65f7349cc7ebac7bbb9e4a3b8462cc5c85885c7574b244996923
+size 4187

runs/Sep07_08-19-22_e8666d15b86d/events.out.tfevents.1631002766.e8666d15b86d.91.3 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0c42c0cfc9f4f697aa76f578413fdce39c2bf2a59d2713feead26be162536789
+size 4546

runs/Sep07_08-19-22_e8666d15b86d/events.out.tfevents.1631002840.e8666d15b86d.91.5 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c5b336643673e8e0ac188d6634750d4f41e5fcd24b78e2a4fa62635d0b6429c1
+size 306

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c1046f63b508a80a18e8c9b553b593313548b5af1a9cb5a3dcb513474c6ff40
 size 2671

 version https://git-lfs.github.com/spec/v1
+oid sha256:1b64c32d1a356b30e858d9285363d7e9142573500b37fcd04a784b94e911a321
 size 2671