MicroPanda123
/

PythonBasic

Text Generation

Model card Files Files and versions Community

MicroPanda123 commited on Jul 14, 2023

Commit

c202300

·

1 Parent(s): 5c8c72a

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -1,5 +1,6 @@
 ---
 license: gpl-2.0
 ---
 Got bored so used [nanoGPT](https://github.com/karpathy/nanoGPT) to train model on all Python snippets from https://www.kaggle.com/datasets/simiotic/github-code-snippets
@@ -12,5 +13,5 @@ batch_size=2
 gradient_accumulation_steps = 64
 ```
 This was because I was training it locally on RTX2060 and did not have enough power to train it more.
-Current model was trained for 8880 iterations.
 At first I made it only save model after validation loss improved, to not allow overfitting, but after some time I decided to risk it and turned that off and allowed it to save everytime, luckly it worked out fine.

 ---
 license: gpl-2.0
+pipeline_tag: text-generation
 ---
 Got bored so used [nanoGPT](https://github.com/karpathy/nanoGPT) to train model on all Python snippets from https://www.kaggle.com/datasets/simiotic/github-code-snippets
 gradient_accumulation_steps = 64
 ```
 This was because I was training it locally on RTX2060 and did not have enough power to train it more.
+Current model was trained for 8880 iterations. Took around 20 hours.
 At first I made it only save model after validation loss improved, to not allow overfitting, but after some time I decided to risk it and turned that off and allowed it to save everytime, luckly it worked out fine.