MicroPanda123
/

PythonBasic

Text Generation

Model card Files Files and versions Community

MicroPanda123 commited on Jul 14, 2023

Commit

d2e1b5a

·

1 Parent(s): c202300

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -12,6 +12,6 @@ eval_iters=40
 batch_size=2
 gradient_accumulation_steps = 64
 ```
-This was because I was training it locally on RTX2060 and did not have enough power to train it more.
 Current model was trained for 8880 iterations. Took around 20 hours.
 At first I made it only save model after validation loss improved, to not allow overfitting, but after some time I decided to risk it and turned that off and allowed it to save everytime, luckly it worked out fine.

 batch_size=2
 gradient_accumulation_steps = 64
 ```
+This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
 Current model was trained for 8880 iterations. Took around 20 hours.
 At first I made it only save model after validation loss improved, to not allow overfitting, but after some time I decided to risk it and turned that off and allowed it to save everytime, luckly it worked out fine.