Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
eswardivi
/
mingpt-openwebtext
like
1
Text Generation
Transformers
PyTorch
Skylion007/openwebtext
gpt2
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
eswardivi
commited on
Jan 8, 2024
Commit
baef585
·
1 Parent(s):
809e1c6
Create README.md
Browse files
Files changed (1)
hide
show
README.md
+12
-0
README.md
ADDED
Viewed
@@ -0,0 +1,12 @@
1
+
# MinGPT
2
+
3
+
Normal GPT 2 Architecture with below config trained on subset of openwebtext
4
+
5
+
```
6
+
n_ctx=256,
7
+
n_positions = 256,
8
+
n_layer = 6,
9
+
n_embd = 384,
10
+
n_head = 6,
11
+
12
+
```