mmoffatt committed on
Commit
fabe18e
1 Parent(s): 9df72c0

Update README.md

  1. README.md +7 -1
README.md CHANGED
@@ -6,4 +6,10 @@ language:
  - en
  ---
 
- This is a custom implementation of GPT-2 in which we replace the attention module with our own implementation. Currently we do not replace softmax, but in future commits we would like to replace the softmax function in attention with other softmax variants.
+ This is a custom implementation of GPT-2 in which we replace the attention module with our own implementation. Currently we do not replace softmax, but in future commits we would like to replace the softmax function in attention with other softmax variants.
+
+ We directly use the Hugging Face GPT-2 model: https://huggingface.co/openai-community/gpt2
+
+ This model was fine-tuned on the WikiText dataset: https://paperswithcode.com/dataset/wikitext-2
+
+ base model: Hugging Face GPT-2