QuietImpostor committed on
Commit
71d1789
1 Parent(s): ed6595d

Update README.md

Files changed (1)
  1. README.md +1 -4
README.md CHANGED
@@ -6,7 +6,4 @@ tags:
 - conversational
 ---
 # Info
-This is a V2 of the Gemini Nano V2 weights. The reason this is a V2 is the original conversion code was heavily bugged and extremely slow. So Claude 3.5 Sonnet and o1-preview went in and fixed it! Now you'll notice the model has a lot more 2 dimension tensors and should, as a result, be easier to get working as a Gemma2 model!
-
-# Known issues
-The layer norms have an extra 2 dimensions. This will be fixed ASAP!
+This is a V2 of the Gemini Nano V2 weights. The reason this is a V2 is the original conversion code was heavily bugged and extremely slow. So Claude 3.5 Sonnet and o1-preview went in and fixed it! Now you'll notice the model has a lot more 2 dimension tensors and should, as a result, be easier to get working as a Gemma2 model!