ibivibiv commited on
Commit
617f997
1 Parent(s): dc19756

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -7,6 +7,8 @@ tags:
7
  ---
8
  # Aegolius Acadicus 30B
9
 
 
 
10
  ![img](./aegolius-acadicus.png)
11
 
12
  I like to call this model "The little professor". It is simply a MOE merge of lora merged models across Llama2 and Mistral. I am using this as a test case to move to larger models and get my gate discrimination set correctly. This model is best suited for knowledge related use cases, I did not give it a specific workload target as I did with some of the other models in the "Owl Series".
 
7
  ---
8
  # Aegolius Acadicus 30B
9
 
10
+ This model placed 16th on the leaderboard when first run, but for some bizarre reason got removed. I really don't appreciate it much since I fund all of my work out of my own pocket and work as hard as anyone else at this. I also share all of my work without restriction. I was honestly stunned that it did so well and then equally as stunned someone took it down. It is just an MOE model just like mixtral. I just happened to land the right gates or something I guess? I am going to resubmit if possible. Again I pay for this on rental gear and runpod.
11
+
12
  ![img](./aegolius-acadicus.png)
13
 
14
  I like to call this model "The little professor". It is simply a MOE merge of lora merged models across Llama2 and Mistral. I am using this as a test case to move to larger models and get my gate discrimination set correctly. This model is best suited for knowledge related use cases, I did not give it a specific workload target as I did with some of the other models in the "Owl Series".