Brownwang0426 committed
Commit • 06a09fb
1 Parent(s): dffd434

Update README.md

README.md CHANGED
@@ -6,7 +6,13 @@ tags: []
 # Model Card for Model ID
 
 <!-- Provide a quick summary of what the model is/does. -->
+This is a test model made by:
 
+- **shrinking llama-3-8b to only 2 transformer decoder layers**
+- **adding a customized layer to llama attention**
+- **shrinking the total param size to around 2b... so pathetic** 😢😢😢
+
+The purpose of this model is to show how you can download a pre-trained llama, customize it however you want, re-train it with whatever you want, and then upload your model to Hugging Face 🤗
 
 
 ## Model Details
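
To put the "around 2b" figure in context, here is a rough parameter count for a Llama-style decoder at 32 layers and at 2 layers. This is only a sketch: it assumes the published llama-3-8b dimensions (vocab 128256, hidden size 4096, MLP size 14336, 32 attention heads, 8 key/value heads, untied input and output embeddings), and `LlamaShape` and `count_params` are hypothetical helper names, not part of any library. The customized attention layer from the README is not counted because its size is not described, so the 2-layer number should be read as a lower bound.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LlamaShape:
    """Assumed llama-3-8b dimensions; only num_layers is varied below."""
    vocab_size: int = 128_256
    hidden_size: int = 4_096
    intermediate_size: int = 14_336
    num_heads: int = 32
    num_kv_heads: int = 8
    num_layers: int = 32


def count_params(shape: LlamaShape) -> int:
    """Count weights of a plain Llama-style decoder (no biases, no custom layers)."""
    head_dim = shape.hidden_size // shape.num_heads
    kv_dim = shape.num_kv_heads * head_dim
    # q_proj and o_proj are hidden x hidden; k_proj and v_proj are hidden x kv_dim.
    attention = 2 * shape.hidden_size * shape.hidden_size + 2 * shape.hidden_size * kv_dim
    # gate_proj, up_proj and down_proj each connect hidden and intermediate sizes.
    mlp = 3 * shape.hidden_size * shape.intermediate_size
    # Two RMSNorm weight vectors per decoder layer.
    norms = 2 * shape.hidden_size
    per_layer = attention + mlp + norms
    # Input embedding and output head are separate (untied) matrices.
    embeddings = 2 * shape.vocab_size * shape.hidden_size
    final_norm = shape.hidden_size
    return shape.num_layers * per_layer + embeddings + final_norm


full = count_params(LlamaShape())
shrunk = count_params(LlamaShape(num_layers=2))
print(f"32 layers: {full:,}")
print(f" 2 layers: {shrunk:,}")
```

Under these assumptions the 2-layer stack comes to about 1.49B weights, most of which (about 1.05B) sit in the two embedding matrices rather than in the decoder layers. Whatever the customized attention layer adds would account for the gap to the "around 2b" stated in the README.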