Brownwang0426
/

Llama-3-Taiwan-8B-Instruct-to-1B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Brownwang0426 commited on Aug 16

Commit

06a09fb

•

1 Parent(s): dffd434

Update README.md

Files changed (1) hide show

README.md +6 -0

README.md CHANGED Viewed

@@ -6,7 +6,13 @@ tags: []
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
 ## Model Details

 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
+This is a testing model by:
+- **shrinking the layers of llama-3-8b to only 2 layers of transformer decoder**
+- **adding a customized layer to llama attention**
+- **shrinking the total param size to around 2b... so pathetic** 😢😢😢
+The purpose of this model is to show how you can download a pre-trained llama and customize it... however you want... and then re-train it with... whatever you want... and then upload your model to hugging face 🤗
 ## Model Details