kettleguts
/

zephyr-7b-beta_sparse05

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

kettleguts commited on Mar 27

Commit

0410488

•

1 Parent(s): ccdcad2

Update README.md

Files changed (1) hide show

README.md +5 -4

README.md CHANGED Viewed

@@ -15,6 +15,7 @@ language:
 # Model Card for kettleguts/zephyr-7b-beta_sparse05
 This is a pruned version of HuggingFaceH4/zephyr-7b-beta found [here](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta). Wanda pruning was used to introduce 50% sparsity into the linear layers. Read the paper [here](https://arxiv.org/abs/2306.11695).
@@ -41,7 +42,7 @@ This model should never be used for critical decisions involving health, life, e
 ## Bias, Risks, and Limitations
-[No safegaurd have been added to this model.](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta#bias-risks-and-limitations)
 ## How to Get Started with the Model
@@ -88,7 +89,7 @@ Output:
 Pending
-## Model Examination [optional]
 <!-- Relevant interpretability work for the model goes here -->
 Pending
@@ -97,7 +98,7 @@ Pending
 The calculations necessary to prune this model required less than 1 hour of time on a T4 GPU in Colab.
-## Technical Specifications [optional]
 #### Software
@@ -105,7 +106,7 @@ The calculations necessary to prune this model required less than 1 hour of time
 The bulk of this work was done using [Pytorch](https://pytorch.org/). They have an array of built-in [pruning tools](https://pytorch.org/docs/stable/nn.html#:~:text=Utility%20classes%20and%20functions%20for%20pruning%20Module%20parameters
 ) in torch.nn . Also check out the [tutorial](https://pytorch.org/tutorials/intermediate/pruning_tutorial.html) by [Michela Paganini](https://github.com/mickypaganini).
-## Citation [optional]
 **BibTeX:**
 <code>

 # Model Card for kettleguts/zephyr-7b-beta_sparse05
 This is a pruned version of HuggingFaceH4/zephyr-7b-beta found [here](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta). Wanda pruning was used to introduce 50% sparsity into the linear layers. Read the paper [here](https://arxiv.org/abs/2306.11695).
 ## Bias, Risks, and Limitations
+[No safegaurds have been added to this model.](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta#bias-risks-and-limitations)
 ## How to Get Started with the Model
 Pending
+## Model Examination
 <!-- Relevant interpretability work for the model goes here -->
 Pending
 The calculations necessary to prune this model required less than 1 hour of time on a T4 GPU in Colab.
+## Technical Specifications
 #### Software
 The bulk of this work was done using [Pytorch](https://pytorch.org/). They have an array of built-in [pruning tools](https://pytorch.org/docs/stable/nn.html#:~:text=Utility%20classes%20and%20functions%20for%20pruning%20Module%20parameters
 ) in torch.nn . Also check out the [tutorial](https://pytorch.org/tutorials/intermediate/pruning_tutorial.html) by [Michela Paganini](https://github.com/mickypaganini).
+## Citation
 **BibTeX:**
 <code>