neuralmagic
/

Llama-2-7b-pruned50-retrained

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

mgoin commited on Mar 18

Commit

e440afc

•

1 Parent(s): 64364a5

Update README.md

Files changed (1) hide show

README.md +5 -7

README.md CHANGED Viewed

@@ -18,9 +18,9 @@ This repo contains model files for a [Llama 2 7B](https://huggingface.co/meta-ll
 Below we share some code snippets on how to get quickly started with running the model.
-### Sparse Fine-tuning examples
-Coming soon.
 ### Running the model
@@ -53,13 +53,11 @@ Model evaluation metrics and results.
 | [TruthfulQA](https://arxiv.org/abs/2109.07958) | 5-shot        | xxxx        | xxxx                          |
 | [HumanEval](https://arxiv.org/abs/2107.03374)  | pass@1        | xxxx        | xxxx                          |
 | [GSM8K](https://arxiv.org/abs/2110.14168)      | maj@1         | xxxx        | xxxx                          |
-| ------------------------------                 | ------------- | ----------- | ---------                     |
-| **Average**                                    |               | xxxx        | xxxx                          |
-## Model Training Data
 Coming soon.
-## Sparsification
-This model was pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).

 Below we share some code snippets on how to get quickly started with running the model.
+### Sparse Transfer
+You can adapt pruned large language models (LLMs) to new domains and tasks using sparse transfer learning. By leveraging a pre-sparsified model's structure, you can efficiently fine-tune on new data, leading to reduced hyperparameter tuning, training times, and computational costs. Learn about this process [here](https://neuralmagic.github.io/docs-v2/get-started/transfer).
 ### Running the model
 | [TruthfulQA](https://arxiv.org/abs/2109.07958) | 5-shot        | xxxx        | xxxx                          |
 | [HumanEval](https://arxiv.org/abs/2107.03374)  | pass@1        | xxxx        | xxxx                          |
 | [GSM8K](https://arxiv.org/abs/2110.14168)      | maj@1         | xxxx        | xxxx                          |
+## Model Training Details
 Coming soon.
+## Help
+For further support, and discussions on these models and AI in general, join [Neural Magic's Slack Community](https://join.slack.com/t/discuss-neuralmagic/shared_invite/zt-q1a1cnvo-YBoICSIw3L1dmQpjBeDurQ)