mlabonne committed on
Commit df5eefc
1 Parent(s): ef797d7

Update README.md

Files changed (1)
  1. README.md +41 -12
README.md CHANGED
@@ -1,27 +1,31 @@
  ---
- base_model:
- - Qwen/Qwen2.5-72B-Instruct
+ license: other
+ license_name: tongyi-qianwen
+ license_link: https://huggingface.co/Qwen/Qwen2-72B-Instruct/blob/main/LICENSE
+ language:
+ - en
+ pipeline_tag: text-generation
  library_name: transformers
  tags:
  - mergekit
  - merge
-
+ - lazymergekit
+ base_model:
+ - Qwen/Qwen2.5-72B-Instruct
  ---
- # merge
+ # BigQwen2.5-120B-Instruct

- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+ BigQwen2.5-120B-Instruct is a [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) self-merge made with [MergeKit](https://github.com/arcee-ai/mergekit/tree/main).

- ## Merge Details
- ### Merge Method
+ It applies the [mlabonne/Meta-Llama-3-120B-Instruct](https://huggingface.co/mlabonne/Meta-Llama-3-120B-Instruct/) recipe.

- This model was merged using the passthrough merge method.
+ I made it due to popular demand, but I haven't tested it, so use it at your own risk. ¯\\\_(ツ)_/¯

- ### Models Merged
+ ## 🔍 Applications

- The following models were included in the merge:
- * [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct)
+ It might be good for creative writing tasks. I recommend a context length of 32k, but you can go up to 131,072 tokens in theory.

- ### Configuration
+ ## 🧩 Configuration

  The following YAML configuration was used to produce this model:

@@ -52,3 +56,28 @@ merge_method: passthrough
  dtype: bfloat16

  ```
+
+ ## 💻 Usage
+
+ ```python
+ # Install dependencies first: pip install -qU transformers accelerate
+
+ from transformers import AutoTokenizer
+ import transformers
+ import torch
+
+ model = "mlabonne/BigQwen2.5-120B-Instruct"
+ messages = [{"role": "user", "content": "What is a large language model?"}]
+
+ tokenizer = AutoTokenizer.from_pretrained(model)
+ prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+ pipeline = transformers.pipeline(
+     "text-generation",
+     model=model,
+     torch_dtype=torch.float16,
+     device_map="auto",
+ )
+
+ outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
+ print(outputs[0]["generated_text"])
+ ```
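
The body of the 🧩 Configuration block is elided from this diff: only `merge_method: passthrough` (visible in the second hunk header) and `dtype: bfloat16` survive as context lines. For orientation, a passthrough self-merge in the style of the Meta-Llama-3-120B-Instruct recipe stacks overlapping slices of the base model's layers. The sketch below is hypothetical: the slice boundaries are illustrative assumptions, not the actual configuration of BigQwen2.5-120B-Instruct.

```yaml
# Hypothetical sketch only; the real slice boundaries are not shown in this diff.
# Assumes the 80-layer Qwen2.5-72B-Instruct base, duplicated in overlapping
# 20-layer windows with a stride of 10, as in the Meta-Llama-3-120B recipe.
slices:
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [0, 20]
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [10, 30]
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [20, 40]
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [30, 50]
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [40, 60]
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [50, 70]
- sources:
  - model: Qwen/Qwen2.5-72B-Instruct
    layer_range: [60, 80]
merge_method: passthrough
dtype: bfloat16
```

A config like this is run with MergeKit's `mergekit-yaml` command (e.g. `mergekit-yaml config.yaml ./BigQwen2.5-120B-Instruct`). Because passthrough copies each listed slice verbatim rather than averaging weights, the merged model ends up with more layers, and therefore more parameters, than the 72B base.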