LLENN-v0.69420-Qwen2.5-72b
Model stock merge for fun. Probably final model mix.
This merge is an answer to people's requests. I really don't wanna do more merges without myself considering to use it.
Models Merged
The following models were included in the merge:
- rombodawg/Rombos-LLM-V2.5-Qwen-72b
- abacusai/Dracarys2-72B-Instruct
- EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
- ZeusLabs/Chronos-Platinum-72B
- anthracite-org/magnum-v4-72b
- m8than/banana-2-b-72b
Configuration
The following YAML configuration was used to produce this model:
models:
- model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
- model: ZeusLabs/Chronos-Platinum-72B
- model: anthracite-org/magnum-v4-72b
- model: abacusai/Dracarys2-72B-Instruct
- model: rombodawg/Rombos-LLM-V2.5-Qwen-72b
- model: m8than/banana-2-b-72b
merge_method: model_stock
base_model: Qwen/Qwen2.5-72B
parameters:
normalize: true
dtype: bfloat16
Prompt Format
ChatML works for the most part.
Sampler Settings
Personally I use the following:
Temp: 1.2
Min P: 0.07
Rep Pen: 1.1
Others have suggested the following:
Temp: 1.1
Top P: 0.98
Min P: 0.05
- Downloads last month
- 201
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.