MB-Zephyria-45b / README.md
Steelskull's picture
Update README.md
cd730f9 verified
---
base_model:
- unsloth/Mistral-Small-Instruct-2409
library_name: transformers
tags:
- mergekit
- merge
---
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>MB-Zephyria-45b Data Card</title>
<link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@400;500;600&display=swap" rel="stylesheet">
<style>
body, html {
height: 100%;
margin: 0;
padding: 0;
font-family: 'Quicksand', sans-serif;
background: linear-gradient(135deg, #0a1128 0%, #1c2541 100%);
color: #e0e1dd;
font-size: 16px;
}
.container {
width: 100%;
height: 100%;
padding: 20px;
margin: 0;
background-color: rgba(255, 255, 255, 0.05);
border-radius: 12px;
box-shadow: 0 4px 10px rgba(0, 0, 0, 0.3);
backdrop-filter: blur(10px);
border: 1px solid rgba(255, 255, 255, 0.1);
}
.header h1 {
font-size: 28px;
color: #4cc9f0;
margin: 0 0 20px 0;
text-shadow: 2px 2px 4px rgba(0, 0, 0, 0.3);
}
.update-section h2 {
font-size: 24px;
color: #7209b7;
}
.update-section p {
font-size: 16px;
line-height: 1.6;
color: #e0e1dd;
}
.info img {
width: 100%;
border-radius: 10px;
margin-bottom: 15px;
}
a {
color: #4cc9f0;
text-decoration: none;
}
a:hover {
color: #f72585;
}
.button {
display: inline-block;
background-color: #3a0ca3;
color: #e0e1dd;
padding: 10px 20px;
border-radius: 5px;
cursor: pointer;
text-decoration: none;
}
.button:hover {
background-color: #7209b7;
}
pre {
background-color: #1c2541;
padding: 10px;
border-radius: 5px;
overflow-x: auto;
}
code {
font-family: 'Courier New', monospace;
color: #e0e1dd;
}
</style>
</head>
<body>
<div class="container">
<div class="header">
<h1>MB-Zephyria-45b [EXPERIMENTAL]</h1>
</div>
<div class="info">
<img src="https://cdn-uploads.huggingface.co/production/uploads/64545af5ec40bbbd01242ca6/6W3orrbf8A68l-3p_JxN1.png">
<h2>Model Information</h2>
<p><strong>Base Model:</strong> unsloth/Mistral-Small-Instruct-2409</p>
<p><strong>Strategy:</strong> Modified Balanced Approach with Extended Duplication</p>
<p><strong>Total Layers:</strong> 55</p>
<p><strong>Duplication Start:</strong> Layer 19 (34.5% of model)</p>
<p><strong>Duplicated Layers:</strong> 30 (54.5% of model)</p>
<p><strong>Unique Final Layers:</strong> 7 (11% of model)</p>
<h2>Model Characteristics</h2>
<ul>
<li>Models down_proj and o_proj layers have been nulled and will require healing</li>
<li>Extends duplication further into later layers compared to the Balanced Approach</li>
<li>Aims to enhance both understanding and creativity</li>
<li>Maintains substantial unique initial layers for foundational processing</li>
<li>Potentially suitable for complex reasoning and generative tasks</li>
</ul>
<h2>Configuration Visualization</h2>
<pre><code>
[ Unique ][ Duplicated ][Unique]
0 ----------- 18 19 ------------------- 48 49 --- 54
34.5% 54.5% 11%
</code></pre>
</div>
</div>
</body>
</html>