Adding `safetensors` variant of this model

by aisltnab - opened Dec 6, 2023

base: refs/heads/main

←

from: refs/pr/4

Discussion Files changed

+497

-213

Files changed (7) hide show

LICENSE.txt +110 -25
README.md +8 -40
langdemo.py +0 -148
model-00001-of-00003.safetensors +3 -0
model-00002-of-00003.safetensors +3 -0
model-00003-of-00003.safetensors +3 -0
model.safetensors.index.json +370 -0

LICENSE.txt CHANGED Viewed

@@ -1,41 +1,126 @@
-Nexusflow.ai License Terms
-NexusRaven-V2 Version Release Date: December 5, 2023
-“Agreement” means the terms and conditions for use, reproduction, distribution and modification of the Nexusflow Materials set forth herein.
-“Documentation” means the specifications, manuals and documentation accompanying NeuxsRaven-V2 distributed by Nexusflow at https://huggingface.co/Nexusflow/NexusRaven-V2-13B, if any.
-“Licensee” or “you” means you, or your employer or any other person or entity (if you are entering into this Agreement on such person or entity’s behalf), of the age required under applicable laws, rules or regulations to provide legal consent and that has legal authority to bind your employer or such other person or entity if you are entering in this Agreement on their behalf.
-“NexusRaven-V2” means the large language models and software and algorithms, including machine-learning model code, trained model weights, inference-enabling code, training-enabling code, fine-tuning enabling code and other elements of the foregoing made available by Nexusflow at https://huggingface.co/Nexusflow/NexusRaven-V2-13B.
-“Nexusflow Materials” means, collectively, Nexusflow’s proprietary NexusRaven-V2 and Documentation (and any portion thereof) made available under this Agreement.
-“Nexusflow” or “we” means Nexusflow.ai Inc.
-By using or distributing any portion or element of the Nexusflow Materials, you agree to be bound by this Agreement.
-1. License Rights and Redistribution.
-  a. Grant of Rights. You are granted a non-exclusive, worldwide, non-transferable and royalty-free limited license under Nexusflow’s intellectual property or other rights owned by Nexusflow embodied in the Nexusflow Materials to use, reproduce, distribute, copy, create derivative works of, and make modifications to the Nexusflow Materials.
-  b. Redistribution and Use.
-    i. If you distribute or make the Nexusflow Materials, or any derivative works thereof, available to a third party, you shall provide a copy of this Agreement to such third party.
-    ii. If you receive Nexusflow Materials, or any derivative works thereof, from a Licensee as part of an integrated end user product, then Section 1 of this Agreement will not apply to you.
-    iii. You must retain in all copies of the Nexusflow Materials that you distribute the following attribution notice within a “Notice” text file distributed as a part of such copies: “NexusRaven-V2 is licensed under the Nexusflow License, Copyright © Nexusflow.ai Inc. All Rights Reserved.”
-    iv. Your use of the Nexusflow Materials must comply with applicable laws and regulations (including trade compliance laws and regulations) and adhere to Nexusflow terms and policies (if any), which are hereby incorporated by reference into this Agreement. The Nexusflow Materials are derived from Llama 2 as offered by Meta Platforms Ireland Limited or Meta Platforms, Inc., and you further agree that your use of the Nexusflow Materials shall be subject to the applicable terms and conditions of the Llama 2 Community License Agreement, available at https://ai.meta.com/llama/license/.
-    v. You will not use the Nexusflow Materials or any output or results of the Nexusflow Materials to improve any other large language model (excluding NexusRaven-V2 or derivative works thereof).
-2. Additional Commercial Terms. If, on the NexusRaven-V2 version release date, the monthly active users of the products or services made available by or for Licensee, or Licensee’s affiliates, is greater than 50 million monthly active users in the preceding calendar month, you must request a license from Nexusflow, which Nexusflow may grant to you in its sole discretion, and you are not authorized to exercise any of the rights under this Agreement unless or until Nexusflow otherwise expressly grants you such rights.
-3. Disclaimer of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE NEXUSFLOW MATERIALS AND ANY OUTPUT AND RESULTS THEREFROM ARE PROVIDED ON AN “AS IS” BASIS, WITHOUT WARRANTIES OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR DETERMINING THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE NEXUSFLOW MATERIALS AND ASSUME ANY RISKS ASSOCIATED WITH YOUR USE OF THE NEXUSFLOW MATERIALS AND ANY OUTPUT AND RESULTS.
-4. Limitation of Liability. IN NO EVENT WILL NEXUSFLOW, ITS LICENSORS OR AFFILIATES BE LIABLE UNDER ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY, OR OTHERWISE, ARISING OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT, SPECIAL, CONSEQUENTIAL, INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF NEXUSFLOW OR ITS AFFILIATES HAVE BEEN ADVISED OF THE POSSIBILITY OF ANY OF THE FOREGOING.
 5. Intellectual Property.
-  a. No trademark licenses are granted under this Agreement, and in connection with the Nexusflow Materials, neither Nexusflow nor Licensee may use any name or mark owned by or associated with the other or any of its affiliates, except as required for reasonable and customary use in describing and using the Nexusflow Materials.
-  b. Subject to Nexusflow’s ownership of Nexusflow Materials and derivatives made by or for Nexusflow (and any rights retained therein by its licensors to the foregoing), with respect to any derivative works and modifications of the Nexusflow Materials that are made by you, as between you and Nexusflow, you are and will be the owner of such derivative works and modifications.
-  c. You will indemnify and hold harmless Nexusflow from and against any claim by any third party arising out of or related to your use of the Nexusflow Materials.
-6. Term and Termination. The term of this Agreement will commence upon your acceptance of this Agreement or access to the Nexusflow Materials and will continue in full force and effect until terminated in accordance with the terms and conditions herein. Nexusflow may terminate this Agreement if you are in breach of any term or condition of this Agreement. Upon termination of this Agreement, you shall delete and cease use of the Nexusflow Materials. Sections 3, 4, 5.c. (the last sentence) and 7 shall survive the termination of this Agreement.
-7. Governing Law and Jurisdiction. This Agreement will be governed and construed under the laws of the State of California without regard to choice of law principles, and the UN Convention on Contracts for the International Sale of Goods does not apply to this Agreement. The courts of California shall have exclusive jurisdiction of any dispute arising out of this Agreement.

+LLAMA 2 COMMUNITY LICENSE AGREEMENT
+Llama 2 Version Release Date: July 18, 2023
+"Agreement" means the terms and conditions for use, reproduction, distribution and
+modification of the Llama Materials set forth herein.
+"Documentation" means the specifications, manuals and documentation
+accompanying Llama 2 distributed by Meta at ai.meta.com/resources/models-and-
+libraries/llama-downloads/.
+"Licensee" or "you" means you, or your employer or any other person or entity (if
+you are entering into this Agreement on such person or entity's behalf), of the age
+required under applicable laws, rules or regulations to provide legal consent and that
+has legal authority to bind your employer or such other person or entity if you are
+entering in this Agreement on their behalf.
+"Llama 2" means the foundational large language models and software and
+algorithms, including machine-learning model code, trained model weights,
+inference-enabling code, training-enabling code, fine-tuning enabling code and other
+elements of the foregoing distributed by Meta at ai.meta.com/resources/models-and-
+libraries/llama-downloads/.
+"Llama Materials" means, collectively, Meta's proprietary Llama 2 and
+Documentation (and any portion thereof) made available under this Agreement.
+"Meta" or "we" means Meta Platforms Ireland Limited (if you are located in or, if you
+are an entity, your principal place of business is in the EEA or Switzerland) and Meta
+Platforms, Inc. (if you are located outside of the EEA or Switzerland).
+By clicking "I Accept" below or by using or distributing any portion or element of the
+Llama Materials, you agree to be bound by this Agreement.
+1. License Rights and Redistribution.
+      a. Grant of Rights. You are granted a non-exclusive, worldwide, non-
+transferable and royalty-free limited license under Meta's intellectual property or
+other rights owned by Meta embodied in the Llama Materials to use, reproduce,
+distribute, copy, create derivative works of, and make modifications to the Llama
+Materials.
+      b. Redistribution and Use.
+            i. If you distribute or make the Llama Materials, or any derivative works
+thereof, available to a third party, you shall provide a copy of this Agreement to such
+third party.
+            ii.  If you receive Llama Materials, or any derivative works thereof, from
+a Licensee as part of an integrated end user product, then Section 2 of this
+Agreement will not apply to you.
+            iii. You must retain in all copies of the Llama Materials that you
+distribute the following attribution notice within a "Notice" text file distributed as a
+part of such copies: "Llama 2 is licensed under the LLAMA 2 Community License,
+Copyright (c) Meta Platforms, Inc. All Rights Reserved."
+            iv. Your use of the Llama Materials must comply with applicable laws
+and regulations (including trade compliance laws and regulations) and adhere to the
+Acceptable Use Policy for the Llama Materials (available at
+https://ai.meta.com/llama/use-policy), which is hereby incorporated by reference into
+this Agreement.
+            v. You will not use the Llama Materials or any output or results of the
+Llama Materials to improve any other large language model (excluding Llama 2 or
+derivative works thereof).
+2. Additional Commercial Terms. If, on the Llama 2 version release date, the
+monthly active users of the products or services made available by or for Licensee,
+or Licensee's affiliates, is greater than 700 million monthly active users in the
+preceding calendar month, you must request a license from Meta, which Meta may
+grant to you in its sole discretion, and you are not authorized to exercise any of the
+rights under this Agreement unless or until Meta otherwise expressly grants you
+such rights.
+3. Disclaimer of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE
+LLAMA MATERIALS AND ANY OUTPUT AND RESULTS THEREFROM ARE
+PROVIDED ON AN "AS IS" BASIS, WITHOUT WARRANTIES OF ANY KIND,
+EITHER EXPRESS OR IMPLIED, INCLUDING, WITHOUT LIMITATION, ANY
+WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY, OR
+FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE
+FOR DETERMINING THE APPROPRIATENESS OF USING OR REDISTRIBUTING
+THE LLAMA MATERIALS AND ASSUME ANY RISKS ASSOCIATED WITH YOUR
+USE OF THE LLAMA MATERIALS AND ANY OUTPUT AND RESULTS.
+4. Limitation of Liability. IN NO EVENT WILL META OR ITS AFFILIATES BE
+LIABLE UNDER ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, TORT,
+NEGLIGENCE, PRODUCTS LIABILITY, OR OTHERWISE, ARISING OUT OF THIS
+AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT, SPECIAL,
+CONSEQUENTIAL, INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN
+IF META OR ITS AFFILIATES HAVE BEEN ADVISED OF THE POSSIBILITY OF
+ANY OF THE FOREGOING.
 5. Intellectual Property.
+      a. No trademark licenses are granted under this Agreement, and in
+connection with the Llama Materials, neither Meta nor Licensee may use any name
+or mark owned by or associated with the other or any of its affiliates, except as
+required for reasonable and customary use in describing and redistributing the
+Llama Materials.
+      b. Subject to Meta's ownership of Llama Materials and derivatives made by or
+for Meta, with respect to any derivative works and modifications of the Llama
+Materials that are made by you, as between you and Meta, you are and will be the
+owner of such derivative works and modifications.
+      c. If you institute litigation or other proceedings against Meta or any entity
+(including a cross-claim or counterclaim in a lawsuit) alleging that the Llama
+Materials or Llama 2 outputs or results, or any portion of any of the foregoing,
+constitutes infringement of intellectual property or other rights owned or licensable
+by you, then any licenses granted to you under this Agreement shall terminate as of
+the date such litigation or claim is filed or instituted. You will indemnify and hold
+harmless Meta from and against any claim by any third party arising out of or related
+to your use or distribution of the Llama Materials.
+6. Term and Termination. The term of this Agreement will commence upon your
+acceptance of this Agreement or access to the Llama Materials and will continue in
+full force and effect until terminated in accordance with the terms and conditions
+herein. Meta may terminate this Agreement if you are in breach of any term or
+condition of this Agreement. Upon termination of this Agreement, you shall delete
+and cease use of the Llama Materials. Sections 3, 4 and 7 shall survive the
+termination of this Agreement.
+7. Governing Law and Jurisdiction. This Agreement will be governed and
+construed under the laws of the State of California without regard to choice of law
+principles, and the UN Convention on Contracts for the International Sale of Goods
+does not apply to this Agreement. The courts of California shall have exclusive
+jurisdiction of any dispute arising out of this Agreement.

README.md CHANGED Viewed

@@ -1,11 +1,9 @@
 ---
-license: other
 base_model: codellama/CodeLlama-13b-Instruct-hf
 model-index:
 - name: NexusRaven-13B
   results: []
-tags:
-- function calling
 ---
 # NexusRaven-13B: Surpassing GPT-4 for Zero-shot Function Calling
 <p align="center">
@@ -37,13 +35,7 @@ Please checkout the following links!
 ## NexusRaven-V2 model usage
-NexusRaven-V2 accepts a list of python functions.
-These python functions can do anything (including sending GET/POST requests to external APIs!).
-The two requirements include the python function signature and the appropriate docstring to generate the function call.
-NexusRaven-V2 also does best on functions with arguments, so please always only provide functions that require arguments to raven.
 ### NexusRaven-V2's Capabilities
@@ -51,32 +43,11 @@ NexusRaven-V2 is capable of generating deeply nested function calls, parallel fu
 ### Quick Start Prompting Guide
-Please refer to our notebook, [How-To-Prompt.ipynb](https://colab.research.google.com/drive/19JYixRPPlanmW5q49WYi_tU8rhHeCEKW?usp=sharing), for more advanced tutorials on using NexusRaven-V2!
-1. When giving docstrings to Raven, please provide well-indented, detailed, and well-written docstrings as this can help accuracy.
-2. Raven does better when all functions provided to it has arguments, either required or optional, (i.e. ```func(dummy_arg)``` is preferred over ```func()```) as this can help accuracy.
-3. We strongly recommend to set sampling to False when prompting NexusRaven-V2.
-4. We strongly recommend a very low temperature (~0.001).
-5. We strongly recommend following the prompting style below.
-When handling irrelevant user queries, users have noticed that specifying a "no-op" function with arguments work best. For example, something like this might work:
-```python
-def no_relevant_function(user_query : str):
-  """
-  Call this when no other provided function can be called to answer the user query.
-  Args:
-     user_query: The user_query that cannot be answered by any other function calls.
-  """
-```
-Please ensure to provide an argument to this function, as Raven works best on functions with arguments.
-For parallel calls, due to the model being targeted for industry use, you can "enable" parallel calls by adding this into the prompt:
-```python
-"Setting: Allowed to issue multiple calls with semicolon\n"
-```
-This can be added above the User Query to "allow" the model to use parallel calls, otherwise, the model will focus on nested and single calls primarily.
 ### Quickstart
 You can run the model on a GPU using the following code.
@@ -147,9 +118,6 @@ Please follow this prompting template to maximize the performance of RavenV2.
 [If you currently have a workflow that is built around OpenAI's function calling and you want to try NexusRaven-V2, we have a package that helps you drop in NexusRaven-V2.](https://github.com/nexusflowai/nexusraven-pip)
-### Using With LangChain
-We've also included a [small demo for using Raven with langchain](langdemo.py)!
 ## Evaluation
@@ -166,7 +134,7 @@ For a deeper dive into the results, please see our [Github README](https://githu
 3. The explanations generated by NexusRaven-V2 might be incorrect. Please ensure proper guardrails are present to capture errant behavior.
 ## License
-This model was trained on commercially viable data and is licensed under the [Nexusflow community license](https://huggingface.co/Nexusflow/NexusRaven-V2-13B/blob/main/LICENSE.txt).
 ## References
@@ -195,4 +163,4 @@ We thank the CodeLlama team for their amazing models!
 ```
 ## Contact
-Please join our [Discord Channel](https://discord.gg/HDSVmNAs3y) to reach out for any issues and comments!

 ---
+license: llama2
 base_model: codellama/CodeLlama-13b-Instruct-hf
 model-index:
 - name: NexusRaven-13B
   results: []
 ---
 # NexusRaven-13B: Surpassing GPT-4 for Zero-shot Function Calling
 <p align="center">
 ## NexusRaven-V2 model usage
+NexusRaven-V2 accepts a list of python functions. These python functions can do anything (including sending GET/POST requests to external APIs!). The two requirements include the python function signature and the appropriate docstring to generate the function call.
 ### NexusRaven-V2's Capabilities
 ### Quick Start Prompting Guide
+Please refer to our notebook, [How-To-Prompt.ipynb](How-To-Prompt.ipynb), for more advanced tutorials on using NexusRaven-V2!
+1. We strongly recommend to set sampling to False when prompting NexusRaven-V2.
+2. We strongly recommend a very low temperature (~0.001).
+3. We strongly recommend following the prompting style below.
 ### Quickstart
 You can run the model on a GPU using the following code.
 [If you currently have a workflow that is built around OpenAI's function calling and you want to try NexusRaven-V2, we have a package that helps you drop in NexusRaven-V2.](https://github.com/nexusflowai/nexusraven-pip)
 ## Evaluation
 3. The explanations generated by NexusRaven-V2 might be incorrect. Please ensure proper guardrails are present to capture errant behavior.
 ## License
+This model was trained on commercially viable data and is licensed under the [Llama 2 community license](https://huggingface.co/codellama/CodeLlama-13b-hf/blob/main/LICENSE) following the original [CodeLlama-13b-hf](https://huggingface.co/codellama/CodeLlama-13b-hf/) model.
 ## References
 ```
 ## Contact
+Please join our [Discord Channel](https://discord.gg/HDSVmNAs3y) to reach out for any issues and comments!

langdemo.py DELETED Viewed

@@ -1,148 +0,0 @@
-from typing import List, Literal, Union
-import math
-from langchain.tools.base import StructuredTool
-from langchain.agents import (
-    Tool,
-    AgentExecutor,
-    LLMSingleActionAgent,
-    AgentOutputParser,
-)
-from langchain.schema import AgentAction, AgentFinish, OutputParserException
-from langchain.prompts import StringPromptTemplate
-from langchain.llms import HuggingFaceTextGenInference
-from langchain.chains import LLMChain
-##########################################################
-# Step 1: Define the functions you want to articulate. ###
-##########################################################
-def calculator(
-    input_a: float,
-    input_b: float,
-    operation: Literal["add", "subtract", "multiply", "divide"],
-):
-    """
-    Computes a calculation.
-    Args:
-    input_a (float) : Required. The first input.
-    input_b (float) : Required. The second input.
-    operation (string): The operation. Choices include: add to add two numbers, subtract to subtract two numbers, multiply to multiply two numbers, and divide to divide them.
-    """
-    match operation:
-        case "add":
-            return input_a + input_b
-        case "subtract":
-            return input_a - input_b
-        case "multiply":
-            return input_a * input_b
-        case "divide":
-            return input_a / input_b
-def cylinder_volume(radius, height):
-    """
-    Calculate the volume of a cylinder.
-    Parameters:
-    - radius (float): The radius of the base of the cylinder.
-    - height (float): The height of the cylinder.
-    Returns:
-    - float: The volume of the cylinder.
-    """
-    if radius < 0 or height < 0:
-        raise ValueError("Radius and height must be non-negative.")
-    volume = math.pi * (radius**2) * height
-    return volume
-#############################################################
-# Step 2: Let's define some utils for building the prompt ###
-#############################################################
-RAVEN_PROMPT = """
-{raven_tools}
-User Query: {input}<human_end>
-"""
-# Set up a prompt template
-class RavenPromptTemplate(StringPromptTemplate):
-    # The template to use
-    template: str
-    # The list of tools available
-    tools: List[Tool]
-    def format(self, **kwargs) -> str:
-        prompt = ""
-        for tool in self.tools:
-            func_signature, func_docstring = tool.description.split(" - ", 1)
-            prompt += f'\nFunction:\ndef {func_signature}\n"""\n{func_docstring}\n"""\n'
-        kwargs["raven_tools"] = prompt
-        return self.template.format(**kwargs).replace("{{", "{").replace("}}", "}")
-class RavenOutputParser(AgentOutputParser):
-    def parse(self, llm_output: str) -> Union[AgentAction, AgentFinish]:
-        # Check if agent should finish
-        if "Call:" in llm_output:
-            return AgentFinish(
-                return_values={
-                    "output": llm_output.strip()
-                    .replace("Call:", "")
-                    .strip()
-                },
-                log=llm_output,
-            )
-        else:
-            raise OutputParserException(f"Could not parse LLM output: `{llm_output}`")
-##################################################
-# Step 3: Build the agent with these utilities ###
-##################################################
-inference_server_url = "https://rjmy54al17scvxjr.us-east-1.aws.endpoints.huggingface.cloud"
-assert (
-    inference_server_url is not "<YOUR ENDPOINT URL>"
-), "Please provide your own HF inference endpoint URL!"
-llm = HuggingFaceTextGenInference(
-    inference_server_url=inference_server_url,
-    temperature=0.001,
-    max_new_tokens=400,
-    do_sample=False,
-)
-tools = [
-    StructuredTool.from_function(calculator),
-    StructuredTool.from_function(cylinder_volume),
-]
-raven_prompt = RavenPromptTemplate(
-    template=RAVEN_PROMPT, tools=tools, input_variables=["input"]
-)
-llm_chain = LLMChain(llm=llm, prompt=raven_prompt)
-output_parser = RavenOutputParser()
-agent = LLMSingleActionAgent(
-    llm_chain=llm_chain,
-    output_parser=output_parser,
-    stop=["<bot_end>"],
-    allowed_tools=tools,
-)
-agent_chain = AgentExecutor.from_agent_and_tools(agent=agent, tools=tools, verbose=True)
-call = agent_chain.run(
-    "I have a cake that is about 3 centimenters high and 200 centimeters in radius. How much cake do I have?"
-)
-print(eval(call))
-call = agent_chain.run("What is 1+10?")
-print(eval(call))

model-00001-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:14c0ff1fa640063c6084b6513fe35122dc5625f29b9af8317ee2c0a8444c7216
+size 9948933792

model-00002-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1f995c96612e274a416caa74e11718a5e7a514023357d93b24139e36e91fe8d0
+size 9904123752

model-00003-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f94fd7c010d2b7c0c801c3dd94cd3fadbe626c2076bec0cf6e3b465a41053867
+size 6179204888

model.safetensors.index.json ADDED Viewed

	@@ -0,0 +1,370 @@

+{
+    "metadata": {
+        "total_size": 26032220160
+    },
+    "weight_map": {
+        "lm_head.weight": "model-00003-of-00003.safetensors",
+        "model.embed_tokens.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.12.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.13.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.14.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.15.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.15.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.15.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.2.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.20.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.24.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.25.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.26.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.27.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.28.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.29.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.3.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.30.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.30.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.30.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.30.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.30.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.30.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.31.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.32.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.33.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.34.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.35.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.36.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.37.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.38.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.39.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.4.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.norm.weight": "model-00003-of-00003.safetensors"
+    }
+}