Kayabuki4 commited on Aug 29, 2024

Commit

84c907c

verified ·

1 Parent(s): 28d2988

Upload model

Browse files

Files changed (27) hide show

.DS_Store +0 -0
.gitattributes +1 -0
README.md +243 -0
added_tokens.json +4 -0
config.json +26 -0
configs/lmstudio/preset.json +11 -0
configs/silly_tavern/cards/LaraLightland.png +0 -0
configs/silly_tavern/cards/Seraphina.png +3 -0
configs/silly_tavern/settings_screenshot.webp +0 -0
configs/silly_tavern/v1/context_settings.json +11 -0
configs/silly_tavern/v1/instruct_mode_settings.json +17 -0
configs/silly_tavern/v2/context_settings.json +11 -0
configs/silly_tavern/v2/instruct_mode_settings.json +17 -0
example/interactive.py +129 -0
example/prompt/__init__.py +0 -0
example/prompt/format.py +96 -0
example/simple.py +132 -0
generation_config.json +7 -0
model-00001-of-00003.safetensors +3 -0
model-00002-of-00003.safetensors +3 -0
model-00003-of-00003.safetensors +3 -0
model.safetensors.index.json +298 -0
pytorch_model.bin.index.json +298 -0
special_tokens_map.json +39 -0
tokenizer.json +0 -0
tokenizer.model +3 -0
tokenizer_config.json +62 -0

.DS_Store ADDED Viewed

Binary file (6.15 kB). View file

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+configs/silly_tavern/cards/Seraphina.png filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,243 @@

+---
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- unsloth
+- axolotl
+license: cc-by-nc-nd-4.0
+---
+# DreamGen Opus V1
+<div style="display: flex; flex-direction: row; align-items: center;">
+<img src="/dreamgen/opus-v1.2-7b/resolve/main/images/logo-1024.png" alt="model logo" style="
+    border-radius: 12px;
+    margin-right: 12px;
+    margin-top: 0px;
+    margin-bottom: 0px;
+    max-width: 100px;
+    height: auto;
+"/>
+Models for **(steerable) story-writing and role-playing**.
+<br/>[All Opus V1 models, including quants](https://huggingface.co/collections/dreamgen/opus-v1-65d092a6f8ab7fc669111b31).
+</div>
+## Resources
+- [**Opus V1 prompting guide**](https://dreamgen.com/docs/models/opus/v1) with many (interactive) examples and prompts that you can copy.
+- [**Google Colab**](https://colab.research.google.com/drive/1J178fH6IdQOXNi-Njgdacf5QgAxsdT20?usp=sharing) for interactive role-play using `opus-v1.2-7b`.
+- [Python code](example/prompt/format.py) to format the prompt correctly.
+- Join the community on [**Discord**](https://dreamgen.com/discord) to get early access to new models.
+<img src="/dreamgen/opus-v1.2-7b/resolve/main/images/story_writing.webp" alt="story writing on dreamgen.com" style="
+    padding: 12px;
+    border-radius: 12px;
+    border: 2px solid #f9a8d4;
+    background: rgb(9, 9, 11);
+"/>
+## Prompting
+<details>
+<summary>The models use an extended version of ChatML.</summary>
+```
+<|im_start|>system
+(Story description in the right format here)
+(Typically consists of plot description, style description and characters)<|im_end|>
+<|im_start|>user
+(Your instruction on how the story should continue)<|im_end|>
+<|im_start|>text names= Alice
+(Continuation of the story from the Alice character)<|im_end|>
+<|im_start|>text
+(Continuation of the story from no character in particular (pure narration))<|im_end|>
+<|im_start|>user
+(Your instruction on how the story should continue)<|im_end|>
+<|im_start|>text names= Bob
+(Continuation of the story from the Bob character)<|im_end|>
+```
+The Opus V1 extension is the addition of the `text` role, and the addition / modification of role names.
+Pay attention to the following:
+- The `text` messages can (but don't have to have) `names`, names are used to indicate the "active" character during role-play.
+- There can be multiple subsequent message with a `text` role, especially if names are involved.
+- There can be multiple names attached to a message.
+- The format for names is `names= {{name[0]}}; {{name[1]}}`, beware of the spaces after `names=` and after the `;`. This spacing leads to most natural tokenization for the names.
+</details>
+While the main goal for the models is great story-writing and role-playing performance, the models are also capable of several writing related tasks as well as general assistance.
+Here's how you can prompt the model for the following tasks
+### Steerable Story-writing and Role-playing:
+- [Story-writing prompt guide](https://dreamgen.com/docs/models/opus/v1#task-story-writing)
+- [Role-playing prompt guide](https://dreamgen.com/docs/models/opus/v1#task-role-playing)
+- Input:
+  - System prompt: You provide story / role-play description, which consists of:
+    - Plot description
+    - Style description
+    - Characters and their descriptions
+  - Conversation turns:
+    - Text / message turn: This represents part of the story or role play
+    - Instruction: This tells the model what should happen next
+- Output: Continuation of the story / role-play.
+### Story plot summarization
+- [Prompt guide](https://dreamgen.com/docs/models/opus/v1#task-plot-description)
+- Input: A story, or a few chapters of a story.
+- Output: A description of the story or chapters.
+#### Examples:
+<details>
+  <summary>Summary of <a href="https://www.libraryofshortstories.com/onlinereader/the-last-incantation">short fiction story</a> in 50 words</summary>
+  <blockquote>
+  Malygris, a powerful magician, summons his long-dead lover Nylissa from the grave using necromancy. However, upon seeing her, he realizes that she is not the same person he once knew, leading him to question the nature of love, memory, and the power of magic.
+  </blockquote>
+</details>
+<details>
+  <summary>Summary of <a href="https://www.libraryofshortstories.com/onlinereader/the-last-incantation">short fiction story</a> in 200 words</summary>
+  <blockquote>
+The story is a dark fantasy set in the city of Susran, where an ancient and powerful magician named Malygris dwells in a tower atop a conical hill. Malygris, who has mastered all forms of sorcery and necromancy, is plagued by a deep sense of ennui and melancholy. He seeks solace in the memory of Nylissa, a girl he once loved in his youth.
+One day, Malygris decides to summon Nylissa's spirit from the grave using his necromantic powers. He performs the necessary rituals and incantations, and Nylissa appears before him as a ghostly apparition. However, as Malygris gazes upon her, he becomes increasingly disillusioned. He notices subtle changes in her appearance and behavior that do not match his memories of her. He begins to question whether this is truly the same Nylissa he once knew, or if it is just another illusion conjured by his magic.
+As Malygris grapples with this uncertainty, he becomes increasingly despondent and disillusioned with life itself. He realizes that even his magic cannot bring back the past or make him truly happy. In the end, he dismisses Nylissa's spirit and returns to his lonely tower, resigned to his fate as an old and weary sorcerer.
+  </blockquote>
+</details>
+<details>
+  <summary>Summary of <a href="https://edition.cnn.com/2024/02/29/health/pets-diseases-viruses-bacteria-wellness/index.html">CNN News article</a> in 150 words</summary>
+  <blockquote>
+The article discusses the recent human deaths from diseases transmitted by animals, specifically Alaskapox and bubonic plague. Dr. Leana Wen, an emergency physician and former Baltimore health commissioner, explains the various ways household pets can transmit diseases to humans. These include fecal-oral transmission, scratches and bites, and vector-borne diseases like ticks, fleas, and mosquitoes. Wen emphasizes the importance of vaccinating pets, maintaining cleanliness, and taking commonsense precautions to reduce disease transmission. She also advises that individuals who are immunocompromised, babies, and pregnant women should take additional care when interacting with animals.
+  </blockquote>
+</details>
+<details>
+  <summary>Summary of <a href="https://edition.cnn.com/2024/02/29/health/pets-diseases-viruses-bacteria-wellness/index.html">CNN News article</a> in 50 words</summary>
+  <blockquote>
+A doctor discusses the risks of diseases transmitted from animals to humans, focusing on household pets like cats and dogs. They emphasize the importance of vaccination, hygiene, and regular veterinary care to prevent these illnesses.
+  </blockquote>
+</details>
+### Story character description
+- [Prompt guide](https://dreamgen.com/docs/models/opus/v1#task-char-description)
+- Input: A story, or a few chapters of a story, set of characters.
+- Output: A description of the characters.
+#### Examples:
+<details>
+  <summary>Description of the Malygris character from <a href="https://www.libraryofshortstories.com/onlinereader/the-last-incantation">this story</a> in 50 words</summary>
+  <blockquote>
+Malygris is an ancient and powerful magician, characterized by his long white beard, cold green eyes, and a violet robe. He is a master of necromancy and possesses a vast knowledge of forbidden magic. Despite his immense power, he is plagued by a deep sense of melancholy and ennui.
+  </blockquote>
+</details>
+<details>
+  <summary>Description of the prince character from <a href="https://www.libraryofshortstories.com/onlinereader/the-pearl-of-love">this story</a> in 50 words</summary>
+  <blockquote>
+The prince is a devoted and determined individual, driven by his love for his late queen. He is described as a graceful youth in the beginning, but as he grows older, his dedication to the Pearl of Love transforms him into a grave and intent man. His sense of color and design evolves over time, reflecting his growing maturity and artistic vision.
+  </blockquote>
+</details>
+### Story style description
+- [Prompt guide](https://dreamgen.com/docs/models/opus/v1#task-style-description)
+- Input: A story, or a few chapters of a story.
+- Output: A description the style of the story.
+<details>
+  <summary>Style description of <a href="https://www.libraryofshortstories.com/onlinereader/the-last-incantation">this story</a></summary>
+  <blockquote>
+The story is written in a descriptive and evocative style, with a focus on the atmosphere and setting. The author uses vivid and detailed language to create a sense of place and time, and employs a variety of literary devices such as similes, metaphors, and allusions to enhance the mood and tone of the narrative. The sentence structure is varied, with a mix of short and long sentences that create a rhythmic flow to the prose.
+  </blockquote>
+</details>
+<details>
+  <summary>Style description of <a href="https://www.libraryofshortstories.com/onlinereader/the-pearl-of-love">this story</a></summary>
+  <blockquote>
+The writing style is descriptive and evocative, with a focus on the beauty and grandeur of the Pearl of Love. The author uses vivid imagery and sensory details to create a rich and immersive atmosphere. The tone is reverential and contemplative, reflecting the prince's deep love for his queen and his dedication to creating a lasting monument to her memory.
+  </blockquote>
+</details>
+### Story description to chapters
+- [Prompt guide](https://dreamgen.com/docs/models/opus/v1#task-story-description-to-chapter-descriptions)
+- Input: A brief plot description and the desired number of chapters.
+- Output: A description for each chapter.
+### And more...
+## Sampling params
+For story-writing and role-play, I recommend "Min P" based sampling with `min_p` in the range `[0.01, 0.1]` and with `temperature` in the range `[0.5, 1.5]`, depending on your preferences. A good starting point would be `min_p=0.1; temperature=0.8`.
+You may also benefit from setting presence, frequency and repetition penalties, especially at lower temperatures.
+## Dataset
+The fine-tuning dataset consisted of ~100M tokens of steerable story-writing, role-playing, writing-assistant and general-assistant examples. Each example was up to 31000 tokens long.
+All story-writing and role-playing examples were based on human-written text.
+![token count distribution](images/token_count_cum__token_bucket.png)
+## Running the model
+The model is should be compatible with any software that supports the base model, but beware of prompting and tokenization.
+I recommend using these model versions:
+- 7B: [no quant (opus-v1.2-7b)](https://huggingface.co/dreamgen/opus-v1.2-7b)
+- 34B: [no quant (opus-v1-34b)](https://huggingface.co/dreamgen/opus-v1-34b) or [awq (opus-v1-34b-awq)](https://huggingface.co/dreamgen/opus-v1-34b-awq)
+- 34B: [no quant (opus-v1.2-70b)](https://huggingface.co/dreamgen/opus-v1.2-70b) or [awq (opus-v1.2-70b-awq)](https://huggingface.co/dreamgen/opus-v1.2-70b-awq)
+### Running on DreamGen.com (free)
+You can run the models on [dreamgen.com](https://dreamgen.com) for free — you can use the built-in UI for story-writing & role-playing, or use [the API](https://dreamgen.com/docs/api).
+### Running Locally
+- **Make sure your prompt is as close as possible to the Opus V1**
+  - Regardless of which backend you use, it's important that you format your prompt well and that the tokenization works correctly.
+  - [Read the prompt guide](https://dreamgen.com/docs/models/opus/v1)
+  - [Read the prompt formatting code](example/prompt/format.py)
+  - Make sure `<|im_start|>` and `<|im_end|>` are tokenized correctly
+- **vLLM**
+  - [**Google Colab**](https://colab.research.google.com/drive/1J178fH6IdQOXNi-Njgdacf5QgAxsdT20?usp=sharing): This is a simple interactive Google Colab to do role-play with the 7B model, it should fit on the T4 GPU.
+  - [Code](example/prompt/interactive.py): This is simple script for interactive chat for one hard-coded scenario.
+- **SillyTavern**
+  - [Official SillyTavern documentation for DreamGen](https://docs.sillytavern.app/usage/api-connections/dreamgen/) -- applies to both the API an local models
+  - SillyTavern (staging) comes with built-in DreamGen preset for RP
+    - Other presets can be found [here](https://huggingface.co/dreamgen/opus-v1.2-7b/tree/main/configs/silly_tavern), v2 kindly provided by @MarinaraSpaghetti
+    - Make sure to unselect `Skip special tokens`, otherwise it won't work
+    - This is just an attempt at approximating the Opus V1 prompt, it won't be perfect
+  - Character cards specifically rewritten for the built-in DreamGen preset:
+    - [Seraphina](configs/silly_tavern/cards/Seraphina.png) (based on the default Seraphina card)
+    - [Lara Lightland](configs/silly_tavern/cards/LaraLightland.png) (based on the card by Deffcolony)
+- **LM Studio**
+  - [Config](configs/lmstudio/preset.json)
+  - Just like ChatML, just changed "assistant" to "text" role.
+  - **There's a bug** in LM Studio if you delete a message or click "Continue", [see here for details](https://discord.com/channels/1110598183144399058/1212665261128417280/1212665261128417280).
+- **HuggingFace**
+  - [Chat template](tokenizer_config.json#L51)
+  - Just like ChatML, just changed "assistant" to "text" role.
+## Known Issues
+- **34B repetition**:
+  - The 34B sometimes gets stuck repeating the same word, or synonyms. This seems to be a common problem across various Yi 34B fine-tunes.
+- **GGUF**:
+  - The tokenization might be messed up. Some users reported that `<|im_start|>` and `<|im_end|>` are tokenized as multiple tokens. Also llama.cpp may not tokenize correctly (the Yi tokenizer is subtly different from the Llama 2 tokenizer).
+## License
+- This model is intended for personal use only, other use is not permitted.

added_tokens.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "<|im_end|>": 32001,
+  "<|im_start|>": 32000
+}

config.json ADDED Viewed

	@@ -0,0 +1,26 @@

+{
+  "_name_or_path": "mistralai/Mistral-7B-Instruct-v0.2",
+  "architectures": [
+    "MistralForCausalLM"
+  ],
+  "attention_dropout": 0.0,
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "hidden_act": "silu",
+  "hidden_size": 4096,
+  "initializer_range": 0.02,
+  "intermediate_size": 14336,
+  "max_position_embeddings": 32768,
+  "model_type": "mistral",
+  "num_attention_heads": 32,
+  "num_hidden_layers": 32,
+  "num_key_value_heads": 8,
+  "rms_norm_eps": 1e-05,
+  "rope_theta": 1000000.0,
+  "sliding_window": null,
+  "tie_word_embeddings": false,
+  "torch_dtype": "bfloat16",
+  "transformers_version": "4.38.0.dev0",
+  "use_cache": false,
+  "vocab_size": 32002
+}

configs/lmstudio/preset.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+  "name": "OpusV1StoryWriting",
+  "inference_params": {
+    "input_prefix": "<|im_end|>\n<|im_start|>user\n",
+    "input_suffix": "<|im_end|>\n<|im_start|>text\n",
+    "antiprompt": ["<|im_start|>", "<|im_end|>"],
+    "pre_prompt_prefix": "<|im_start|>system\n",
+    "pre_prompt_suffix": "",
+    "pre_prompt": "You are an intelligent, skilled, versatile writer.\n\nYour task is to write a story based on the information below.\n\n## Overall plot description:\n\n"
+  }
+}

configs/silly_tavern/cards/LaraLightland.png ADDED Viewed

configs/silly_tavern/cards/Seraphina.png ADDED Viewed

Git LFS Details

SHA256: 17520ffc2eec5a52a74a11769c6cedad48e4f4949dda89ad2eff95afda20c85d
Pointer size: 132 Bytes
Size of remote file: 1.39 MB

configs/silly_tavern/settings_screenshot.webp ADDED Viewed

configs/silly_tavern/v1/context_settings.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+  "story_string": "<|im_start|>system\nYou are an intelligent, skilled, versatile writer.\n\nYour task is to write a story based on the information below.\n\n\n## Overall plot description:\n\n{{scenario}}\n\n{{description}}\n\n\n## Characters:\n\n### {{char}}\n{{personality}}\n{{persona}}<|im_end|>",
+  "example_separator": "",
+  "chat_start": "",
+  "use_stop_strings": false,
+  "always_force_name2": false,
+  "trim_sentences": false,
+  "include_newline": false,
+  "single_line": false,
+  "name": "ChatMLOpusV1_ST2"
+}

configs/silly_tavern/v1/instruct_mode_settings.json ADDED Viewed

	@@ -0,0 +1,17 @@

+{
+  "system_prompt": "",
+  "input_sequence": "<|im_start|>text names= {{user}}\n",
+  "output_sequence": "<|im_end|>\n<|im_start|>text names= {{char}}\n",
+  "first_output_sequence": "",
+  "last_output_sequence": "",
+  "system_sequence_prefix": "",
+  "system_sequence_suffix": "",
+  "stop_sequence": "",
+  "separator_sequence": "<|im_end|>\n",
+  "wrap": false,
+  "macro": true,
+  "names": false,
+  "names_force_groups": false,
+  "activation_regex": "",
+  "name": "ChatMLOpusV1_ST1"
+}

configs/silly_tavern/v2/context_settings.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+  "story_string": "<|im_start|>system\n{{#if system}}{{system}}\n\n\n{{/if}}## Overall plot description:\n\n{{#if wiBefore}}{{wiBefore}}\n\n{{/if}}{{#if scenario}}{{scenario}}\n\n\n{{/if}}## Characters:\n\n### {{char}}\n{{#if description}}{{description}}\n{{/if}}{{#if personality}}{{personality}}\n\n{{/if}}### {{user}}\n{{#if persona}}{{persona}}\n\n{{/if}}{{#if wiAfter}}{{wiAfter}}\n\n{{/if}}{{#if mesExamples}}## {{char}}'s example message:\n\n{{mesExamples}}{{/if}}",
+  "example_separator": "",
+  "chat_start": "",
+  "use_stop_strings": false,
+  "always_force_name2": false,
+  "trim_sentences": true,
+  "include_newline": false,
+  "single_line": false,
+  "name": "ChatMLOpusV1_ST2"
+}

configs/silly_tavern/v2/instruct_mode_settings.json ADDED Viewed

	@@ -0,0 +1,17 @@

+{
+  "system_prompt": "You are an intelligent, skilled, versatile writer.\n\nYour task is to write a role-play based on the information below.\n\n\n## Style description:\n\nThis role-play is written as a third-person introspective narrative in past tense. Scenes are described vividly, with great detail.",
+  "input_sequence": "<|im_end|>\n<|im_start|>text names= {{user}}\n",
+  "output_sequence": "<|im_end|>\n<|im_start|>text names= {{char}}\n",
+  "first_output_sequence": "",
+  "last_output_sequence": "<|im_end|>\n<|im_start|>user\nLength: 400 words\n{{char}} replies to {{user}} in detailed and elaborate way.<|im_end|>\n<|im_start|>text names= {{char}}\n",
+  "system_sequence_prefix": "",
+  "system_sequence_suffix": "",
+  "stop_sequence": "",
+  "separator_sequence": "",
+  "wrap": false,
+  "macro": true,
+  "names": false,
+  "names_force_groups": false,
+  "activation_regex": "",
+  "name": "ChatMLOpusV1_ST2"
+}

example/interactive.py ADDED Viewed

	@@ -0,0 +1,129 @@

+# python interactive.py
+# %%
+import fileinput
+from vllm import LLM, SamplingParams
+from prompt.format import (
+    format_opus_v1_prompt,
+    OpusV1Character,
+    OpusV1Prompt,
+    OpusV1StorySystemPrompt,
+    OpusV1Turn,
+)
+# %%
+def main():
+    sampling_params = SamplingParams(
+        # I usually stay between 0.0 and 1.0, especially for the Yi models I found lower tends to be better.
+        # For assistant tasks, I usually use 0.0.
+        temperature=0.8,
+        min_p=0.05,
+        presence_penalty=0.1,
+        frequency_penalty=0.1,
+        repetition_penalty=1.1,
+        max_tokens=200,
+        ignore_eos=True,
+        skip_special_tokens=False,
+        spaces_between_special_tokens=False,
+        stop=["<|im_end|>"],
+        include_stop_str_in_output=False,
+    )
+    # Set max_model_len to fit in memory.
+    model = LLM(
+        "dreamgen/opus-v1.2-7b",
+        max_model_len=2000,
+        enforce_eager=True,
+        swap_space=0,
+        gpu_memory_utilization=0.85,
+    )
+    plot_description = """
+This is a fanfiction from the Harry Potter universe. In this alternate reality, Harry Potter is evil and secretly siding with Slytherin.
+Up until now, Harry was pretending to be friends with Hermione and Ron, that changes when he invites Hermione to his chambers where he tricks her to drink Amorentia, the most powerful love potion.
+"""
+    char1 = OpusV1Character(
+        name="Harry Potter",
+        description="""Harry Potter in this fanfiction is secretly a member of Slytherin and is using his powers for evil rather than for good. Up until now, he was pretending to be friends with Hermione and Ron.""",
+    )
+    char2 = OpusV1Character(
+        name="Hermione Granger",
+        description="""Hermione appears just like in the original books.""",
+    )
+    story_prompt = OpusV1StorySystemPrompt(
+        plot_description=plot_description,
+        style_description="",
+        characters=[char1, char2],
+    )
+    turns = [
+        OpusV1Turn(
+            role="user",
+            content="""Harry invites Hermione into his chamber and offers her water, which Hermione happily accepts and drinks.""".strip(),
+        ),
+        OpusV1Turn(
+            role="text",
+            names=[char1.name],
+            content="""“Come in,” said Harry, waving at the doorway behind Hermione’s back.""".strip(),
+        ),
+    ]
+    def run():
+        turns.append(OpusV1Turn(role="text", content="", names=[char2.name], open=True))
+        prompt = OpusV1Prompt(story=story_prompt, turns=turns)
+        output = model.generate(
+            format_opus_v1_prompt(prompt), sampling_params, use_tqdm=False
+        )
+        response = OpusV1Turn(
+            role="text", content=output[0].outputs[0].text.strip(), names=[char2.name]
+        )
+        turns.append(response)
+        print(pretty_turn(response), flush=True)
+        print(f"[{char1.name}]: ", end="", flush=True)
+    print("## Plot description:\n")
+    print(plot_description.strip() + "\n\n")
+    for turn in turns:
+        print(pretty_turn(turn))
+    run()
+    for line in fileinput.input():
+        line = line.strip()
+        if line.startswith("/ins"):
+            content = line[4:].strip()
+            role = "user"
+            names = []
+        else:
+            content = line
+            role = "text"
+            names = [char1.name]
+        turns.append(OpusV1Turn(role=role, content=content, names=names))
+        run()
+def pretty_turn(turn):
+    if turn.role == "user":
+        return f"/ins {turn.content.strip()}"
+    else:
+        if len(turn.names) > 0:
+            return f"[{turn.names[0]}]: {turn.content.strip()}"
+        else:
+            return turn.content.strip()
+main()

example/prompt/__init__.py ADDED Viewed

File without changes

example/prompt/format.py ADDED Viewed

	@@ -0,0 +1,96 @@

+# %%
+from typing import Optional, List
+from dataclasses import field, dataclass
+@dataclass
+class OpusV1Turn:
+    role: str
+    content: str
+    names: List[str] = field(default_factory=list)
+    # If set to true, will not append <|im_end|>, so the model will continue the turn.
+    # In RP you can for example use the following to force a specific character response:
+    # role="text"
+    # names=["Jack"]
+    # open="true"
+    open: bool = False
+@dataclass
+class OpusV1Character:
+    name: str
+    description: str
+@dataclass
+class OpusV1StorySystemPrompt:
+    format: str = "prose"
+    plot_description: str = ""
+    style_description: str = ""
+    characters: List[OpusV1Character] = field(default_factory=list)
+@dataclass
+class OpusV1Prompt:
+    story: Optional[OpusV1StorySystemPrompt] = None
+    turns: List[OpusV1Turn] = field(default_factory=list)
+def format_opus_v1_prompt(prompt) -> str:
+    turns = prompt.turns
+    if prompt.story is not None:
+        system = format_opus_v1_system_prompt(prompt.story)
+        turns = [OpusV1Turn(role="system", content=system)] + turns
+    parts = []
+    for i, turn in enumerate(turns):
+        assert turn.role in ["user", "text", "system", "assistant"]
+        assert turn.role != "system" or i == 0
+        is_last = i == len(turns) - 1
+        open = is_last and turn.open
+        parts.append(format_turn(turn.role, turn.content, turn.names, open=open))
+    return "".join(parts)
+def format_turn(
+    role: str, content: str, names: List[str] = [], open: bool = False
+) -> str:
+    im_start = "<|im_start|>"
+    im_end = "<|im_end|>"
+    body = im_start + role
+    if len(names) > 0:
+        body += f" names= {'; '.join(names)}"
+    body += "\n"
+    if open:
+        return body + content.lstrip()
+    else:
+        return body + content.strip() + im_end + "\n"
+def format_opus_v1_system_prompt(prompt) -> str:
+    format_text = "story" if prompt.format == "prose" else "role-play"
+    system = f"""
+You are an intelligent, skilled, versatile writer.
+Your task is to write a {format_text} based on the information below.
+Write the {format_text} as if it's a book.
+    """.strip()
+    if len(prompt.plot_description) > 0:
+        system += "\n\n\n## Plot description:\n\n"
+        system += prompt.plot_description.strip()
+    if len(prompt.style_description) > 0:
+        system += "\n\n\n## Style description:\n\n"
+        system += prompt.style_description.strip()
+    if len(prompt.characters) > 0:
+        system += "\n\n\n## Characters:\n\n"
+        for character in prompt.characters:
+            system += f"### {character.name}\n\n"
+            system += character.description.strip()
+            system += "\n\n"
+    return system.strip()

example/simple.py ADDED Viewed

	@@ -0,0 +1,132 @@

+# python simple.py
+# %%
+from vllm import LLM, SamplingParams
+from prompt.format import (
+    format_opus_v1_prompt,
+    OpusV1Character,
+    OpusV1Prompt,
+    OpusV1StorySystemPrompt,
+    OpusV1Turn,
+)
+# %%
+def build_story_prompt() -> OpusV1Prompt:
+    plot_description = """
+This is a fanfiction from the Harry Potter universe. In this alternate reality, Harry Potter is evil and secretly siding with Slytherin.
+Up until now, Harry was pretending to be friends with Hermione and Ron, that changes when he invites Hermione to his chambers where he tricks her to drink Amorentia, the most powerful love potion.
+"""
+    harry_description = """
+Harry Potter in this fanfiction is secretly a member of Slytherin and is using his powers for evil rather than for good. Up until now, he was pretending to be friends with Hermione and Ron.
+"""
+    hermione_description = """
+Hermione appears just like in the original books.
+"""
+    story_prompt = OpusV1StorySystemPrompt(
+        plot_description=plot_description,
+        style_description="",
+        characters=[
+            OpusV1Character(name="Harry Potter", description=harry_description),
+            OpusV1Character(name="Hermione Granger", description=hermione_description),
+        ],
+    )
+    return OpusV1Prompt(
+        story=story_prompt,
+        turns=[
+            OpusV1Turn(
+                role="user",
+                content="""
+The story starts with Harry welcoming Hermione into his chambers, who he invited there earlier that day. He offers her water to drink, but it contains a love potion.
+    """.strip(),
+            ),
+            OpusV1Turn(
+                role="text",
+                content="""
+“Come in,” said Harry, waving at the doorway behind Hermione’s back.
+“Hello?” she said, stepping inside, “what did you want me to come up here for?”
+“Well, I thought we could get away from all the noise down there, have a chat about what we plan to do for Christmas…” Harry said, fumbling for words. He had never really been any good with girls. “But anyway, please, take a seat and let me get us some water!” he said, darting over to the sideboard.
+He returned quickly with two glasses of water. Hermione took hers and thanked him, taking in a big gulp. As soon as she swallowed, Harry saw her eyes widen as her heart began beating wildly in her chest.
+It worked! Harry thought, grinning to himself. Amorentia truly was the world’s best love potion, its effects lasting twice as long and being five times stronger.
+    """.strip(),
+                open=True,
+            ),
+        ],
+    )
+def build_assistant_prompt() -> OpusV1Prompt:
+    return OpusV1Prompt(
+        turns=[
+            OpusV1Turn(
+                role="system",
+                content="You are an intelligent, knowledgeable, helpful, general-purpose assistant.",
+            ),
+            OpusV1Turn(
+                role="user",
+                content="Give me a sentence where every word begins with 'S'",
+            ),
+        ]
+    )
+# %%
+def main():
+    sampling_params = SamplingParams(
+        # I usually stay between 0.0 and 1.0, especially for the Yi models I found lower tends to be better.
+        # For assistant tasks, I usually use 0.0.
+        temperature=0.0,
+        min_p=0.05,
+        presence_penalty=0.1,
+        frequency_penalty=0.1,
+        repetition_penalty=1.1,
+        max_tokens=200,
+        ignore_eos=True,
+        skip_special_tokens=False,
+        spaces_between_special_tokens=False,
+    )
+    # Set max_model_len to fit in memory.
+    model = LLM(
+        "dreamgen/opus-v1.2-7b",
+        max_model_len=2000,
+        enforce_eager=True,
+        swap_space=0,
+        gpu_memory_utilization=0.85,
+    )
+    story_prompt = build_story_prompt()
+    print(format_opus_v1_prompt(story_prompt))
+    output = model.generate(format_opus_v1_prompt(story_prompt), sampling_params)
+    print(output[0].outputs[0].text)
+    # Expected:
+    """
+    It would make her fall deeply in love with him, and then he could use her to get what he wanted.
+    “Harry, what’s going on? You look so happy!” Hermione asked, smiling at him.
+    “Oh, well, I guess I am,” Harry replied, trying not to laugh. “I mean, I’ve always known that you were the one for me.”
+    “Really?” Hermione asked, blushing slightly. “I didn’t know that.”
+    “Yeah, I’ve always had feelings for you,” Harry said, leaning forward and placing his hand on top of hers. “And now that I’ve got you alone, I can finally tell you how much I care about you.”
+    """
+main()

generation_config.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 1,
+  "do_sample": true,
+  "eos_token_id": 2,
+  "transformers_version": "4.38.0.dev0"
+}

model-00001-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:be416417ca5c4df52ab8a75181d79537eb02b54d327553e47f4f869930252876
+size 4943178720

model-00002-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:caf454cd551ccadf92929e442ec07dd86354bddf8c582eb6bda760b8c475d3d7
+size 4999819336

model-00003-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:442d781228430f36c8b4d98f8df7a5dcc8ea04d0bcd53c1d9c7402e138c2e7b2
+size 4540532728

model.safetensors.index.json ADDED Viewed

	@@ -0,0 +1,298 @@

+{
+    "metadata": {
+        "total_size": 14483496960
+    },
+    "weight_map": {
+        "lm_head.weight": "model-00003-of-00003.safetensors",
+        "model.embed_tokens.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.0.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.1.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.10.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.10.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.10.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.10.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.11.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.11.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.12.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.13.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.14.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.15.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.16.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.17.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.18.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.19.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.2.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.2.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.20.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.20.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.input_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.21.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.22.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.22.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.22.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.22.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.22.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.22.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
+        "model.layers.23.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.23.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.24.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.25.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.26.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.27.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.28.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.29.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.3.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.3.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.30.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.30.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.input_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.mlp.down_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.mlp.gate_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.mlp.up_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.post_attention_layernorm.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.k_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.31.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
+        "model.layers.4.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.4.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.5.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.6.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.7.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.8.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.input_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
+        "model.layers.9.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
+        "model.norm.weight": "model-00003-of-00003.safetensors"
+    }
+}

pytorch_model.bin.index.json ADDED Viewed

	@@ -0,0 +1,298 @@

+{
+  "metadata": {
+    "total_size": 14483496960
+  },
+  "weight_map": {
+    "lm_head.weight": "pytorch_model-00003-of-00003.bin",
+    "model.embed_tokens.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.0.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.1.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.10.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.10.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.10.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.10.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.10.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.10.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.10.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.10.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.10.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.11.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.11.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.12.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.13.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.14.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.15.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.16.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.17.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.18.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.19.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.2.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.2.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.20.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.20.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.input_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.mlp.down_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.mlp.gate_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.mlp.up_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.post_attention_layernorm.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.21.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.22.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.22.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.22.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.22.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.22.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.22.self_attn.k_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.22.self_attn.o_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.22.self_attn.q_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.22.self_attn.v_proj.weight": "pytorch_model-00002-of-00003.bin",
+    "model.layers.23.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.23.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.24.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.25.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.26.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.27.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.28.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.29.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.3.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.3.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.30.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.30.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.input_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.mlp.gate_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.mlp.up_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.post_attention_layernorm.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.self_attn.k_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.self_attn.o_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.self_attn.q_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.31.self_attn.v_proj.weight": "pytorch_model-00003-of-00003.bin",
+    "model.layers.4.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.4.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.5.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.6.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.7.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.8.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.input_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.mlp.down_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.mlp.gate_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.mlp.up_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.post_attention_layernorm.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.self_attn.k_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.self_attn.o_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.layers.9.self_attn.v_proj.weight": "pytorch_model-00001-of-00003.bin",
+    "model.norm.weight": "pytorch_model-00003-of-00003.bin"
+  }
+}

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,39 @@

+{
+  "additional_special_tokens": [
+    {
+      "content": "<|im_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false
+    },
+    {
+      "content": "<|im_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false
+    }
+  ],
+  "bos_token": {
+    "content": "<s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "</s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer.model ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dadfd56d766715c61d2ef780a525ab43b8e6da4de6865bda3d95fdef5e134055
+size 493443

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,62 @@

+{
+  "add_bos_token": true,
+  "add_eos_token": false,
+  "added_tokens_decoder": {
+    "0": {
+      "content": "<unk>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "1": {
+      "content": "<s>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "2": {
+      "content": "</s>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "32000": {
+      "content": "<|im_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "32001": {
+      "content": "<|im_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "additional_special_tokens": [
+    "<|im_start|>",
+    "<|im_end|>"
+  ],
+  "bos_token": "<s>",
+  "chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% for message in messages %}{{'<|im_start|>'}}{% if message['role']=='assistant' %}{{'text'}}{% else %}{{message['role']}}{% endif %}{{'\n' + message['content'] + '<|im_end|>' + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>text\n' }}{% endif %}",
+  "clean_up_tokenization_spaces": false,
+  "eos_token": "</s>",
+  "legacy": true,
+  "model_max_length": 1000000000000000019884624838656,
+  "pad_token": null,
+  "sp_model_kwargs": {},
+  "spaces_between_special_tokens": false,
+  "tokenizer_class": "LlamaTokenizer",
+  "unk_token": "<unk>",
+  "use_default_system_prompt": false
+}